Leigh Finch in Stackademic | eBPF Tracepoints: Gaining Access to the TCP State Machine | My current research focus at UTS is around the inner workings of TCP Congestion Control, which as you might guess requires some detailed… | Feb 5
Leigh Finch in Stackademic | XDP and eBPF for Network Observability with Python | I’ve been playing with XDP and eBPF in my lab to see if it might be possible to create NetFlow/IPFIX style flow logs for network… | Jan 17
Leigh Finch in Stackademic | XDP and eBPF for Network Observability with Python | I’ve been playing with XDP and eBPF in my lab to see if it might be possible to create NetFlow/IPFIX style flow logs for network… | Jan 14
Leigh Finch | Alert Fatigue: Why Too Many Alerts Can be Disastrous! | Alert fatigue is a problem I’ve encountered so many times in IT Operations, especially as monitoring sprawl increases the number of tools… | Dec 28, 2023
Leigh Finch | Observability Round Up Late December 2023 | December is often a quiet period as we wind down into the holidays, but in the last 24 hours we’ve seen an announcement from Cisco that it… | Dec 22, 2023
Leigh Finch | Implementing Enterprise Observability for Success Review | Implementing Enterprise Observability for Success by Manisha Agrawal and Karun Krishnannair takes a novel approach to implementing… | Dec 13, 2023
Leigh Finch | Termshark: Command Line Wireshark for the Win! | I was recently working on a headless server trying to troubleshoot an issue with Linux Bridging and IPTables and needed to understand where… | Dec 6, 2023
Leigh Finch | Mastering Python Networking Review | I came across Mastering Python Networking by Eric Chou about a month ago on Twitter and immediately purchased it. I was excited to see book… | Nov 28, 2023
Leigh Finch | SRE: Five Ways to Build a Blameless Culture | One of the main pillars of SRE (Site Reliability Engineering) is to introduce a blameless culture, however, building this takes more than… | Nov 24, 2023
Leigh Finch | Performance Diagnostics Part 6: 5 SRE Practices to Minimise Toil | to minimise toil to increase resiliency and improve digital experience. SRE (Site Reliability Engineering) is a practice that Google… | Nov 20, 2023