Microsoft Azure Linux is looking to build open-source observability for cloud-native and Azure workloads by maintaining Inspektor Gadget (CNCF), which transforms low-level Linux kernel signals into actionable insights across AKS and other open-source and Microsoft platforms. The team is seeking a Senior Software Engineer to make deep kernel observability approachable, safe, and scalable for production environments.
Requirements
- 4+ years of experience with Linux fundamentals (syscalls, namespaces, cgroups, networking/storage subsystems) and hands-on experience using or building eBPF-based tooling.
- 2+ years of experience with Kubernetes and container runtimes.
- Experience with libbpf, cilium/ebpf, BCC, CO-RE, or kernel tracing/profiling tools; familiarity with OCI packaging for artifacts.
- GPU/AI: low-level GPU work (e.g., CUDA/ROCm, GPU drivers/profiling) or building/operating AI/ML workloads—especially for observability or performance analysis.
- Background in AKS/k8s operations/performance tuning and in building or integrating observability tooling for large clusters, plus exposure to partner scenarios beyond Kubernetes.
- Demonstrated open-source contributions (e.g., Linux kernel, eBPF ecosystem, Kubernetes, container runtimes, or observability projects), with evidence of collaborative upstream work.
- Experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
Responsibilities
- Design and implement Go services and eBPF programs that power Inspektor Gadget’s gadgets, enrichment pipeline, and developer APIs—prioritizing reliability, performance, and usability on AKS.
- Advance image-based gadgets (OCI) and operators; improve data interfaces and integration points for AKS and additional platforms.
- Optimize for scale: reduce agent footprint, improve enrichment fidelity (pods/nodes/services), and ensure smooth operation on large multi-node clusters.
- Collaborate with AKS, Azure Linux, and partner teams on priorities and roadmaps; contribute upstream to relevant open-source communities (eBPF, Kubernetes, observability).
- Act as DRI for components/services as needed—own outcomes and drive investigations and improvements when issues arise in high-impact customer and partner scenarios.
- Write high-quality Go and eBPF code, evolve image-based gadgets, enhance Kubernetes/AKS enrichment, and create integration pathways that help partner systems and customers consume kernel insights at scale.
- Contribute upstream to advance open observability.
Other
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Experience working in a SaaS (Software-as-a-Service) company, especially one whose primary customers are businesses (B2B) rather than direct consumers (B2C). This includes understanding enterprise customer needs, supporting business-critical workloads, and collaborating across product and engineering teams to deliver scalable solutions.
- Microsoft will accept applications for the role until October 7, 2025.