CrowdStrike is looking to hire a leader to build and operate hyper-scale hybrid cloud networks, improve network reliability and efficiency, and develop tools and approaches for monitoring and operating the network at scale.
Requirements
- 7+ years deploying and managing network infrastructure
- 7+ years of experience working with network protocols and designs such as BGP, MPLS (TE, Auto-BW), VXLAN, EVPN, and Clos architectures
- Experience with building and maintaining network monitoring and graphing tools, as well as streaming telemetry
- Programming experience in Python, Perl, Go, or another scripting language
- Experience with cloud providers such as AWS and GCP
- Experience with network simulation and testing tools (NS-3, NetSim, Batfish, Ixia)
- Production-level experience supporting large-scale network infrastructure
Responsibilities
- Set the direction for and improve the reliability and efficiency of the network
- Contribute to maintaining a high-performance, fault-tolerant, and scalable network
- Develop, track, and report on KPIs and metrics that measure network capacity, performance, and availability
- Build tools and monitoring systems that provide granular, real-time observability
- Develop automation to continuously assess and detect suboptimal network state and identify potential points of failure
- Review designs and traffic patterns to continually assess network capacity and availability
- Lead resolution of network incidents, conduct internal post-mortems, perform root cause analysis, and ensure corrective actions are taken in a timely manner
Other
- United States Citizenship OR Permanent Residency is necessary to retain access to resources for this role
- Experience leading a sustaining engineering or SRE team
- The candidate must periodically undergo and pass additional background and fingerprint checks consistent with government customer requirements
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs