CoreWeave is seeking a Director of Engineering to lead the development of their Observability product suite for AI/ML workloads, to design, build, and operate observability solutions at scale.
Requirements
- 10+ years of experience in infrastructure, or cloud systems.
- 5+ years in engineering leadership roles, including hiring, scaling, and mentoring teams.
- Proven track record of building and managing large-scale distributed systems or infrastructure.
- Prior experience in building telemetry solutions, such as logging, metrics and tracing, would be a plus.
- Understanding of cloud computing infrastructure using Kubernetes would be a plus.
- Strong communication and interpersonal skills, able to convey storage engineering strategies and practices to technical and non-technical audiences.
- Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.
Responsibilities
- Define and drive CoreWeave’s Observability roadmap and strategy.
- Lead and grow a high-performing team of software engineers and managers.
- Design and implement advanced solutions, including low-latency, high-scale Observability pipelines across all products.
- Build solutions that offer insights to customers for rapid troubleshooting of their AI workloads.
- Champion initiatives to improve reliability, durability, and self-healing capabilities of Observability metrics, and assume operational responsibilities.
- Develop operational review practices to assess performance against targets and iterating on those targets.
- Mentor and guide engineering teams on best practices in product engineering, fostering a customer-focused approach to systems design and technical excellence.
Other
- Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.
- Strong communication and interpersonal skills, able to convey storage engineering strategies and practices to technical and non-technical audiences.
- Travel may be required for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets.
- Must be eligible to access export controlled information, or eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.
- U.S. person, defined as a U.S. citizen or national, U.S. lawful permanent resident (green card holder), refugee under 8 U.S.C. § 1157, or asylee under 8 U.S.C. § 1158