Reddit is looking to improve its machine learning infrastructure to support its growing user base and enhance its content discovery and recommendation capabilities.
Requirements
- Deep expertise in systems design, performance optimization, and cloud-native architectures (AWS, GCP, or Azure).
- Strong understanding of ML infrastructure, especially feature engineering, data freshness, feature versioning, online/offline feature retrieval, and online/offline consistency, at a scale that serves hundreds of millions of users.
- Proven experience building real-time data systems, e.g., feature stores, stream processing engines, or caching systems.
- 10+ years of professional software engineering experience.
- 3+ years leading a team to design large-scale distributed systems.
Responsibilities
- Lead the technical strategy, architecture and development of Reddit’s next-generation, real-time feature store that supports both online and offline feature retrieval.
- Work with other engineers across ML Platform and Reddit infrastructure to significantly advance the ML Platform.
- Mentor other team members in adopting a rigorous DevOps approach to maintaining and improving the health and quality of ML infra components and services.
- Work with management on team goal setting, planning, and de-risking project execution.
Other
- Strong organizational & communication skills
Benefits
- Comprehensive Healthcare Benefits and Income Replacement Programs
- 401k Match
- Family Planning Support
- Gender-Affirming Care
- Mental Health & Coaching Benefits
- Flexible Vacation & Reddit Global Days off
- Generous paid Parental Leave
- Paid Volunteer time off