Reddit is looking to improve the availability, scalability, latency, and efficiency of its products by enhancing its core storage infrastructure and developing a high-performance, large-scale distributed vector store for search and machine learning applications.
Requirements
- Software development experience in one or more general purpose programming languages; Golang, Python, C++, Java
- Prior experience with Vector storage technologies is a huge plus (Milvus, Pinecone, OpenSearch, ElasticSearch, Weaviate, Pgvector).
- Prior experience with operating a large scale critical infrastructure system with a focus on automation and workflow development is a plus, especially in a role where they were required to be on call.
Responsibilities
- Design and implement reliable, efficient workflows to automate a wide range of operational tasks critical to managing Reddit’s core storage infrastructure.
- Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Reddit’s products in Go, C++ and sometimes Python.
- Contribute to the design and implementation of a high-performance, large-scale distributed vector store to power search and machine learning applications.
- Collaborate closely with engineering teams and stakeholders to integrate vector storage capabilities into broader storage infrastructure and use cases across Reddit.
- Mentor and guide other engineers on how to design, build and evangelize vector storage services across Reddit
Other
- 7+ years of experience building internet-scale software, preferably with a focus on machine learning storage infrastructure.
- Excellent communication skills to collaborate with a service-oriented team and company.