Developing and optimizing high-performance distributed systems to power search and discovery across billions of Tumblr posts
Requirements
- Two (2) years experience deploying, maintaining, and scaling production search systems;
- Two (2) years experience with Elasticsearch, Solr, or Lucene;
- Two (2) years experience in PHP, Java and/or Scala;
- Two (2) years experience in Information Retrieval and/or algorithms;
- At least one (1) year of experience with Hadoop and MapReduce, including frameworks such as Spark;
- At least one (1) year of experience in Airflow, Kafka, or Flink;
Responsibilities
- Enhance Tumblr’s Search and Dashboards, implementing new signals and features, and building scalable data solutions to enhance content relevance and user experience;
- Explore and implement new signals and features to improve content quality and relevancy; Launch A/B tests and perform analysis to validate hypotheses, and build tools to enable continuous experimentation;
- Build out new content datasets using Elasticsearch, Hadoop, Kafka, Druid, Flink and/or Spark;
- Maintain and optimize a fleet of Elasticsearch servers via Kubernetes to ensure search and discovery services operate efficiently;
- Continuously improve and refine search quality and relevance;
Other
- Requires at minimum a Bachelor’s degree in Computer Science, Engineering, or related technical field or foreign equivalent
- 2 years of Progressively responsible experience in a software engineering role
- Less than 24% incidental travel to various locations for team building and business strategization a few times per year
- Full-time telecommuting from anywhere in the U.S. is permitted