Airbnb is looking to improve the quality, efficiency, and speed of the Host Pricing team’s ML training data and infrastructure.
Requirements
- Expertise in SQL
- Proficient in at least one data engineering language, such as Python or Scala
- Experience with Superset and Tableau
- Expertise in large-scale distributed data processing frameworks like Presto or Spark
- Experience with an ETL framework like Airflow
- Extensive knowledge of data management concepts, including data modeling, ETL processes, data warehousing, and data governance.
- Understanding of data security and privacy principles, as well as regulatory compliance requirements (e.g., GDPR, CCPA)
Responsibilities
- Design and implement data pipelines by leveraging best-in-class tools and infrastructure to meet critical business and product requirements.
- Develop high quality data assets for product and AI/ML use-cases
- Collaborate with cross-functional teams to gather requirements, assess data needs, and design efficient solutions that align with business objectives.
- Contribute to the development of long-term data strategies and roadmaps and ML infrastructure development within the organization.
- Influence the trajectory of data in decision making
- Improve trust in our data by championing for data quality across the stack
- Identify and actively work upon opportunities for automation and implement data management tools and frameworks to enhance efficiency and productivity.
Other
- 9+ years of experience with a BS/Masters or 6+ years with a PhD
- Excellent communication skills, both written and verbal, ability to distill complex ideas for technical and non-technical stakeholders
- Strong capability to forge trusted partnerships across working teams
- Mentor and coach team members, providing guidance in data engineering best practices and support to enhance their skills and performance.
- Remote - USA, with occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager.