Khan Academy is looking to provide researchers access to student learning data in a secure containerized system that strongly protects student privacy by implementing the creation of the SafeInsight container, ETL processes, and tools.
Requirements
- 5+ years of experience in data engineering within a production environment
- Experience with Google infrastructure services and data services particularly GCS & Big Query
- Advanced knowledge of Python
- Strong SQL skills
- Demonstrated proficiency with Docker and Kubernetes
- Experience with Airflow
- Knowledge of Go is a strong plus
Responsibilities
- Collaborate with our Data Insights Group to understand the desired end data product and the overall projects goals
- Collaborate with our data architect and Data Infrastructure team to understand available data source and tooling
- Propose tooling and processes that will work with extant Khan Academy systems to meet the projects goals
- Build test and refine these tools and processes
- Deliver the SafeInsights container and associated project deliverables
- Provide documentation and training required to maintain and enhance the above
Other
- Motivated by the Khan Academy mission “to provide a free world-class education for anyone, anywhere.”
- Proven cross-cultural competency skills demonstrating self-awareness, awareness of other, and the ability to adopt inclusive perspectives, attitudes, and behaviors to drive inclusion and belonging throughout the organization.
- Background in applied math, statistics, economics, or a related technical field
- Experience in a data science research capacity is a plus
- Commitment to equal employment opportunity regardless of race, color, ancestry, religion, sex, gender, gender identity or expression, national origin, sexual orientation, age, citizenship, marital status, disability, or Veteran status