Anduril Industries is looking to solve the biggest problems in defense by transforming U.S. and allied military capabilities with advanced technology, including cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology.
Requirements
- 5+ years of experience in a data engineering role building products, ideally in a fast-paced environment
- Good foundations in Python or another language
- Experience with Spark, PySpark, SQL and dbt
- Experience with Enterprise Data Systems like Palantir Foundry
- Experience with or interest in learning how to develop data services and data products
- Experience with AWS, Azure, or GCP security ecosystem, containerization, and associated tooling
- Knowledge of data & visualization tools, such as Tableau
Responsibilities
- Lead the design and roadmap for our data platform
- Partner with operations, product, and engineering to advocate best practices and build supporting systems and infrastructure for the various data needs
- Own the ingest and egress frameworks for data pipelines that stitch together various data sources in order to produce valuable data products that drive the business
- Manage a large user base and provide true data self service at scale using Palantir Foundry
- Use SQLMesh for data transformations, giving our engineers the ability to work with SQL while enjoying software engineering best practices like version control, testing, and CI/CD for data pipelines
- Utilize Athena, an aws serverless solution that allows our team to run analytics directly on our data lake without managing infrastructure
- Implement Apache Iceberg as our table format, providing advanced data management capabilities like schema evolution and time travel queries
Other
- Must be a U.S. Person due to required access to U.S. export controlled information or facilities
- You are motivated by our mission
- You are empathetic: you are eager to see the world from your users’ perspective
- You’re energized by business impact & a self-starter: you love to drive the direction of ambiguous projects
- Drive to take ownership of and debug complex data transformation pipelines and data models