TRM Labs is looking for a Senior Data Engineer to help them design, implement, and scale core components of their lakehouse architecture, specifically focusing on building the foundational data infrastructure powering next-generation analytics at scale.
Requirements
- 5+ years of experience in data or software engineering, with a focus on distributed data systems and cloud-native architectures.
- Proven experience building and scaling data platforms on GCP, including storage, compute, orchestration, and monitoring.
- Strong command of one or more query engines such as Trino, Presto, Spark, or Snowflake.
- Experience with modern table formats like Apache Hudi, Iceberg, or Delta Lake.
- Exceptional programming skills in Python, as well as adeptness in SQL or SparkSQL.
- Hands-on experience orchestrating workflows with Airflow and building streaming/batch pipelines using GCP-native services.
Responsibilities
- Architect and scale a high-performance data lakehouse on GCP, leveraging technologies like StarRocks, Apache Iceberg, GCS, BigQuery, Dataproc, and Kafka.
- Design, build, and optimize distributed query engines such as Trino, Spark, or Snowflake to support complex analytical workloads.
- Implement metadata management in open table formats like Iceberg and data discovery frameworks for governance and observability using Iceberg compatible catalogs.
- Develop and orchestrate robust ETL/ELT pipelines using Apache Airflow, Spark, and GCP-native tools (e.g., Dataflow, Composer).
- Collaborate across departments, partnering with data scientists, backend engineers, and product managers to design and implement
Other
- Join a mission-driven, fast-paced team made up of experts in law enforcement, data science, engineering, and financial intelligence, tackling complex global challenges daily.
- We are looking for people who can elevate the quality our tech and our execution.
- If you enjoy a remote-first and async friendly environment to achieve efficacy and efficiency at petabyte scale, our team could be a great pick for you!
- Team members are based in the US across almost all timezones!
- We do try to reserve some overlap in the day for meetings. Our north star - no IC spends more than 3-4 hours/week in meetings.