Topsort is looking to solve the business problem of democratizing the secret technologies of the walled gardens and creating a privacy-first cookie-free world of clean advertising with modern tech, friendly products, and AI. They aim to make advertising intuitive, intelligent, and genuinely cool, without creepy ads or cookie-obsession. The Senior Data Engineer will enable data-driven decision-making across the organization by ensuring the availability, reliability, and efficiency of data systems.
Requirements
- Strong proficiency in SQL and database technologies with (e.g., PostgreSQL, MySQL, Snowflake, BigQuery).
- Experience with data pipeline orchestration tools (e.g., Apache Airflow, Prefect, Dagster).
- Proficiency in programming languages such as Python and Scala.
- Hands-on experience with AWS cloud data services.
- Familiarity with big data processing frameworks like Apache Spark.
- Knowledge of data modeling, warehousing concepts, and distributed computing.
- Experience implementing CI/CD for data pipelines.
Responsibilities
- Design, develop, and maintain robust ETL/ELT pipelines to process and transform large datasets efficiently.
- Optimize data architecture and storage solutions to support analytics, machine learning, and business intelligence.
- Work with cloud platforms (AWS) to implement scalable data solutions.
- Ensure data quality, integrity, and security across all data pipelines.
- Collaborate with data scientists, analysts, and software engineers to support data-driven initiatives.
- Monitor and troubleshoot data workflows to ensure system performance and reliability.
- Create APIs to provide analytical information to our clients.
Other
- 2+ years of experience in data engineering or a related field.
- Strong problem-solving skills and the ability to work independently and collaboratively.
- Work onsite at one of our offices in Santiago, Sao Paulo or Mexico City 4 days a week.
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Real-time data processing and streaming architectures (RisingWave, Kafka, Flink).