Hebbia is seeking a Data Engineer to refine their data infrastructure and drive best practices for building data pipelines to meet the company's data needs.
Requirements
- Significant hands-on experience in data engineering (ETL development, data warehousing, data lake management, etc.)
- Proficient in Python and SQL
- Comfortable working with cloud-based data stack tools
- Familiar with big data processing frameworks (e.g., Spark, Hadoop) and data integration technologies (e.g., Airflow, DBT, or similar)
- Experience implementing data governance, security, and compliance measures
Responsibilities
- Architect, build, and maintain ETL pipelines and workflows that ensure high data quality and reliability
- Design and manage a central data lake to consolidate data from various sources, enabling advanced analytics and reporting
- Implement best practices in data security and governance to ensure compliance and trustworthiness
- Evaluate and integrate new technologies, tools, and approaches to optimize data processes and architectures
- Continuously monitor, troubleshoot, and improve data pipelines and infrastructure for performance, scalability, and cost-efficiency
Other
- Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field
- 5+ years software development experience at a venture-backed startup or top technology firm, with a focus on data engineering
- Strong collaboration and communication skills, with the ability to translate business requirements into technical solutions
- Prior experience in a high-growth or startup environment is a plus
- You are comfortable working in-person 5 days a week