CVS Health is looking to hire a Data Engineer to analyze data engineering problems and develop, build and manage large-scale data structures, pipelines and efficient Extract/Transform/Load (ETL) workflows that address complex problems and support business applications.
Requirements
- CI/CD, Jenkins, Git, or DevOps
- Programming in Python, R, or SQL
- Spark, Airflow, Kafka, HBase, Pig, MySQL, or NoSQL
- Data warehouse technologies: Oracle, Teradata, or DB2
- Visualization tools, including Tableau
- Software development for enterprise or web applications
- Unit and automation testing
Responsibilities
- develop large-scale data structures and pipelines to organize, collect and standardize data to generate insights and address reporting needs
- write ETL (Extract/Transform/Load) processes, design database systems, and develop tools for real-time and offline analytic processing that improve existing systems and expand capabilities
- collaborate with Data Science team to transform data and integrate algorithms and models into automated processes
- test and maintain systems and troubleshoot malfunctions
- leverage knowledge of Hadoop architecture, HDFS commands, and query design and optimization to build data pipelines
- utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems
- build data marts and data models to support Data Science and other internal customers
Other
- Master’s degree (or foreign equivalent) in Information Technology, Computer Science, Computer Information Systems, Engineering, or a related field
- Two (2) years of experience in the job offered or a related occupation
- Analyzing large data sets from multiple data sources
- SQL programming
- Software development lifecycle