Abbott is looking to solve complex business problems using data science and machine learning to help people with diabetes manage their health with life-changing products that provide accurate data to drive better-informed decisions. This role will help define and implement the organization’s Big Data strategy, working closely with data engineers, analysts, and scientists.
Requirements
- Knowledge of or direct experience with Databricks and/or Spark.
- Software development experience, ideally in Python, PySpark, Kafka or Go, and a willingness to learn new software development languages to meet goals and objectives.
- Knowledge of strategies for processing large amounts of structured and unstructured data, including integrating data from multiple sources
- Knowledge of data cleaning, wrangling, visualization and reporting
- Ability to explore new alternatives or options to solve data mining issues, and utilize a combination of industry best practices, data innovations and experience
- Familiarity of databases, BI applications, data quality and performance tuning
- Knowledge of or direct experience with the following AWS Services desired S3, RDS, Redshift, DynamoDB, EMR, Glue, and Lambda.
Responsibilities
- Design and implement data pipelines to be processed and visualized across a variety of projects and initiatives
- Develop and maintain optimal data pipeline architecture by designing and implementing data ingestion solutions on AWS using AWS native services.
- Design and optimize data models on AWS Cloud using Databricks and AWS data stores such as Redshift, RDS, S3
- Integrate and assemble large, complex data sets that meet a broad range of business requirements
- Read, extract, transform, stage and load data to selected tools and frameworks as required and requested
- Customizing and managing integration tools, databases, warehouses, and analytical systems
- Process unstructured data into a form suitable for analysis and assist in analysis of the processed data
Other
- Ability to work effectively within a team in a fast-paced changing environment
- Excellent written, verbal and listening communication skills
- Comfortable working asynchronously with a distributed team
- Experience working in an agile environment
- Practical Knowledge of Linux software