Data Ventures at Walmart is looking to unlock the full value of Walmart’s data by developing and productizing B2B data initiatives that empower merchants and suppliers to make better, faster decisions for the business.
Requirements
- Experience programming in an object-oriented language (Java or Scala).
- Experience using Spark in batch jobs to process large-scale data.
- Experience in creating and maintaining data processing workflows with tools including Airflow.
- Experience using Spark, Hive, or SQL to perform advanced data investigation.
- Experience implementing statistical and machine learning methods for data classification and regression.
- Experience developing techniques to ascertain correctness of data processing and transformation implementations using unit, integration, and end-to-end pipeline testing.
- Experience designing and developing software to perform ETL operations on large datasets.
Responsibilities
- Build data systems that ingest, model, and analyze massive flows of data from online and offline user activities, processing hundreds of millions of sales and impression records to obtain insights and analytics related to advertising campaign performance.
- Develop big data applications for precise audience targeting and cutting-edge measurement for campaign reporting, leveraging the wealth of data within the Walmart ecosystem.
- Set up ETL jobs in Airflow to move large volumes of distributed data from various sources to secondary data centers for business continuity and disaster recovery.
- Troubleshoot business and production issues: gather information (issue, impact, criticality, possible root cause), engage support teams to assist in resolving issues, formulate an action plan, perform the actions designated in the plan, interpret the results to determine further action, and complete online documentation.
- Develop complex software features to streamline and scale batch jobs to support advertising propensity models.
- Design, develop, and maintain software for the targeting and reporting data pipelines in Spark, Hadoop, and MapReduce.
- Develop software using object-oriented languages such as Scala and Java. Implement advertising measurement systems that leverage machine learning and statistical techniques.
Other
- Bachelor’s degree in computer science, computer engineering, computer information systems, software engineering, or related area and 5 years’ experience in software engineering or related area.
- 7 years’ experience in software engineering or related area.
- Master’s degree in computer science, computer engineering, computer information systems, software engineering, or related area and 3 years’ experience in software engineering or related area.
- Travel requirements not specified
- Clearance requirements not specified
- Must be able to work in the United States