The Walmart/VIZIO Data Lake Team is looking to drive data-driven decisions and innovation across the company by building and maintaining high-performance, high-availability data structures.
Requirements
- Proficiency in Python, PySpark, SQL, and/or Scala.
- Experience with relational (SQL) and NoSQL databases.
- Strong understanding of in-memory processing and data formats (Avro, Parquet, JSON, etc.).
- Experience with AWS cloud services (EC2, MSK, S3, RDS, SNS, SQS).
- Knowledge of stream-processing systems (Storm, Spark Structured Streaming, Kafka).
- Familiarity with data pipeline and workflow management tools (Apache Airflow, AWS Data Pipeline).
- Bonus: Experience with Databricks, Snowflake, and ThoughtSpot.
Responsibilities
- Extract and transform data from various internal and external sources.
- Develop and maintain data pipelines and ETL processes.
- Implement data governance practices and ensure data quality.
- Design and develop data models, both logical and physical.
- Lead projects and mentor junior team members.
- Collaborate with cross-functional teams to support business needs.
- Stay current with data science and analytics trends.
Other
- BS or MS in Computer Science or a related field.
- 3+ years of experience in data engineering.
- Travel requirements not specified.
- Must be eligible to work in the United States.
- Option 1: Bachelor’s degree in Computer Science and 2 years of experience in software engineering or a related field.