Data Engineer

Wikimedia Foundation

$115,334 - $144,201
Aug 12, 2025
Remote, US

The Wikimedia Foundation's Data Platform team needs to unify data systems across the organization to deliver scalable solutions that support the open knowledge movement, enabling internal teams and the global community to leverage vast data for research, feature development, and advancing artificial intelligence responsibly.

Requirements

  • 3+ years of data engineering experience, with exposure to on-premises systems (e.g., Spark, Hadoop, HDFS).
  • Understanding of engineering best practices with a strong emphasis on writing maintainable and reliable code.
  • Hands-on experience in troubleshooting systems and pipelines for performance and scaling.
  • Working experience with data pipeline tools like Airflow, Kafka, Spark, and Hive.
  • Proficient in Python or Java/Scala, with working knowledge of the associated development tools and ecosystem.
  • Knowledge of SQL and experience with various database/query dialects (e.g., MariaDB, HiveQL, CassandraQL, Spark SQL, Presto).
  • Working knowledge of CI/CD processes and software containerization.

Responsibilities

  • Designing and Building Data Pipelines: Develop scalable, robust infrastructure and processes using tools such as Airflow, Spark, and Kafka.
  • Monitoring and Alerting for Data Quality: Implement systems to detect and address potential data issues promptly.
  • Supporting Data Governance and Lineage: Assist in designing and implementing solutions to track and manage data across pipelines.
  • Collaborating on the Shared Data Platform: Work with peers to improve and evolve the shared data platform, enabling use cases like product analytics, bot detection, and image classification.
  • Enhancing Operational Excellence: Identify and implement improvements in system reliability, maintainability, and performance.

Other

  • Good communication and collaboration skills to interact effectively within and across teams.
  • Ability to produce clear, well-documented technical designs and articulate ideas to both technical and non-technical stakeholders.
  • Desirable: Exposure to architectural/system design or technical ownership.
  • Desirable: Experience in data governance, data lineage, and data quality initiatives.
  • The Wikimedia Foundation is a remote-first organization with staff members, including contractors, based in 40+ countries.