The Wikimedia Foundation's Community Growth team is looking to improve knowledge equity by supporting global communities of volunteers through grantmaking, capacity development, and strategic partnership building, and needs a Senior Data Scientist to support this work with impactful, accessible, and ethical data and insights.
Requirements
- Proven experience with programming languages, particularly SQL and Python
- Experience calling and working with APIs in Python, including authentication, pagination, and JSON data handling
- Experience working with large-scale data processing & storage tools (we use Hadoop, Hive, Presto, and Spark)
- Experience with data visualization and dashboarding tools like Apache Superset, Google Looker Studio, or similar tools
- Experience with Google Suite (Including the full suite of collaborative apps and advanced coding in Google Sheets/MS Excel)
- Advanced statistical training and experience
- Interest in or knowledge of Wikimedia projects, the free knowledge movement, global education and/or international development
Responsibilities
- In partnership with the DL&E Manager, building data systems for grant-funded activities and outcomes that enable standardization and connection with our primary data pipelines
- Creation and maintenance of internal data dashboards to track Community Growth programs' outcomes and impacts
- Identifying actionable insights from trends and themes across datasets to inform reports and decision-making about grantmaking, partnerships, and programmatic interventions
- Designing Community Growth data processes for maximum effectiveness, building data frames and connecting to pipelines for effective and efficient storage and retrieval, basic aggregation and ETL
- Collaborating across departments, including with technology data teams, to ensure that calculation of metrics are aligned with Foundation-level metric practices
- Proactively identifying requirements to support and improve an evolving data ecosystem
Other
- Bachelor’s degree and five years related experience OR a Master’s degree and at least three years related experience; or equivalent advanced work experience in data analysis, data science, research and analytics or a related field
- Fluency in English
- Ability to address problems of diverse scope using good judgment in selecting methods and techniques for obtaining quality and efficient solutions
- Ability to communicate findings and recommendations clearly to colleagues with diverse backgrounds and areas of expertise
- Desire to learn, share knowledge, and help other colleagues