Stanford RegLab is looking for a Data Scientist to collaborate with research teams on machine learning and public sector projects, aiming to modernize governance through data science and machine learning.
Requirements
- Basic knowledge and demonstrated experience using and applying analytical software, database management system software, database reporting software, database user interface and query software, and data mining software.
- Basic knowledge of software engineering principles and strong knowledge of at least one programming language, such as Python, R, and/or SQL); basic statistical ability.
- Ability to collect data using a variety of methods, such as data mining and hardcopy or electronic documentation study, to improve or expand databases.
- Ability to use logic to calculate data; efficiently construct a database or scrutinize the form of a question.
- Ability to work with data of varying levels of quality and validity.
- Demonstrated ability to produce data in a clear and understandable manner meeting user requirements.
- May test prototype software and participate in approval and release process for new software.
Responsibilities
- Support collecting, managing, and cleaning of data sets from regulatory agencies, understand discrepancies and build out and document a data dictionary to improve knowledge of different systems and how to manage discrepancies.
- Produce reports and publications on findings to our government partners and for academic publications, based on extensive data analysis.
- Work with PI and Research Director to ensure data management/collection is clear, secure, and robust; improve processes for collecting and managing data.
- Support training of research fellows on best practices and technical skills
- Identify and select usable data from subtle and complex data patterns.
- Assess and produce relevant, standard, or custom information (reports, charts, graphs and tables) from structured data sources by querying data repositories and generating the associated information.
- Design methods to validate data to ensure high quality product.
Other
- This is a one-year fixed term position, with the option of renewal.
- Strong listening, verbal and written communication skills.
- Ability to manage multiple activities in a deadline-oriented environment; highly organized, flexible and rigorous attention to detail.
- Ability to work effectively with multiple internal and external stakeholders.
- new staff hires must successfully pass a background check prior to starting work at Stanford University.