SANDAG's Data Science Department needs to ensure the quality and reliability of the data used and produced by various teams across the agency to support regional planning, funding, and policymaking efforts.
Requirements
- Intermediate-advanced level programming experience in Python (and other object-oriented program languages) is critical.
- Knowledge of design principles for relational database management systems; experience creating SQL queries, stored procedures, and data views; familiarity with MS SQL Server or other enterprise relational database systems.
- Experience using geographic information system software such as ESRI ArcGIS or QGIS.
- Experience using Business Intelligence/Information Sharing software (Power BI, Tableau, etc.) to create reports and dashboards.
- Experience updating and maintaining project databases, files, and records, including data documentation.
- Knowledge of quality assurance and quality control practices used for validating data and ensuring data accuracy and integrity, including statistical analysis and sampling techniques, preferably for demographic and economic data.
- Experience with data analysis and research methodologies; knowledge of data acquisition and quality control methods used for gathering and compiling various types of information; knowledge of factors that contribute to the reliability and integrity of collected data.
Responsibilities
- Analyze and validate regional economic, demographic, land use, and transportation data for use in operational, procedural, and policy decision-making activities; interpret multi-disciplinary data demonstrating a thorough understanding of issues that could affect the validity and reliability of information.
- Develop and perform quality control tests and analysis of data using tools such as Python, R, ArcGIS, and SQL to identify and report data anomalies and identify root cause patterns.
- Examine datasets and supporting documentation to identify risks and problem areas and recommend standards, policies, and procedures to facilitate a comprehensive data quality management system.
- Support Peer Review Process through review of datasets and methodologies, including suggestions for improvement as well as documentation, including meeting notes and action items.
- Create and maintain documentation related to data compliance, business rules, data flows, process flows, and reporting.
- Coordinate the preparation of metrics and statistical reports, perform reviews, and complete special studies; prepare spreadsheets, charts, maps, and data visualizations to support information sharing.
- Participate on inter-departmental and interagency teams assembled for various agency projects.
Other
- The minimum education, training, and experience include a bachelor’s degree with major course work in data science, computer science, management information systems, regional planning, geography, demography, economics, statistics, mathematics or a related field, and one to two years of professional experience in data analysis and programming.
- Prepare and present written, oral, and visual reports and recommendations to various teams and upper management.
- Experience communicating highly technical information effectively to a broad range of audiences
- Familiarity with the principles, practices, and objectives of regional planning and forecasting.
- Familiarity with formalized data governance processes and procedures.