Rutgers University seeks to develop and support biomedical and clinical informatics research by creating efficient data systems and fostering collaborations to advance research computing and position the university as a leader in the field.
Requirements
- Proven experience in data pipeline building, optimization, and data system architecture within a healthcare or clinical research environment.
- Expertise in data wrangling techniques, ETL (Extract, Transform, Load) processes, and staging data for analysis.
- Proficiency in programming languages (e.g., Python, SQL, Java, etc.) and experience with database technologies (e.g., SQL Server, PostgreSQL, etc.).
- Ability to demonstrate deep knowledge of statistical methods and will demonstrate practical knowledge of machine learning model building and deployment.
- Knowledgeable of research processes and language in biological or medical fields and be able to effectively communicate and support researchers in these domains with technological and methodological expertise.
- Experience working with Enterprise Infrastructure teams to align data engineering efforts with broader infrastructure strategies is preferred.
- Experience with cloud-based data technologies and distributed computing frameworks (e.g., AWS, Azure, Hadoop, Spark, etc.) is advantageous.
Responsibilities
- Participates in development of new biomedical and clinical informatics research and support model, potentially leading to creation of a core facility and/or center of excellence.
- Develops and maintains data pipelines to facilitate the smooth flow and collection of clinical research data, integrating various sources for comprehensive analysis.
- Utilizes expertise in data wrangling techniques to clean, transform, and prepare raw data for analysis, ensuring data quality and consistency.
- Optimizes data systems for performance, ensuring efficient storage, retrieval, and staging of data for analysis and reporting purposes.
- Collaborates closely with Enterprise Infrastructure (EI) teams to align data engineering efforts with broader infrastructure strategies and capabilities.
- Develops and coordinates instructor-based training focused on advanced computing.
- Develops and expands internal and external partnerships, including industry, to promote advanced computing-related collaborations.
Other
- Master’s degree and 6 years of experience supporting computationally intensive and/or data intensive research projects or an equivalent combination of education and/or experience.
- Strong verbal and written communication skills.
- Positive attitude and enjoys working in a dynamic and flexible environment.
- Ability to work with minimal supervision and demonstrate a history of self-motivation.
- Ph.D. in a related field is preferred.