Microsoft Cloud Operations + Innovation (CO+I) is seeking a Senior Data Scientist to support the growth of its cloud services by delivering high-quality infrastructure and leveraging data to influence product strategy and enhance customer experience.
Requirements
- 5+ years of experience in data science, analytics, or machine learning.
- 2+ years of experience within model deployment & operations: experience deploying agents in Azure cloud environments, with familiarity with containerization, continuous integration/continuous delivery (CI/CD) pipelines, and model monitoring.
- 5+ years of experience within data handling & feature development: experience handling large-scale structured and unstructured datasets, including time-series and text data, and applying advanced feature engineering techniques.
- 3+ years of experience in coding and design, specifically in the development of AI models for production services.
- 4+ years of experience in developing solutions with Microsoft Power Platform, including Power BI, Fabric, Power Automate & M365 Dataverse.
- Experience with cloud data technologies such as Azure Synapse, Azure Data Factory, SQL, and Azure Data Explorer.
- Knowledge of MLOps practices, including containerization, infrastructure-as-code, and monitoring.
Responsibilities
- Apply modification techniques to transform raw data into compatible formats for downstream systems. Utilize software and computing tools to ensure data quality and completeness. Implement code to extract and validate raw data from upstream sources, ensuring accuracy and reliability.
- Writes efficient, readable, extensible code from scratch that spans multiple features/solutions. Develops technical expertise in proper modeling, coding, and/or debugging techniques such as locating, isolating, and resolving errors and/or defects.
- Leverages technical proficiency in big-data software engineering concepts, such as the Hadoop ecosystem, Apache Spark, continuous integration and continuous delivery (CI/CD), Docker, Delta Lake, MLflow, AML, and representational state transfer (REST) application programming interface (API) consumption/development.
- Acquires the data necessary for successful completion of the project plan. Proactively detects changes and communicates them to senior leaders. Develops usable data sets for modeling purposes. Contributes to ethics and privacy policies related to collecting and preparing data by providing updates and suggestions around internal best practices.
- Adhere to data modeling and handling procedures to maintain compliance with laws and policies. Document data types, classifications, and lineage to ensure traceability and govern data accessibility.
- Perform root cause analysis to identify and resolve anomalies. Implement performance monitoring protocols and build visualizations to monitor data quality and pipeline health. Support and monitor data platforms to ensure optimal performance and compliance with service level agreements.
- Leverages knowledge of machine learning solutions (e.g., classification, regression, clustering, forecasting, NLP, image recognition) and individual algorithms (e.g., linear and logistic regression, k-means, gradient boosting, autoregressive integrated moving average [ARIMA], recurrent neural networks [RNN], long short-term memory [LSTM] networks) to identify the best approach to complete objectives.
Other
- Bachelor’s or Master’s degree in Computer Science, Math, Software Engineering, Computer Engineering, or a related field AND 3+ years’ experience in business analytics, data science, data modeling, or data engineering.
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
- Travel 0-25%
- Ability to communicate technical concepts effectively with both technical and non-technical stakeholders
- Microsoft will accept applications for the role until October 31, 2025.