Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

CloudTech Innovations Logo

Data Scientist

CloudTech Innovations

Salary not specified
Aug 20, 2025
Dallas, TX, US
Apply Now

The company is looking to modernize its data and AI platforms by migrating existing solutions to the cloud and implementing production-grade AI/ML solutions at scale, specifically leveraging Generative AI, LLMs, and RAG.

Requirements

  • 8–10 years in data science, machine learning, and AI/ML solution delivery.
  • Strong hands-on expertise in at least one major cloud platform (AWS, Azure, or GCP) with proven production deployments.
  • Proficiency in Python, PySpark, and SQL.
  • Proven experience with Apache Spark, Hadoop ecosystem, and Big Data processing.
  • Hands-on experience with Generative AI, Hugging Face Transformers, LangChain, or LlamaIndex.
  • Expertise in RAG architectures and vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB).
  • Experience with MLOps workflows using MLflow, Docker, Kubernetes, and CI/CD tools (Jenkins, GitHub Actions, GitLab CI).

Responsibilities

  • Design, develop, and deliver end-to-end ML/AI solutions in cloud-native environments from design to deployment and monitoring.
  • Architect and implement Generative AI solutions leveraging LLMs (e.g., GPT, LLaMA, Claude, Mistral) and RAG pipelines with vector search.
  • Build and optimize Big Data pipelines using Apache Spark, PySpark, and Delta Lake integrated with cloud storage (AWS S3, Azure Data Lake, GCP Cloud Storage).
  • Design and maintain data lakehouse architectures with Databricks, Snowflake, or Delta Lake.
  • Deploy scalable MLOps pipelines using MLflow, SageMaker, Azure ML, or Vertex AI with Docker, Kubernetes (EKS, AKS, GKE), and CI/CD.
  • Implement and manage vector databases (Pinecone, FAISS, Milvus, Weaviate, ChromaDB) for RAG applications.
  • Migration projects, on-prem to cloud, cross-cloud, or legacy platform upgrades (e.g., Hadoop to Databricks, Hive to Delta Lake) , ensuring data integrity and minimal downtime.

Other

  • 8-10 Years Experience
  • Remote
  • Contract
  • Mentor junior data scientists and guide best practices for AI/ML development and deployment.
  • Collaborate with product, engineering, and executive teams to align AI solutions with business KPIs and compliance requirements.