AI Solution Architect (Remote)

Inspira Financial

$106,000 - $134,000
Dec 5, 2025
Oak Brook, IL, US

Inspira Financial is looking to accelerate the delivery of AI-driven solutions that delight customers and improve business outcomes by guiding engineering teams on the application of AI tools to real business problems.

Requirements

  • Extensive hands-on experience with Google Cloud Platform (GCP), including Vertex AI, BigQuery, Dataflow, Pub/Sub, and cloud-native microservices, APIs, event streaming, containers/orchestration (Kubernetes/GKE), and Infrastructure as Code (Terraform/Deployment Manager).
  • Practical expertise with GenAI patterns: Retrieval-Augmented Generation (RAG), vector databases (e.g., BigQuery Vector Search), prompt engineering/evaluation, agent design, function/tool calling, and orchestration using Google AI tools.
  • Strong grasp of MLOps/LLMOps: CI/CD for models and prompts, offline/online evaluation, telemetry, drift and safety monitoring, with a focus on Google’s Vertex AI Pipelines, Model Registry, and continuous deployment.
  • Experience designing for security, privacy, and compliance within GCP; proficiency with OAuth/OIDC, secrets management (Secret Manager), data protection, and model/content safety controls aligned with Google’s best practices.
  • 8+ years in software/solution architecture or platform engineering, including 3+ years delivering applied ML/GenAI solutions in production environments.
  • GCP Architect, security (e.g., CCSK), or equivalent certifications (nice to have).

Responsibilities

  • Own end‑to‑end solution architectures for AI and AI‑enabled products (discovery → design → deployment), ensuring security, reliability, cost efficiency, and maintainability across cloud and on‑prem boundaries.
  • Establish reference architectures and reusable patterns for GenAI and applied AI (RAG, agents/orchestration, vector search, prompt & tool design, event‑driven microservices, API gateways).
  • Select fit‑for‑purpose models and services (e.g., Azure OpenAI/Bedrock/Vertex, OSS LLMs, embedding models) with clear tradeoffs on performance, latency, privacy, and cost.
  • Partner with product and platform teams to ship solutions: define requirements, review designs and PRs, and drive prototype → pilot → production with CI/CD, IaC, and MLOps/LLMOps (model versioning, prompt/config management, evals, drift & safety monitoring).
  • Coach teams to use copilots (e.g., GitHub/Claude), agent frameworks (e.g., LangGraph/Semantic Kernel), and integration SDKs responsibly to improve velocity without compromising quality or security.
  • Ensure observability (tracing, guardrails, red‑teaming, cost dashboards), SLOs, and runbooks are in place before cutover.
  • Run architecture discovery with business stakeholders; frame problems, quantify constraints, and map KPIs (time‑to‑first‑value, cost‑to‑serve, task success, CSAT/NPS, deflection rate, accuracy).

Other

  • Bachelor’s degree in Computer Science, Engineering, Data Science, Artificial Intelligence, or equivalent experience.
  • Exceptional written and verbal communication skills; proven ability to influence and collaborate across product, engineering, security, and business teams.
  • Ability to work occasional overtime.
  • Occasional travel (up to ~15%).
  • Occasional after-hours work to support releases or incident response.