Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

AI Solution Architect (Remote)

Inspira Financial

$106,000 - $134,000

Dec 5, 2025

Oak Brook, IL, US

Inspira Financial is looking to accelerate the delivery of AI-driven solutions that delight customers and improve business outcomes by guiding engineering teams on the application of AI tools to real business problems.

Requirements

Extensive hands-on experience with Google Cloud Platform (GCP), including Vertex AI, BigQuery, Dataflow, Pub/Sub, and cloud-native microservices, APIs, event streaming, containers/orchestration (Kubernetes/GKE), and Infrastructure as Code (Terraform/Deployment Manager).
Practical expertise with GenAI patterns: Retrieval-Augmented Generation (RAG), vector databases (e.g., BigQuery Vector Search), prompt engineering/evaluation, agent design, function/tool calling, and orchestration using Google AI tools.
Strong grasp of MLOps/LLMOps: CI/CD for models and prompts, offline/online evaluation, telemetry, drift and safety monitoring, with a focus on Google’s Vertex AI Pipelines, Model Registry, and continuous deployment.
Experience designing for security, privacy, and compliance within GCP; proficiency with OAuth/OIDC, secrets management (Secret Manager), data protection, and model/content safety controls aligned with Google’s best practices.
8+ years in software/solution architecture or platform engineering, including 3+ years delivering applied ML/GenAI solutions in production environments.
GCP Architect, Security (e.g., CCSK) or equivalent certifications (nice-to-have).

Responsibilities

Own end‑to‑end solution architectures for AI and AI‑enabled products (discovery design * deployment), ensuring security, reliability, cost efficiency, and maintainability across cloud and on‑prem boundaries.
Establish reference architectures and reusable patterns for GenAI and applied AI (RAG, agents/orchestration, vector search, prompt & tool design, event‑driven microservices, API gateways).
Select fit‑for‑purpose models and services (e.g., Azure OpenAI/Bedrock/Vertex, OSS LLMs, embedding models) with clear tradeoffs on performance, latency, privacy, and cost.
Partner with product and platform teams to ship solutions: define requirements, review designs and PRs, and drive prototype pilot * production with CI/CD, IaC, and MLOps/LLMOps (model versioning, prompt/config management, evals, drift & safety monitoring).
Coach teams to use copilots (e.g., GitHub/Claude), agent frameworks (e.g., LangGraph/Semantic Kernel), and integration SDKs responsibly to improve velocity without compromising quality or security.
Ensure observability (tracing, guardrails, red‑teaming, cost dashboards), SLOs, and runbooks are in place before cutover.
Run architecture discovery with business stakeholders; frame problems, quantify constraints, and map KPIs (time‑to‑first‑value, cost‑to‑serve, task success, CSAT/NPS, deflection rate, accuracy).

Other

Bachelor’s degree in Computer Science, Engineering, Data Science, Artificial Intelligence, or equivalent experience.
Exceptional written and verbal communication skills; proven ability to influence and collaborate across product, engineering, security, and business teams.
Ability to work occasional overtime.
Occasional travel (up to ~15%).
Occasional after-hours work to support releases or incident response.