Important: after confirming your application on this platform, you’ll receive an email with the next step: completing your application on our internal site, LaunchPod. So keep an eye on your inbox and don’t miss this step — without it, the process can’t move forward.

About the role

As a Senior AI Engineer, you’ll build AI-powered systems that turn complex data into actionable insights, tackling high-impact challenges with modern cloud and LLM workflows. You’ll shape technical direction, influence team culture, and apply AI-first thinking to real-world problems, driving innovation and measurable business value in a fast-paced, collaborative environment.

What you will do

Build AI applications: Design and deploy intelligent systems that parse tariffs, optimize utility spend, and automate workflows—shipping production-grade features quickly while maintaining quality.

Document-centric RAG with OpenAI: Implement RAG using structured tool/JSON outputs, streaming and batch flows, with robust guardrails, red-teaming, and RAG evaluation (e.g., RAGAS, TruLens).

Productionize agent workflows: Integrate cutting-edge AI models into resilient pipelines and services that run reliably in real-world environments.

Scraping/ingestion at scale: Create pipelines for automated utility logins → parse/store bills \& usage → anomaly detection → “ready-to-audit” bills, with full auditability and data lineage.

Production services on cloud: Build and operate on GCP (Cloud Run and/or GKE); use BigQuery as the analytics backbone feeding Looker; leverage Firestore for app state and permissions. (AWS experience transferable.)

APIs \& full-stack delivery: Develop APIs and backend services in Python/TypeScript and collaborate with frontend integrations as needed.

Reliability, cost \& latency controls: Lead feature-flagged rollouts, implement end-to-end tracing, and enforce p95/p99 SLOs, budgets, and rate-limiting to balance performance and spend.

Iterate rapidly: Prototype, test, and launch features fast; harden successful prototypes into scalable, observable, secure services.

Shape foundations: Establish engineering standards, architecture principles, and AI-first practices that set the bar for the company.

Must haves

Experience level: 4+ years as a software engineer and at least 2+ years at an AI-first company or building AI-powered applications.

Production engineering: Professional experience building and maintaining APIs, data pipelines, or full-stack applications in Python and TypeScript.

LLM workflow deployment: Hands-on deploying AI/LLM workflows to production (e.g., LangChain, LlamaIndex, orchestration frameworks, vector databases).

Startup DNA: Thrives in ambiguity, bias to action, problem-first mindset, and high ownership.

RAG in production: Proven track record shipping document-centric RAG (retrieval, chunking, embeddings/vector DBs, re-ranking) with OpenAI, structured tool/JSON outputs, and streaming responses.

RAG evaluation: Hands-on use of RAGAS and/or TruLens (faithfulness, answer relevance, context precision/recall) with measurable quality gates.

Guardrails \& safety: JSON Schema/Pydantic validation, moderation and grounding checks, plus red-teaming practices in production.

Cloud production (GCP-first): Experience operating services on Cloud Run/GKE, using BigQuery (consumed in Looker) and Firestore for app state/permissions; strong CI/CD discipline. (AWS familiarity is a plus/transferable.)

Scraping/ingestion at scale: Built and operated pipelines with authentication (e.g., multi-tenant logins), robust parsing/storage, and audit-ready artifacts (data lineage, repeatability).

Observability \& controls: Structured logging, tracing (e.g., OpenTelemetry), metrics; cost/latency guardrails and safe releases (feature flags, canary, rollback) meeting p95/p99 SLOs.

English: Upper-Intermediate English level.

Nice to haves

Experience with parsing unstructured data, optimization algorithms, or time-series forecasting.

Background in energy, utilities, or IoT data (not required, but valuable context).

Prior experience in a founding or early-stage engineering role.

Vector databases (pgvector, Pinecone, Weaviate) and re-ranking experience.

GCP IaC (Terraform), Secrets/IAM hardening; Looker/LookML modeling.

About us

AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.

If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!

Perks and benefits

Professional growth: Accelerate your professional journey with mentorship, TechTalks, and personalized growth roadmaps.

Competitive compensation: We match your ever-growing skills, talent, and contributions with competitive USD-based compensation and budgets for education, fitness, and team activities.

A selection of exciting projects: Join projects with modern solutions development and top-tier clients that include Fortune 500 enterprises and leading product brands.

Flextime: Tailor your schedule for an optimal work-life balance, by having the options of working from home and going to the office, whatever makes you the happiest and most productive.

Job Type: Full-time

Artificial Intelligence Engineer ID42044

Job Description

Login / Register

👋 Let's find you a Dream Job

Check Your Email!

Get job updates in your inbox