Forward Deployed AI/ML engineer

GW90
  • $200,000-$250,000
  • Bay Area, CA
  • Permanent

About the job


We’re looking for a highly skilled Python engineer with hands-on experience designing and deploying LLM-driven applications in production environments. The right candidate has built real-world systems leveraging OpenAI, Anthropic, Gemini, or other large-scale machine learning models—and understands their practical tradeoffs across latency, reliability, and cost. A strong background in agentic systems, retrieval-augmented generation (RAG), prompt design, and LLM orchestration at scale is essential.


This is a forward-deployed position, meaning you’ll work closely with customers—often on-site—to guide technical discovery, architect solutions end-to-end, and ensure successful delivery. The role requires both deep technical execution and the ability to build lasting client partnerships.


Core Requirements:


● Advanced Python proficiency in production environments, including experience with FastAPI, Pydantic, strong typing, Pyright, Ruff, and Alembic.

● Proven track record of building and shipping LLM-based systems—from agentic frameworks and chatbots to RAG pipelines.

● Comfortable managing both technical and business stakeholders in dynamic settings.

● Prior experience as a technical or team lead on complex, high-impact initiatives.

● Hands-on familiarity with AI developer productivity tools (e.g., Cursor, Claude Code, OpenAI Codex) in production workflows.

● Deep knowledge of LLM APIs (OpenAI, Anthropic, Gemini) and expertise in evaluating latency, context limits, cost structures, and reliability tradeoffs.

● Experience with vector databases and embedding models (e.g., Voyage AI, Weaviate, Pinecone).

● Understanding of model evaluation, non-deterministic testing, and LLM observability practices.


Nice to Have:


● Exposure to our preferred stack: Temporal (for workflow orchestration), GCP, Terraform, Posthog, and the Ciridae LLM Gateway.

● Experience in customer-facing engineering or consulting environments with high-touch client interaction.

● Strong consultative communication and problem-solving skills.

Derek Gemski ML Research & Engineering Recruiter

Apply for this role