Software Engineer, ML Inference

GW478
  • $250,000–$320,000
  • San Francisco, CA
  • Permanent

About the job


Software Engineer, ML Inference

San Francisco (On-Site)

$250,000–$320,000 base + equity


Why this role

An early-stage infrastructure company building a next-generation AI cloud, rethinking how frontier models run across heterogeneous compute environments.


This team is focused on the hardest part of the stack: making large-scale model inference fast, reliable, and production-ready.


You’ll own the systems that actually execute models in production — working across runtime, serving infrastructure, memory management, and hardware optimisation.


What you’ll do

  • Build and scale end-to-end inference systems from request → runtime → response
  • Optimise latency, throughput, concurrency, and reliability under real production workloads
  • Design batching, scheduling, and queuing systems for high-performance serving
  • Improve KV cache management and memory efficiency at scale (a sizing sketch follows this list)
  • Debug performance bottlenecks across model, runtime, and hardware layers
  • Work closely with systems, infrastructure, and ML teams to push inference performance forward
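
A note on why KV cache work dominates this list: in transformer serving, the cached per-token key/value tensors are usually the largest per-request allocation, which is what makes paging, eviction, and batching policy so consequential. A back-of-envelope sizing sketch in Python; every model dimension below is an illustrative assumption for a Llama-7B-style dense model, not a detail of this role:

    # Back-of-envelope KV cache sizing. All dimensions are assumed
    # (Llama-7B-style, no grouped-query attention), purely for illustration.
    def kv_cache_bytes(
        batch_size: int,
        seq_len: int,
        num_layers: int = 32,     # assumed model depth
        num_kv_heads: int = 32,   # assumed: full multi-head attention
        head_dim: int = 128,      # assumed head dimension
        bytes_per_elem: int = 2,  # fp16/bf16 elements
    ) -> int:
        # Factor of 2 covers the K tensor plus the V tensor at each layer.
        per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_elem
        return batch_size * seq_len * per_token

    # 32 concurrent 4k-token requests -> 64 GiB of cache alone, before
    # weights or activations: hence paged/managed KV memory at scale.
    print(kv_cache_bytes(batch_size=32, seq_len=4096) / 2**30, "GiB")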

What makes this interesting

  • Deep work on LLM inference internals including prefill, decode, and attention optimisation
  • Solving real-world trade-offs between tail latency and throughput
  • Optimising workloads across GPUs and next-generation accelerators
  • Hands-on work with vLLM, TensorRT-LLM, and custom inference runtimes (a minimal vLLM sketch follows this list)
  • Opportunity to shape core infrastructure at an early-stage company
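
For a sense of the vLLM surface area mentioned above, the offline entry point is only a few lines; the model identifier below is a placeholder, and continuous batching plus paged KV cache management happen inside the engine:

    # Minimal vLLM offline-inference sketch; the model id is a placeholder.
    from vllm import LLM, SamplingParams

    llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # placeholder model
    params = SamplingParams(temperature=0.7, max_tokens=128)

    # generate() returns one RequestOutput per prompt.
    for out in llm.generate(["Explain paged attention briefly."], params):
        print(out.outputs[0].text)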

What they’re looking for

  • Experience building ML inference or model serving systems
  • Strong systems engineering or backend infrastructure fundamentals
  • Experience working on performance, scaling, memory, or distributed systems challenges
  • Strong Python and/or C++ skills
  • Familiarity with modern inference frameworks and runtimes is a plus


APPLY NOW!


Anna Heneghan, Senior ML Research & Engineering Recruiter
