ML Infrastructure Engineer
- $200,000-$275,000
- San Francisco, CA
- Permanent
About the job
Senior ML Infrastructure / Backend Engineer
Series C Startup | AI-Powered 3D & Avatar Platform | Hybrid (LA or SF)
We’re hiring a Senior ML Infrastructure / Backend Engineer to join a well-funded AI company building the visual and interaction layer for the next generation of AI-powered digital identities.
This team is developing production systems that bring AI characters out of chat boxes and into real-time, interactive 3D experiences.
You’ll own backend and infrastructure systems that serve ML-powered functionality at scale — supporting high-concurrency user traffic, low-latency inference, and rapid iteration as the platform grows.
You’ll work closely with ML researchers, platform engineers, and product teams to take models from experimentation to reliable, scalable production services.
What You’ll Do:
- Own backend services and APIs that expose ML-powered features to real users
- Design and operate orchestration layers for ML workloads (routing, batching, retries, concurrency)
- Deploy and scale ML-backed services in cloud environments
- Take systems from architecture → implementation → production ownership
- Scale infrastructure to support thousands to hundreds of thousands of daily requests
- Implement observability, monitoring, and alerting to ensure system reliability
- Partner closely with ML teams to productionize generative and ML models
- Improve end-to-end efficiency across inference, post-processing, and data pipelines
What We’re Looking For:
- Strong, production-level experience building and owning backend or distributed systems
- Hands-on experience designing and operating APIs (Python preferred — FastAPI, Flask, or gRPC)
- Experience deploying and running ML-backed systems in production environments
- Proven ability to scale systems under real user traffic with attention to latency and reliability
- Experience with cloud platforms (AWS, GCP, or similar) and containerized deployments
- Strong debugging, performance tuning, and operational ownership skills
Nice to Have:
- Experience with ML inference optimization (quantization, mixed precision, ONNX, TensorRT)
- Familiarity with scalable inference frameworks (Ray Serve, Triton, TorchServe, SageMaker)
- Exposure to generative models (diffusion or transformer-based systems)
- Experience running GPU-backed or high-performance workloads in production
This Role Is:
- Hybrid (Los Angeles or San Francisco office)
- Salary: up to $275k base, depending on experience
If you’re excited about building the infrastructure that powers real-time, embodied AI — and want ownership over systems that actually ship — we’d love to talk.