Cloud Inference Tech Lead

GW28 Posted: 19/08/2025

$250,000-$325,000
Remote
Permanent

Cloud Inference Tech Lead

About the Role

Developers often struggle to deploy trained ML models due to fragmented solutions that require heavy customization. This role leads a GenAI cluster-level inference initiative—built as a Kubernetes-native platform to deliver reliable, high-performance deployment across diverse GPUs and other accelerators.

What You’ll Do

Technical Leadership: Mentor engineers and help set the technical direction for large-scale GenAI inference.
Strategic Vision: Drive product strategy and design for an enterprise-grade cloud inference serving platform; partner closely with customers to ensure success.
Technical Excellence: Support rapid growth and new use cases in a fast-moving environment, applying cutting-edge solutions.
Customer Success: Collaborate across product and engineering leadership to deliver integrated, large-scale cloud inference deployments.

What You Bring

10+ years in cloud infrastructure.
Proven experience building production-quality, high-performance cloud software.
Background in AI/ML infrastructure or model serving.
Demonstrated technical leadership and cross-team collaboration.
Track record of customer excellence.
Strong understanding of operating large-scale systems on cloud infrastructure.

Work Setup

US or Canada; work remotely or from the company’s office. (Onboarding is conducted in person at the Los Altos, CA office.)

Equal Opportunity & Compliance

Committed to equal employment opportunity and providing reasonable accommodations. Participates in E-Verify in the US.

Tyler Long Software Systems & HPC Recruiter

Apply for this role

First Name

Last Name

Telephone Number

Email Address

Resume, LinkedIn or Dropbox URL

Resume Upload

Choose File

LinkedIn / Dropbox URL

Message

By submitting this form you agree to our Terms & Conditions, Privacy Policy & Cookie Policy

Not yet registered? Create an account today

Already have an account? Sign in now

Still looking? What about...

Featured Jobs

View all jobs

Posted: 19/08/2025

Cloud Inference Tech Lead

GW28

$250,000-$325,000
Remote
Permanent

Cloud Inference Tech LeadAbout the RoleDevelopers often struggle to deploy trained ML models due to frag...

View Job

Posted: 19/08/2025

Head of Cloud Inference

GW27

$300,000-$400,000
Remote
Permanent

Head of Cloud InferenceDevelopers face challenges deploying trained machine learning models due to fragm...

View Job

Posted: 18/08/2025

Applied Research Engineer

GW26

$200,000-$250,000
San Francisco, CA
Permanent

About the job🌟 Applied Research Engineer – Video Intelligence 🌟Our client is on a miss...

View Job

Posted: 18/08/2025

Applied Research Engineer

GW25

$200,000-$250,000
New York, NY
Permanent

About the job🌟 Applied Research Engineer – Video Intelligence 🌟Our client is on a miss...

View Job

Posted: 18/08/2025

Founding Engineer

GW24

$150,000-$250,000
San Francisco, CA
Permanent

About the jobFounding Engineer - San Francisco, CAA YC backed cutting-edge AI startup that empowers busi...

View Job

Posted: 18/08/2025

AI Researcher

GW23

$170,000- $300,000
San Francisco, CA
Permanent

AI Researcher - San Francisco (Onsite) A San Francisco-based AI infrastructure startup on...

View Job

Posted: 12/08/2025

Post-Graduate Role: Fullstack Software Engineer

GW22

$125,000-$135,000
Denver, CO
Permanent

Post-Graduate Role: Full Stack Software EngineerAbout the CompanyJoin a fast-growing AI startup, backed ...

View Job

Posted: 12/08/2025

Frontend Engineer

GW21

$150,000-$180,000
New York City
Permanent

Front-End Engineer - San Francisco/New YorkAn AI strike force spun out of a16z and Apple, working direct...

View Job

Posted: 12/08/2025

Frontend Engineer

GW20

$150,000-$180,000
San Francisco, CA
Permanent

Front-End Engineer - San Francisco/New YorkAn AI strike force spun out of a16z and Apple, working direct...

View Job

Posted: 12/08/2025

Fullstack Engineer

GW19

$150,000-$180,000
New York City
Permanent

Full Stack Engineer (Front-End Leaning) - San Francisco/New YorkAn AI strike force spun out of a16z and ...

View Job

Quick Resume Dropoff

Cloud Inference Tech Lead

Cloud Inference Tech Lead

About the Role

What You’ll Do

What You Bring

Work Setup

Equal Opportunity & Compliance

Apply for this role

Still looking? What about...

Featured Jobs

Cloud Inference Tech Lead

Head of Cloud Inference

Applied Research Engineer

Applied Research Engineer

Founding Engineer

AI Researcher

Post-Graduate Role: Fullstack Software Engineer

Frontend Engineer

Frontend Engineer

Fullstack Engineer

Contact Us

Find us on social

Useful Links

Legal