Inference Jobs

77 jobs from companies building with AI · Avg salary $276k (33 with data)

ML inference engineering roles focused on model serving, latency optimization, quantization, and efficient deployment. Inference engineers make models fast and cost-effective in production.

Staff Software Engineer, Inference

Anthropic · Dublin, Ireland
alignment llm distributed-systems cloud inference infrastructure
Hybrid full-time lead 2 months ago

Software Engineer, ML Inference Performance

SambaNova Systems · Palo Alto, CA
deep-learning generative-ai pytorch tensorflow inference
On-site full-time senior 2 months ago

Staff Software Engineer, Inference

Anthropic · London, UK
cloud distributed-systems llm alignment inference infrastructure
Hybrid full-time lead 3 months ago

Member of Technical Staff - Edge Inference Engineer

Liquid AI · San Francisco, CA
generative-ai deep-learning research inference
Remote full-time lead 3 months ago

Full-Stack Software Engineer, Inference

Cohere · Toronto, Canada
payments generative-ai search fullstack inference
Remote full-time senior 4 months ago

Staff Software Engineer, Inference Infrastructure

Cohere · San Francisco, CA
generative-ai llm mlops nlp distributed-systems search infrastructure inference
Hybrid full-time lead 4 months ago

Architecture Intern - Inference

Etched · San Jose, CA
llm pytorch distributed-systems inference
On-site contract junior 5 months ago

Staff Inference ML Runtime Engineer

Cerebras · Sunnyvale, CA
agents deep-learning pytorch generative-ai llm inference
On-site full-time lead 5 months ago

Generative AI Inference Engineer

Stability AI · United States
gpu diffusion-models pytorch generative-ai deep-learning inference
On-site full-time senior 6 months ago

LLM Inference Engineer

Hippocratic AI · Palo Alto, CA
payments healthcare llm generative-ai gpu fine-tuning research inference
On-site full-time mid 6 months ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

Audio Inference Engineer, Model Efficiency

Cohere · New York, NY
tensorflow llm deep-learning generative-ai search pytorch mlops inference
Remote full-time mid 6 months ago

Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai

Cerebras · Europe
agents generative-ai tensorflow pytorch llm python inference frontend
Remote full-time lead 6 months ago

Software Engineer, Inference – AMD GPU Enablement

OpenAI · San Francisco, CA
mlops llm gpu inference
On-site full-time mid 7 months ago

Principal Engineer, Inference Cloud

Cerebras · Sunnyvale, CA
agents generative-ai api-design cloud mlops distributed-systems inference
On-site full-time principal 7 months ago

Senior/Staff Software Engineer - Machine Learning Platform (Inference)

Snowflake · US-CA-Menlo Park
agents deep-learning pytorch fine-tuning llm tensorflow machine-learning inference
On-site full-time lead 9 months ago

Staff Technical Lead for Inference & ML Performance

Fal · San Francisco, CA
pytorch mlops gpu inference
On-site full-time lead 9 months ago

LLM Inference Performance & Evals Engineer

Cerebras · Toronto, Canada
agents gpu llm generative-ai evaluation inference
On-site full-time mid 10 months ago

Senior Site Reliability Engineer — Token Factory (Inference Platform)

Nebius · Amsterdam, Netherlands
llm mlops generative-ai gpu cloud inference devops
Remote full-time senior 11 months ago

Senior Software Engineer, Inference

Anthropic · Dublin, Ireland
llm cloud alignment distributed-systems inference infrastructure
Hybrid full-time senior 1 year ago

Inference Technical Lead, Sora

OpenAI · San Francisco, CA
mlops generative-ai research inference
Hybrid full-time lead 1 year ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

Weekly AI Jobs Digest

Top new roles from 37+ companies. Curated, not scraped. One email, every Monday.

No spam. Unsubscribe anytime.

Hiring AI engineers?

Post the role first. Your company profile and analytics connect from the employer flow.

Post a Job See Pricing

Agentic API & MCP Server

Wire AI Dev Jobs into your agent at build time — MCP server live, REST API public for discovery, free API keys for recurring search.

# Add as MCP server
claude mcp add --transport http aidevjobs https://aidevboard.com/mcp

# Or hit the REST API
curl https://aidevboard.com/api/v1/jobs?tags=llm,pytorch

13 tools via com.aidevboard/jobs · Free keyed access · Pro $49/mo →