Inference Jobs

77 jobs from companies building with AI · Avg salary $276k (33 with data)

ML inference engineering roles focused on model serving, latency optimization, quantization, and efficient deployment. Inference engineers make models fast and cost-effective in production.

Software Engineer - GenAI inference

Databricks · San Francisco, CA · $142k - $204k
generative-ai data-pipeline mlops distributed-systems gpu cloud llm inference
On-site full-time mid 7 months ago

Senior Software Engineer II, Inference

CoreWeave · Sunnyvale, CA · $165k - $242k
gpu llm distributed-systems inference
Hybrid full-time senior 7 months ago

Senior Backend Engineer, Inference Platform

Together AI · San Francisco, CA · $160k - $250k
microservices gpu llm distributed-systems generative-ai platform backend inference
On-site full-time senior 9 months ago

Engineering Manager, Inference

Anthropic · San Francisco, CA · $425k - $560k
distributed-systems alignment inference research
Hybrid full-time junior 11 months ago

Member of Technical Staff - Inference

xAI · Palo Alto, CA · $180k - $440k
code-generation llm mlops gpu inference infrastructure
On-site full-time lead 1 year ago

Software Engineer, AI Inference

SkildAI · San Francisco, CA · $100k - $300k
agents pytorch robotics tensorflow deep-learning gpu inference
On-site full-time senior 1 year ago

Machine Learning Engineer - Inference

Together AI · San Francisco, CA · $160k - $230k
llm gpu pytorch research inference machine-learning
On-site full-time mid 1 year ago

Engineering Manager, Inference Routing and Performance

Anthropic · San Francisco, CA · $405k - $485k
llm research inference
On-site full-time senior 2 months ago

Staff + Sr. Software Engineer, Inference Deployment

Anthropic · San Francisco, CA · $320k - $485k
alignment inference infrastructure
Hybrid full-time lead 3 months ago

Member of Technical Staff - Inference

Prime Intellect · Remote · $150k - $300k
agents reinforcement-learning gpu api-design llm pytorch cloud inference
Hybrid full-time lead 8 months ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

Senior Data Scientist, Causal Inference

Airbnb · Remote · $179k - $210k
inference data-science
Remote full-time senior 2 months ago

Staff ML Performance Engineer (Inference Optimisation)

Wayve · London, UK
gpu autonomous-vehicles generative-ai inference
Hybrid full-time lead 6 days ago

Lead Member of Technical Staff, Inference Infrastructure

Cohere · San Francisco, CA
llm nlp generative-ai mlops distributed-systems search infrastructure inference
Hybrid full-time lead 3 weeks ago

Software Engineer - Voice AI (Inference Runtime)

Baseten · San Francisco, CA
healthcare pytorch speech mlops llm inference
Remote full-time mid 3 weeks ago

Engineering Manager (AI Inference)

Perplexity · San Francisco, CA
tensorflow pytorch gpu llm inference
On-site full-time senior 1 month ago

Member of Technical Staff (AI Inference Engineer)

Perplexity · San Francisco, CA
llm pytorch tensorflow distributed-systems gpu api-design deep-learning inference
On-site full-time lead 1 month ago

Member of Technical Staff (AI Inference Engineer)

Perplexity · London, UK
llm pytorch gpu deep-learning api-design tensorflow distributed-systems inference
On-site full-time lead 1 month ago

Senior Performance Engineer, Inference

Cerebras · Sunnyvale, CA
llm agents search code-generation gpu generative-ai inference
On-site full-time senior 1 month ago

Engineering Manager, Model Routing & Inference

Cursor · San Francisco, CA
llm gpu mlops distributed-systems data-pipeline inference
On-site full-time mid 1 month ago

Sr. Software Engineer, Inference

Anthropic · London, UK
distributed-systems cloud alignment llm infrastructure inference
Hybrid full-time senior 2 months ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

Weekly AI Jobs Digest

Top new roles from 37+ companies. Curated, not scraped. One email, every Monday.

No spam. Unsubscribe anytime.

Hiring AI engineers?

Post the role first. Your company profile and analytics connect from the employer flow.

Post a Job See Pricing

Agentic API & MCP Server

Wire AI Dev Jobs into your agent at build time — MCP server live, REST API public for discovery, free API keys for recurring search.

# Add as MCP server
claude mcp add --transport http aidevjobs https://aidevboard.com/mcp

# Or hit the REST API
curl https://aidevboard.com/api/v1/jobs?tags=llm,pytorch

13 tools via com.aidevboard/jobs · Free keyed access · Pro $49/mo →