Inference Jobs

77 jobs from companies building with AI · Avg salary $276k (33 with data)

ML inference engineering roles focused on model serving, latency optimization, quantization, and efficient deployment. Inference engineers make models fast and cost-effective in production.

Software Engineer, Model Inference

OpenAI · San Francisco, CA
cloud pytorch gpu distributed-systems inference
On-site full-time senior 1 year ago

Inference Engineer

Cartesia AI · San Francisco, CA
gpu llm distributed-systems generative-ai research inference
Hybrid full-time mid 1 year ago

Staff Software Engineer, Inference Cloud

Cerebras · Sunnyvale, CA
mlops cloud distributed-systems agents generative-ai inference
On-site full-time lead 1 year ago

Solution Architect (AI/LLM Inference)

Baseten · San Francisco, CA
llm inference
Remote full-time mid 1 week ago

Software Engineer, Productivity - Inference Runtime

OpenAI · San Francisco, CA
distributed-systems inference
On-site full-time mid 1 week ago

Software Engineer, Inference - Performance Optimization

OpenAI · San Francisco, CA
distributed-systems inference
On-site full-time mid 3 weeks ago

Applied AI Inference Engineer

Baseten · San Francisco, CA
fine-tuning inference
Remote full-time junior 4 weeks ago

Software Engineer, Model Routing & Inference

Cursor · New York, NY
distributed-systems data-pipeline inference
On-site full-time mid 1 month ago

TL, Research Inference

OpenAI · San Francisco, CA
distributed-systems research inference
On-site full-time mid 2 months ago

Inference Technical Lead, On-Device Transformers

OpenAI · San Francisco, CA
gpu inference transformers
Hybrid full-time lead 2 months ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

AI Researcher — Inference Optimization

Featherless AI · Remote
deep-learning llm gpu pytorch research inference
Remote full-time mid 3 months ago

Senior Data Scientist - Inference, Global Markets

Airbnb · China
inference data-science
On-site full-time senior 5 months ago

Principal Engineer, AI Inference Reliability

Cerebras · Remote
generative-ai agents distributed-systems inference
Remote full-time principal 6 months ago

Software Engineer, Collect

Cohere · Toronto, Canada
generative-ai search inference
Remote full-time mid 7 months ago

Sr Engineer, Server Inference

Tenstorrent · Belgrade, Serbia
cloud inference
On-site full-time senior 10 months ago

Inference Software Engineer

Etched · San Jose, CA
distributed-systems pytorch inference
On-site full-time mid 11 months ago

Software Engineer, Inference - Multi Modal

OpenAI · San Francisco, CA
generative-ai llm inference
On-site full-time mid 12 months ago

Weekly AI Jobs Digest

Top new roles from 37+ companies. Curated, not scraped. One email, every Monday.

No spam. Unsubscribe anytime.

Hiring AI engineers?

Post the role first. Your company profile and analytics connect from the employer flow.

Post a Job See Pricing

Agentic API & MCP Server

Wire AI Dev Jobs into your agent at build time — MCP server live, REST API public for discovery, free API keys for recurring search.

# Add as MCP server
claude mcp add --transport http aidevjobs https://aidevboard.com/mcp

# Or hit the REST API
curl https://aidevboard.com/api/v1/jobs?tags=llm,pytorch

13 tools via com.aidevboard/jobs · Free keyed access · Pro $49/mo →