Evaluation Jobs

92 jobs from companies building with AI · Avg salary $239k (44 with data)

AI evaluation engineering roles focused on benchmarking, testing, and measuring model performance. Evaluation engineers build the frameworks that determine whether AI systems are improving.

Staff Data Scientist - Behavior Evaluation

Zoox · Foster City, CA · $256k - $307k
autonomous-vehicles data-science evaluation
On-site full-time lead 4 years ago

Senior Software Engineer, Cloud, Simulation

Wayve · London, UK
computer-graphics robotics autonomous-vehicles generative-ai evaluation
Hybrid full-time senior 1 week ago

Senior Software Engineer, Simulation

Wayve · London, UK
cloud generative-ai autonomous-vehicles robotics computer-graphics evaluation
Hybrid full-time senior 1 week ago

Triage Specialist

Wayve · Sunnyvale, CA
generative-ai robotics autonomous-vehicles evaluation
Hybrid full-time mid 1 week ago

Model Evaluation & Data Quality Lead

Twelve Labs · San Francisco, CA
generative-ai data-pipeline alignment llm evaluation
Remote full-time lead 1 week ago

Senior/Staff Software Engineer, Search & Retrieval Infrastructure

Pinecone · Tel Aviv, Israel
llm embeddings agents rag search distributed-systems evaluation infrastructure
Hybrid full-time lead 3 weeks ago

Member of Engineering (Evaluations / Engineering)

Poolside · Remote (Europe)
distributed-systems data-pipeline mlops evaluation research
Remote full-time lead 1 month ago

Senior AI Software Engineer - Model Evaluation (f/m/d)

Aleph Alpha · Heidelberg
llm generative-ai nlp distributed-systems pytorch pre-training research evaluation
Hybrid full-time senior 1 month ago

Software Engineer, Foundations Retrieval

OpenAI · San Francisco, CA
search pre-training agents distributed-systems research evaluation
On-site full-time mid 1 month ago

Research Engineer – Benchmarking, Evals & Failure Analysis

Mercor · San Francisco, CA
llm agents search data-pipeline evaluation research
On-site full-time mid 1 month ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

Staff Data Scientist

Wayve · London, UK
generative-ai distributed-systems pytorch autonomous-vehicles evaluation data-science
On-site full-time lead 1 month ago

Senior Data Scientist

Wayve · London, UK
distributed-systems pytorch autonomous-vehicles generative-ai data-science evaluation
On-site full-time senior 1 month ago

Senior Software Engineer, Data & Eval Platform

Dyna Robotics · Redwood City, CA
generative-ai robotics data-pipeline evaluation
On-site full-time senior 1 month ago

Senior/Staff Software Engineer, Search & Retrieval Infrastructure

Pinecone · Remote (US)
llm distributed-systems agents search rag embeddings evaluation infrastructure
Hybrid full-time lead 1 month ago

Machine Learning Eval Engineer

Reducto · San Francisco, CA
cloud llm machine-learning evaluation
On-site full-time mid 2 months ago

Senior Software Engineer, AI Retrieval

Asana · Warsaw
healthcare search llm evaluation
Hybrid full-time senior 2 months ago

Research Scientist (Measurement and Evaluation)

Abridge · NYC Office
payments healthcare generative-ai research evaluation
Hybrid full-time senior 2 months ago

Engineer - Agents & Evals

Lovable · Stockholm, Sweden
agents llm fine-tuning evaluation
On-site full-time mid 2 months ago

Member of Technical Staff (Data Scientist, Evals)

Perplexity · London, UK
llm search agents cloud data-science evaluation
Remote full-time lead 3 months ago

Model Evaluation QA Lead

Deepgram · United States
pre-training cloud speech nlp gpu generative-ai evaluation
Remote full-time lead 3 months ago
Hiring AI developers? Start with a job post. Claiming the profile comes after. Post a job →

Weekly AI Jobs Digest

Top new roles from 50+ companies. Curated, not scraped. One email, every Monday.

No spam. Unsubscribe anytime.

Hiring AI engineers?

Post the role first. Your company profile and analytics connect from the employer flow.

Post a Job See Pricing

Agentic API & MCP Server

Wire AI Dev Jobs into your agent at build time — MCP server live, REST API public for discovery, free API keys for recurring search.

# Add as MCP server
claude mcp add --transport http aidevjobs https://aidevboard.com/mcp

# Or hit the REST API
curl https://aidevboard.com/api/v1/jobs?tags=llm,pytorch

13 tools via com.aidevboard/jobs · Free keyed access · Pro $49/mo →