Research, ML

Exa · San Francisco, CA
Full-time · Mid-level · Posted 9 months ago

About this role

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop high-performance vector databases in Rust to search over it. We also own a $5M H200 GPU cluster that regularly lights up tens of thousands of machines.

On the ML team, we train foundational models for search. Our goal is to build systems that can instantly filter the world's knowledge to exactly what you want, no matter how complex your query. Basically, we want to put the web into an extremely powerful database.

We're looking for an ML Research Engineer to train embedding models for perfect search over the web. The role involves dreaming up novel transformer-based search architectures, creating datasets, creating evals, beating our internal SoTA, and repeating.

Desired Experience

- You have graduate-level ML experience (or are an exceptionally strong undergrad)
- You can code up a transformer from scratch in PyTorch
- You like creating large-scale datasets and diving deeply into the data
- You care about the problem of finding high-quality knowledge and recognize how important this is for the world

Example Projects

- Pre-training: Train a hundred-billion-parameter model
- Fine-tuning: Build an RLAIF pipeline for search
- Dream up a novel architecture for search in the shower, then code it up and beat our best model's top score
- Build an eval system that answers the question "How do we know we're advancing our search quality?" (this is an incredibly difficult question to answer)

This is an in-person opportunity in San Francisco. We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3).

In addition to premium healthcare benefits (medical, dental, vision), we also offer fertility benefits and a monthly wellness stipend to all of our employees.
