ML Engineer — Pre-Training
full-time
senior
Posted 1 month ago
About this role
Work on Mistral's next-generation model pre-training pipeline. Design training recipes, optimize distributed training, and push model capabilities forward.
Paris-based with top-tier compute. Competitive with the best labs globally.
Requirements
Strong ML fundamentals. Experience with distributed training (FSDP, DeepSpeed, Megatron). Python/PyTorch required. Publications preferred.
Similar Jobs
Related searches:
On-site Jobs
Senior Jobs
On-site Senior Jobs
Senior Machine LearningSenior AI ResearchSenior Backend & SystemsSenior NLP & Language AI
AI Jobs in Paris
Machine Learning in ParisAI Research in ParisBackend & Systems in ParisNLP & Language AI in Paris
pytorchdistributed-trainingllmpythoncudatransformerspre-training
Get jobs like this delivered weekly
Free AI jobs newsletter. No spam.