ML Engineer — Pre-Training

Mistral AI · Paris, France · $180k - $350k
full-time senior Posted 1 month ago

About this role

Work on Mistral's next-generation model pre-training pipeline. Design training recipes, optimize distributed training, and push model capabilities forward. Paris-based with top-tier compute. Competitive with the best labs globally.

Requirements

Strong ML fundamentals. Experience with distributed training (FSDP, DeepSpeed, Megatron). Python/PyTorch required. Publications preferred.

Similar Jobs

Related searches:

On-site Jobs Senior Jobs On-site Senior Jobs Senior Machine LearningSenior NLP & Language AISenior AI ResearchSenior Backend & Systems AI Jobs in Paris Machine Learning in ParisNLP & Language AI in ParisAI Research in ParisBackend & Systems in Paris pytorchdistributed-trainingllmpythoncudatransformerspre-training

Get jobs like this delivered weekly

Free AI jobs newsletter. No spam.