Senior AI Inference Engineer - Model Optimization & Deployment

Zoox · Foster City, CA · $242k - $290k
full-time senior Posted 1 month ago

About this role

The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence. As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.

Similar Jobs

Related searches:

On-site Jobs Senior Jobs On-site Senior Jobs Senior NLP & Language AISenior AI InfrastructureSenior Machine LearningSenior Generative AI AI Jobs in Foster City NLP & Language AI in Foster CityAI Infrastructure in Foster CityMachine Learning in Foster CityGenerative AI in Foster City generative-aillmgpuinference

Get jobs like this delivered weekly

Free AI jobs newsletter. No spam.