Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning
full-time
senior
Posted 1 week ago
About this role
About the Company
At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.
A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family , we are focused solely on developing software for automated trucks to transform how the world moves freight.
Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.
Meet the Team As a Senior Machine Learning Engineer – Learned Planner / Reinforcement Learning, you will develop and deploy machine learning models that drive decision-making for autonomous trucks. Working closely with teams across perception, prediction, planning, and safety, you will build learned behavior systems that enable safe, efficient, and human-like driving in real-world freight environments. This role focuses on owning model development and delivery for scoped problem areas, contributing to architecture decisions, and driving improvements in model performance, reliability, and iteration speed within the autonomy stack. What You’ll Do
Design, develop, and deploy learned behavior models using approaches such as reinforcement learning, behavior cloning, and imitation learning
Own end-to-end model development for scoped problem areas, from data ingestion and training to evaluation and deployment
Write production-quality ML code to support scalable training, evaluation, and inference workflows
Analyze model performance, identify failure modes, and iterate to improve robustness and generalization across driving scenarios
Contribute to training pipelines, data workflows, and infrastructure, including working with large-scale datasets from simulation, fleet logs, and on-vehicle data
Collaborate with simulation, validation, and autonomy teams to test and evaluate learned behavior models across diverse environments
Support integration of learned planning models into simulation and validation frameworks, enabling faster iteration and improved coverage
Contribute to model architecture discussions and technical decision-making within the team
Mentor junior engineers on implementation, experimentation, and best practices
What You’ll Need to Succeed
Bachelor’s degree in Computer Science, Robotics, Electrical Engineering, Machine Learning, or related technical field with 6+ years of industry experience, OR Master’s degree with 3+ years OR PhD with 1+ years of experience
Experience applying reinforcement learning, imitation learning, or sequence modeling to robotics, autonomous systems, or complex control problems
Strong programming skills in Python and PyTorch, with experience writing production-quality ML code
Experience training, evaluating, and improving models using large-scale datasets and distributed compute environments
Solid understanding of ML architectures used in autonomy systems (e.g., transformers, RNNs, graph neural networks, policy networks)
Experience debugging model behavior, analyzing performance metrics, and improving model reliability
Ability to translate ambiguous problems into structured ML solutions and deliver results independently
Experience collaborating cross-functionally to integrate ML models into larger autonomy systems
Bonus Points:
Experience in autonomous driving, robotics, or simulation-based training environments
Experience with reinforcement learning frameworks or distributed training systems (e.g., Ray)
Experience working with simulation environments, scenario generation, or large-scale behavior datasets
Familiarity with vehicle dynamics, motion planning, or multi-agent decision-making systems
Experience deploying ML models into production or real-world robotics systems
Experience with learned planning systems or policy learning in real-world or simulation environments
Experience integrating learned behavior models into validation and V&V workflows
Background in multi-agent modeling, driver behavior modeling, or long-horizon decision-making systems
Work Location: For this position, we are open to hiring in either the Ann Arbor, MI OR Blacksburg, VA (U.S.) office work locations in a hybrid capacity. We are also open to hiring Remote in the United States
Perks of Being a Full-time Torc’r
Torc cares about our team members and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team focused. Torc offers:
A competitive compensation package that includes a bonus component and stock options
100% paid medical, dental, and vision premiums for full-time employees
401K plan with a 6% employer match
Flexibility in schedule and generous paid vacation
Similar Jobs
Related searches:
Hybrid Jobs
Senior Jobs
Hybrid Senior Jobs
Senior AI Agents & RAGSenior Backend & SystemsSenior Robotics & AutonomySenior Fintech & Payments AISenior AI InfrastructureSenior Machine LearningSenior Computer Vision
AI Jobs in Ann Arbor
AI Agents & RAG in Ann ArborBackend & Systems in Ann ArborRobotics & Autonomy in Ann ArborFintech & Payments AI in Ann ArborAI Infrastructure in Ann ArborMachine Learning in Ann ArborComputer Vision in Ann Arbor
autonomous-vehiclespaymentspytorchdeep-learningreinforcement-learningroboticsdistributed-systemsagents