Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning

Torc Robotics · Ann Arbor, MI · $226k - $271k

full-time senior Posted 3 months ago

Apply Now Stand out: build a proof-of-work pitch →

Free GitHub-based preview. Direct apply stays one click away.

Get weekly job alerts like this →

Hiring for this role?

AI Market Demand Pack · $29 one-time

Compare this role's skills with the full AI hiring market. Get ranked demand, salary bands, leading companies, public source URLs, and a decision brief.

See the live sample →

deep-learning reinforcement-learning distributed-systems payments robotics pytorch agents autonomous-vehicles

About this role

About the Company At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business. A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners.  Now a part of the Daimler family , we are focused solely on developing software for automated trucks to transform how the world moves freight. Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer. Meet the Team As a Senior Machine Learning Engineer – Learned Planner / Reinforcement Learning, you will develop and deploy machine learning models that drive decision-making for autonomous trucks. Working closely with teams across perception, prediction, planning, and safety, you will build learned behavior systems that enable safe, efficient, and human-like driving in real-world freight environments. This role focuses on owning model development and delivery for scoped problem areas, contributing to architecture decisions, and driving improvements in model performance, reliability, and iteration speed within the autonomy stack. What You’ll Do Design, develop, and deploy learned behavior models using approaches such as reinforcement learning, behavior cloning, and imitation learning Own end-to-end model development for scoped problem areas, from data ingestion and training to evaluation and deployment Write production-quality ML code to support scalable training, evaluation, and inference workflows Analyze model performance, identify failure modes, and iterate to improve robustness and generalization across driving scenarios Contribute to training pipelines, data workflows, and infrastructure, including working with large-scale datasets from simulation, fleet logs, and on-vehicle data Collaborate with simulation, validation, and autonomy teams to test and evaluate learned behavior models across diverse environments Support integration of learned planning models into simulation and validation frameworks, enabling faster iteration and improved coverage Contribute to model architecture discussions and technical decision-making within the team Mentor junior engineers on implementation, experimentation, and best practices What You’ll Need to Succeed Bachelor’s degree in Computer Science, Robotics, Electrical Engineering, Machine Learning, or related technical field with 6+ years of industry experience, OR Master’s degree with 3+ years OR PhD with 1+ years of experience Experience applying reinforcement learning, imitation learning, or sequence modeling to robotics, autonomous systems, or complex control problems Strong programming skills in Python and PyTorch, with experience writing production-quality ML code Experience training, evaluating, and improving models using large-scale datasets and distributed compute environments Solid understanding of ML architectures used in autonomy systems (e.g., transformers, RNNs, graph neural networks, policy networks) Experience debugging model behavior, analyzing performance metrics, and improving model reliability Ability to translate ambiguous problems into structured ML solutions and deliver results independently Experience collaborating cross-functionally to integrate ML models into larger autonomy systems Bonus Points: Experience in autonomous driving, robotics, or simulation-based training environments Experience with reinforcement learning frameworks or distributed training systems (e.g., Ray) Experience working with simulation environments, scenario generation, or large-scale behavior datasets Familiarity with vehicle dynamics, motion planning, or multi-agent decision-making systems Experience deploying ML models into production or real-world robotics systems Experience with learned planning systems or policy learning in real-world or simulation environments Experience integrating learned behavior models into validation and V&V workflows Background in multi-agent modeling, driver behavior modeling, or long-horizon decision-making systems Work Location: For this position, we are open to hiring in either the Ann Arbor, MI OR Blacksburg, VA (U.S.) office work locations in a hybrid capacity. We are also open to hiring Remote in the United States Perks of Being a Full-time Torc’r Torc cares about our team members and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team focused. Torc offers: A competitive compensation package that includes a bonus component and stock options 100% paid medical, dental, and vision premiums for full-time employees 401K plan with a 6% employer match Flexibility in schedule and generous paid vacation