Principal Engineer, Data & Compute
full-time
principal
Posted 5 days ago
Apply Now
Stand out: build a proof-of-work pitch →
Free GitHub-based preview. Direct apply stays one click away.
Get weekly job alerts like this →Hiring for this role?
About this role
About us
Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.
Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving. In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.
At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.
Make Wayve the experience that defines your career!
The Role
At Wayve, we are teaching machines to drive—not by coding rules, but by training end-to-end neural networks that learn from vast streams of real-world data. Achieving this requires unprecedented scale in both data infrastructure and compute orchestration. Our workloads span thousands of GPUs, petabytes of driving data, and geographically distributed training and inference clusters.
As Architect for AI Infrastructure, you will design and guide the evolution of the foundational compute and storage systems that fuel our model development lifecycle. Your leadership will directly accelerate AI research, enable rapid model deployment, and ensure our platform meets the demands of a company pushing the boundaries of autonomy.
You’ll sit at the strategic core of AI, systems, and cloud infrastructure—owning challenges that few companies have the ambition or scale to tackle.
Key Responsibilities
Global Compute Strategy – Define and evolve the architecture for how Wayve allocates and orchestrates training and inference workloads across thousands of GPUs and multiple data centers, ensuring optimal throughput, resiliency, and cost efficiency.
Petabyte-Scale Data Federation – Design systems that enable fast, reliable access to high-volume sensor and simulation data across geographies, ensuring the right data is always available for training, evaluation, and inference. Furthermore, preparing Wayve for being an exabyte-scale company.
Cross-Region GPU Job Execution – Build the foundations that enable large-scale AI workloads to run seamlessly across hybrid and multi-cloud environments.
Cloud Infrastructure Advisory – Act as a trusted partner to leadership in aligning compute investments and architecture with company strategy, growth plans, and performance goals.
Technical Leadership & Mentorship – Uplift the broader engineering org through architectural coaching, technical deep dives, and by cultivating a culture of operational and engineering excellence.
About You
In order to set you up for success at Wayve, we’re looking for the following skills and experience.
Essential
10+ years designing and building large-scale distributed systems, with at least 4 years focused on GPU-based cloud infrastructure.
Proven experience enabling large-scale AI training, inference, or computer vision workloads in GPU clusters.
Deep understanding of petabyte-scale data architecture, including storage federation, high-throughput access, and data locality for AI workloads.
Strong technical leadership with a track record of defining and communicating architectural strategy, balancing long-term vision with delivery needs.
A natural mentor with a history of developing engineers and influencing technical direction across teams.
Advanced degree in Computer Science, Electrical Engineering, or a related field—or equivalent industry experience.
Desirable
Experience with multi-cloud orchestration, particularly in latency- or cost-sensitive training and inference pipelines.
Familiarity with systems like Ray, Kubernetes, Airflow, or Flyte, and deep fluency in AI/ML job scheduling, model lifecycle management, and infrastructure-as-code practices.
Background in supporting safety-critical or real-time inference use cases (e.g., robotics, autonomous vehicles, aerospace).
Passion for building infrastructure-as-a-product that delivers performance and simplicity to research and product teams alike.
This is a full-time role based in-office. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home. This role is a full-time role based in Sunnyvale, CA (hybrid) and the reasonably estimated salary for this role ranges from $370,300 to $418,200, plus a competitive equity package. Actual compensation is based on the
Similar Jobs
Related searches:
Hybrid Jobs
Principal Jobs
Hybrid Principal Jobs
Principal Robotics & AutonomyPrincipal Generative AIPrincipal Machine LearningPrincipal Computer VisionPrincipal AI Infrastructure
AI Jobs in Sunnyvale
Robotics & Autonomy in SunnyvaleGenerative AI in SunnyvaleMachine Learning in SunnyvaleComputer Vision in SunnyvaleAI Infrastructure in Sunnyvale
deep-learningautonomous-vehiclesmlopscomputer-visiongpugenerative-aicloudrobotics
Get jobs like this delivered weekly
Free AI jobs newsletter. No spam.