Principal Engineer, Data & Compute

Wayve · Sunnyvale, CA · $370k - $418k

full-time principal Posted 5 days ago

Apply Now Stand out: build a proof-of-work pitch →

Free GitHub-based preview. Direct apply stays one click away.

Get weekly job alerts like this →

Hiring for this role?

deep-learning autonomous-vehicles mlops computer-vision gpu generative-ai cloud robotics

About this role

About us Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems. Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving. In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future. At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact. Make Wayve the experience that defines your career! The Role At Wayve, we are teaching machines to drive—not by coding rules, but by training end-to-end neural networks that learn from vast streams of real-world data. Achieving this requires unprecedented scale in both data infrastructure and compute orchestration. Our workloads span thousands of GPUs, petabytes of driving data, and geographically distributed training and inference clusters. As Architect for AI Infrastructure, you will design and guide the evolution of the foundational compute and storage systems that fuel our model development lifecycle. Your leadership will directly accelerate AI research, enable rapid model deployment, and ensure our platform meets the demands of a company pushing the boundaries of autonomy. You’ll sit at the strategic core of AI, systems, and cloud infrastructure—owning challenges that few companies have the ambition or scale to tackle. Key Responsibilities Global Compute Strategy – Define and evolve the architecture for how Wayve allocates and orchestrates training and inference workloads across thousands of GPUs and multiple data centers, ensuring optimal throughput, resiliency, and cost efficiency. Petabyte-Scale Data Federation – Design systems that enable fast, reliable access to high-volume sensor and simulation data across geographies, ensuring the right data is always available for training, evaluation, and inference. Furthermore, preparing Wayve for being an exabyte-scale company. Cross-Region GPU Job Execution – Build the foundations that enable large-scale AI workloads to run seamlessly across hybrid and multi-cloud environments. Cloud Infrastructure Advisory – Act as a trusted partner to leadership in aligning compute investments and architecture with company strategy, growth plans, and performance goals. Technical Leadership & Mentorship – Uplift the broader engineering org through architectural coaching, technical deep dives, and by cultivating a culture of operational and engineering excellence. About You In order to set you up for success at Wayve, we’re looking for the following skills and experience. Essential 10+ years designing and building large-scale distributed systems, with at least 4 years focused on GPU-based cloud infrastructure. Proven experience enabling large-scale AI training, inference, or computer vision workloads in GPU clusters. Deep understanding of petabyte-scale data architecture, including storage federation, high-throughput access, and data locality for AI workloads. Strong technical leadership with a track record of defining and communicating architectural strategy, balancing long-term vision with delivery needs. A natural mentor with a history of developing engineers and influencing technical direction across teams. Advanced degree in Computer Science, Electrical Engineering, or a related field—or equivalent industry experience. Desirable Experience with multi-cloud orchestration, particularly in latency- or cost-sensitive training and inference pipelines. Familiarity with systems like Ray, Kubernetes, Airflow, or Flyte, and deep fluency in AI/ML job scheduling, model lifecycle management, and infrastructure-as-code practices. Background in supporting safety-critical or real-time inference use cases (e.g., robotics, autonomous vehicles, aerospace). Passion for building infrastructure-as-a-product that delivers performance and simplicity to research and product teams alike. This is a full-time role based in-office. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home. This role is a full-time role based in Sunnyvale, CA (hybrid) and the reasonably estimated salary for this role ranges from $370,300 to $418,200, plus a competitive equity package. Actual compensation is based on the