Principal Engineer, AI Platform

Epic Games · Multiple Locations · $223k - $327k
Full-time · Principal · Posted 1 week ago

About this role

WHAT MAKES US EPIC?

At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating. Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.

ONLINE INFRASTRUCTURE

What We Do

We enable Epic’s online services teams to build, deploy, and manage services that are used by more than half a billion players around the world. Our mission is to provide world-class tools and platforms to improve the experience of our developers and make it easier, faster, and safer to build, operate, and scale their applications. We operate at massive scale as one of the largest cloud computing users in the world.

What You'll Do

Epic Games, the creator of Fortnite, Unreal Engine, and the Epic Games Store, has shaped the way billions of people play, create, and connect. We build the foundational technology that powers some of the largest interactive experiences on the planet, and we give those tools to creators and developers around the world through open ecosystems. Behind the products is an engineering organization that operates at rare scale: real-time infrastructure for hundreds of millions of players, a game engine used across film, architecture, and automotive industries, and a platform ecosystem that lets indie developers ship games to every major device on Earth.

Our AI Platform team is building the next layer of that infrastructure: an enterprise-grade stack of agentic AI systems that automates engineering workflows, accelerates developer productivity, and enables new kinds of collaboration across Epic's teams. We're not a research group and we're not deploying off-the-shelf tools.
We're architecting and building production systems from the ground up, across six interconnected platforms:

AI Agent Orchestration: multi-tenant platform for team AI agents that live and collaborate in Slack channels, Source Control, etc.
EMA (Epic Managed Agents): compute and workspace infrastructure for headless agent harness runs at scale
AI MCP Gateway: MCP OAuth gateway, plugin runtime, and governance layer for AI tool orchestration
Non Human Identity Management: agent identity, credential vault, and authorization for non-human workloads
Centralized AI Knowledge Base: org-wide memory plane with knowledge graph, deductive reasoning, and hierarchical summarization
Roost: cryptographically signed software distribution and the Claude Code plugin marketplace

This is foundational work that will define how AI is used inside Epic for the next decade. The scale is real, the problems are hard, and the team is small enough that every engineer makes a decisive architectural impact.

As a Principal Engineer on the AI Platform team, you'll own the technical direction of our agent infrastructure stack end to end. You'll set the architecture across the six platforms above, drive alignment between them, and personally solve the hardest distributed systems and security problems that emerge as the stack scales. You'll work across teams to ensure agent identity, tool governance, memory, and execution infrastructure are coherent, secure, and operable, and you'll mentor the engineers who build alongside you.

This isn't a coordinator role. You'll write production code, design protocols, make the calls that determine how agents authenticate and what they're allowed to do, and be accountable for the reliability of systems that are actively used by Epic's engineering organization.
In this role, you will

Platform Architecture & Technical Leadership:

Own the end-to-end technical architecture across Epic's AI Infrastructure Platforms, ensuring each platform is coherent with the others and that the integration seams are well-defined
Drive architectural decisions for agent identity and workload authorization (SPIFFE/SPIRE, OIDC, token exchange, policy planes), translating security requirements into implementable designs
Establish the patterns for how AI agents authenticate, receive credentials, execute tools, and are audited, and hold the bar for correctness across the stack
Lead design reviews for new capabilities, evaluate build vs. buy decisions, and surface technical risk before it becomes production risk

Distributed Systems & Infrastructure:

Design and implement the Cluster API and provider abstractions for EMA, the layer that orchestrators depend on to launch, drive, and recover headless agent runs across Kubernetes, EC2, and other compute backends
Evolve Epic's AI MCP Gateway plugin runtime (WASM, gRPC sidecar, subprocess multiplexer) and its gateway security posture as external tool surface are
