Senior Backend Engineer, AI Platform

Epic Games · Multiple Locations
Full-time · Senior · Posted 1 week ago

About this role

WHAT MAKES US EPIC?

At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating. Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.

Online Infrastructure

What We Do

Epic Games is the creator of Fortnite, Unreal Engine, and the Epic Games Store, a company that has shaped the way billions of people play, create, and connect. We build foundational technology that powers some of the largest interactive experiences on the planet, and we give those tools to creators and developers around the world through open ecosystems.

Our AI Platform team is building an enterprise-grade stack of agentic AI systems that automates engineering workflows, accelerates developer productivity, and enables new kinds of collaboration across Epic's teams. We're not deploying off-the-shelf tools; we're building production systems from the ground up, across six interconnected platforms:

- Geppetto: multi-tenant platform for team AI agents that live and collaborate in Slack channels
- EMA (Epic Managed Agents): compute and workspace infrastructure for headless agent harness runs at scale
- Hodor: MCP OAuth gateway, plugin runtime, and governance layer for AI tool orchestration (1,500+ MAU, 700K+ tool executions)
- Multipass: agent identity, credential vault, and authorization for non-human workloads
- Vektor: org-wide memory plane with knowledge graph, deductive reasoning, and hierarchical summarization
- Roost: cryptographically signed software distribution and the Claude Code plugin marketplace

This is foundational work with real production usage and real consequences.
The stack is live, growing, and touching every corner of Epic's engineering organization.

What You'll Do

As a Senior Engineer on the AI Platform team, you'll be a primary contributor and technical owner across one or more of the platforms in our agent stack. You'll take complex, loosely-defined problems (a new plugin runtime, an agent compute provider, a memory ingestion pipeline) and drive them from design to production with limited oversight. You'll work closely with the team's principal engineer and technical lead to make sure your implementations are consistent with the broader architecture, and you'll be the person others turn to when the work gets technically hard.

This role is for an engineer who wants to go deep. The problems span distributed systems, security, LLM integration, and developer tooling, and the surface area is large enough that you'll have genuine ownership, not just ticket work.

In this role, you will

- Own the design and implementation of major features and subsystems across the AI Platform stack, from Hodor's plugin runtimes and credential manager to Geppetto's agent-service LLM dispatch and session lifecycle
- Build and harden EMA's worker layer: workspace materialization, harness lifecycle management, normalized event streaming, and mid-run input handling across compute backends
- Implement production-grade components for Vektor's memory pipeline: ingestion workers, knowledge graph writes, semantic search, and nightly consolidation jobs
- Contribute to Roost's publish and consume pipelines: TUF signing, artifact storage, marketplace generation, and plugin signature verification
- Implement credential manager components with rigor: AES-256-GCM encryption, AAD binding, scope isolation, and audit trail completeness
- Write services that operate correctly under failure: circuit breakers, rate limiters, DLQ handling, and idempotent replay patterns
- Contribute to Multipass as it moves from strategy to implementation (workload identity, token broker, policy plane) under the guidance of the principal engineer
- Participate in on-call and incident response for Hodor and other production systems, building operational intuition alongside engineering depth
- Write design documents for the features you own: clear enough for async review, precise enough to serve as the implementation spec
- Collaborate across the team on cross-cutting concerns: NATS JetStream event bus patterns, multi-tenant isolation, RBAC enforcement, and observability
- Review code from peers with the goal of raising quality and spreading knowledge, not just catching bugs
- Surface architectural concerns early and engage constructively with the principal engineer and team lead when your implementation work reveals design gaps

What we're looking for

- 7+ years of software engineering experience, with a track record of owning and shipping complex backend systems
- Strong distributed systems fundamentals: service design, event-driven architecture, fai
