Senior Backend Engineer, AI Platform
full-time
senior
Posted 1 week ago
Apply Now
Stand out: build a proof-of-work pitch →
Free GitHub-based preview. Direct apply stays one click away.
Get weekly job alerts like this →Hiring for this role?
About this role
WHAT MAKES US EPIC?
At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating.
Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.
Online Infrastructure
What We Do
Epic Games is the creator of Fortnite, Unreal Engine, and the Epic Games Store — a company that has shaped the way billions of people play, create, and connect. We build foundational technology that powers some of the largest interactive experiences on the planet, and we give those tools to creators and developers around the world through open ecosystems.
Our AI Platform team is building an enterprise-grade stack of agentic AI systems that automates engineering workflows, accelerates developer productivity, and enables new kinds of collaboration across Epic's teams. We're not deploying off-the-shelf tools — we're building production systems from the ground up, across six interconnected platforms:\
Geppetto — multi-tenant platform for team AI agents that live and collaborate in Slack channels
EMA (Epic Managed Agents) — compute and workspace infrastructure for headless agent harness runs at scale
Hodor — MCP OAuth gateway, plugin runtime, and governance layer for AI tool orchestration (1,500+ MAU, 700K+ tool executions)
Multipass — agent identity, credential vault, and authorization for non-human workloads
Vektor — org-wide memory plane with knowledge graph, deductive reasoning, and hierarchical summarization
Roost — cryptographically signed software distribution and the Claude Code plugin marketplace
This is foundational work with real production usage and real consequences. The stack is live, growing, and touching every corner of Epic's engineering organization.
What You'll Do
As a Senior Engineer on the AI Platform team, you'll be a primary contributor and technical owner across one or more of the platforms in our agent stack. You'll take complex, loosely-defined problems — a new plugin runtime, an agent compute provider, a memory ingestion pipeline — and drive them from design to production with limited oversight. You'll work closely with the team's principal engineer and technical lead to make sure your implementations are consistent with the broader architecture, and you'll be the person others turn to when the work gets technically hard.
This role is for an engineer who wants to go deep. The problems span distributed systems, security, LLM integration, and developer tooling — and the surface area is large enough that you'll have genuine ownership, not just ticket work.
In this role, you will
Own the design and implementation of major features and subsystems across the AI Platform stack — from Hodor's plugin runtimes and credential manager to Geppetto's agent-service LLM dispatch and session lifecycle
Build and harden EMA's worker layer: workspace materialization, harness lifecycle management, normalized event streaming, and mid-run input handling across compute backends
Implement production-grade components for Vektor's memory pipeline — ingestion workers, knowledge graph writes, semantic search, and nightly consolidation jobs
Contribute to Roost's publish and consume pipelines: TUF signing, artifact storage, marketplace generation, and plugin signature verification
Implement credential manager components with rigor: AES-256-GCM encryption, AAD binding, scope isolation, and audit trail completeness
Write services that operate correctly under failure: circuit breakers, rate limiters, DLQ handling, and idempotent replay patterns
Contribute to Multipass as it moves from strategy to implementation — workload identity, token broker, policy plane — under the guidance of the principal engineer
Participate in on-call and incident response for Hodor and other production systems, building operational intuition alongside engineering depth
Write design documents for the features you own — clear enough for async review, precise enough to serve as the implementation spec
Collaborate across the team on cross-cutting concerns: NATS JetStream event bus patterns, multi-tenant isolation, RBAC enforcement, and observability
Review code from peers with the goal of raising quality and spreading knowledge, not just catching bugs
Surface architectural concerns early and engage constructively with the principal engineer and team lead when your implementation work reveals design gaps
What we're looking for
7+ years of software engineering experience, with a track record of owning and shipping complex backend systems
Strong distributed systems fundamentals: service design, event-driven architecture, fai
Similar Jobs
Related searches:
Get jobs like this delivered weekly
Free AI jobs newsletter. No spam.