Infrastructure Engineer - Member of Technical Staff

Simile · Palo Alto, CA · $200k - $400k
Full-time · Lead · Posted 1 month ago

About this role

ABOUT THE COMPANY
Pilots don’t train with real passengers. Surgeons don’t practice on real people. Yet, the most consequential decisions in society are often pushed straight to production. Simile is changing that. We have built the first AI simulation of society, populated by generative agents based on real humans. Our research pioneered the field of AI-based simulation, proving it is possible to model human behavior with high accuracy. Today, we are developing a Foundation Model to predict human behavior in any situation, at any scale. We are backed by $100M in funding led by Index Ventures, with participation from Hanabi, A*, Bain Capital Ventures, and AI visionaries including Andrej Karpathy, Fei-Fei Li, Adam D’Angelo, and Guillermo Rauch.

ABOUT THE TEAM
The Infrastructure team is the backbone of our platform. We build the foundational systems that allow our AI agents to operate at scale with uncompromising security. We operate at the intersection of high-scale cloud networking, distributed systems, and enterprise-grade privacy.

WE ORGANIZE OUR WORK INTO THREE CORE PILLARS:
- Cloud Foundation: Managing our multi-cloud footprint (AWS/GCP) with a focus on high availability, cost-efficiency, and Infrastructure-as-Code.
- Enterprise Deployments: Building the "paved paths" for VPC peering, PrivateLink, and BYOC (Bring Your Own Cloud) architectures for our largest customers.
- Platform & Reliability: Developing the CI/CD pipelines and observability stacks (p99 latency tracking, SLOs) that empower our entire engineering org to ship safely.

ABOUT THE ROLE
We are looking for an Infrastructure Engineer who thrives on the complexity of modern deployment patterns. You will own the infrastructure roadmap from design to operation, ensuring our platform is resilient, compliant, and ready for global scale.
RESPONSIBILITIES
- Architect Multi-Cloud Environments: Design and scale multi-region architectures across AWS and GCP to support global data residency and failover requirements.
- Enable Engineering Velocity: Partner cross-functionally with Product Engineering, Research, and Security teams to build internal tooling and "paved paths" that accelerate development velocity and empower every engineer to ship with confidence.
- Own Enterprise Connectivity: Build and automate secure networking solutions, including VPC peering, PrivateLink, and dedicated interconnects for customer-managed environments.
- Drive Reliability: Set and maintain strict SLOs. You’ll optimize networking paths and resource allocation to ensure our real-time AI features hit their latency targets.
- Champion GitOps: Manage our entire stack via Terraform/Pulumi, ensuring that "the code is the truth" across all environments.
- Security & Compliance: Implement "security-by-design," focusing on encryption at rest and in transit and on identity management (SAML/SCIM) to meet SOC2 and HIPAA standards.

REQUIREMENTS

MUST HAVES
- 5+ years of experience building production-grade infrastructure in a high-growth environment.
- Cloud Polyglot: Deep expertise in AWS is required; experience with GCP or Azure is a major plus.
- Networking Guru: Deep understanding of DNS, Load Balancing, Service Meshes, and complex VPC routing.
- IaC Architect: Proven track record managing large-scale environments using Terraform, Pulumi, or other production-ready, battle-tested IaC tools, with a focus on reusable, modular infrastructure.
- Operational Mindset: Experience with modern observability (Datadog, OpenTelemetry) and a "you build it, you run it" mentality.
- Communication: Ability to write clear technical specs for both internal teams and external customers.
NICE TO HAVES
- AI/ML Infrastructure: Experience building or scaling infrastructure for AI/ML workloads, specifically high-throughput inference systems or GPU-accelerated computing.
- Kubernetes Mastery: Strong K8s (EKS/GKE) experience, specifically around multi-tenant security and resource isolation.

COMPENSATION & BENEFITS
At Simile, we provide competitive compensation packages that include base salary, equity, and comprehensive benefits.
- Salary Range: $200,000 – $400,000 USD
- Note: Final offers are based on experience, specialized skills, interview performance, and relevant training.
- Equity: Grants are available for eligible roles, subject to board approval.
- Health & Wellness: Comprehensive medical, dental, and vision coverage.
- Time Off: Flexible time off policies to support work-life balance.

OUR PROCESS
We prioritize thoughtful conversations and clear examples of past work. Our hiring journey is designed to help both sides align on fit, working style, and expectations.

Reapplication Policy: To ensure a fair and thorough evaluation for all applicants, Simile observes a 90-day waiting period before reconsidering candidates for the same role.

COMMITMENT TO DIVERSITY & INCLUSION
Equal Opportunity: Simile is an equal opportunity wor
