Principal Software Engineer, AI Observability & Evals Platform

LangChain · Boston, MA · $230k - $270k
Full-time · Lead · Posted 16 hours ago

About this role

ABOUT US

At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to also offer a platform for building, evaluating, deploying, and operating agents at scale.

With $125M raised at Series B from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we're at a stage where we're continuing to develop new products, growth is accelerating, and all team members have meaningful impact on what we build and how we work together. LangChain is a place where your contributions can shape how this technology shows up in the real world.

Today, LangChain, LangGraph, LangSmith, and Fleet are used by teams shipping real AI products across startups and large enterprises. Millions of developers trust LangChain to power AI teams at companies like Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.

ABOUT THE TEAM

The LangSmith team owns and builds LangChain's core platform for observability, evaluation, and production reliability of AI systems. From tracing and annotation to run rules, evaluations, and beyond, they own this end-to-end. If you want to help define what great AI observability looks like at production scale, this is where that work gets done.

ABOUT THE ROLE

We're looking for a Principal/Lead level Software Engineer to join the LangSmith team and help drive the technical direction of the platform. You'll build across the full stack from backend services and APIs to frontend product surfaces, and you'll play a central role in shaping how we build: setting engineering standards, mentoring engineers across the team, and making architectural decisions that hold up as we scale.
If you're energized by both hands-on engineering and the multiplier effect of leveling up those around you, this role is built for that.

Location: This role can be based in our Boston, San Francisco, or NYC office.

WHAT YOU'LL DO

DRIVE TECHNICAL DIRECTION
- Lead architectural decisions across our Go, Python, and TypeScript stack, ensuring systems are performant, maintainable, and built to scale
- Work across the full stack, owning features end-to-end from backend services and APIs through to frontend product experiences
- Drive tracing, monitoring, and evaluation workflows at scale, with a focus on reliability and query performance across high-volume data
- Help shape the product roadmap by partnering closely with product and design, not just executing on it

RAISE THE BAR FOR THE TEAM
- Set engineering standards for the team: define patterns, lead code reviews, and establish the foundations others build on
- Mentor and grow engineers at all levels through code review, design feedback, pairing, and ongoing technical guidance
- Drive projects from ambiguity to delivery while maintaining high engineering standards and aggressive timelines

OWN RELIABILITY AND QUALITY
- Troubleshoot and resolve production issues with a root-cause mindset, and implement durable fixes
- Ensure system reliability through strong testing, monitoring, and alerting practices
- Create and maintain technical documentation, including system design docs and API references

WHAT YOU'LL BRING
- 10+ years of professional experience in backend or fullstack engineering on highly complex production systems
- Strong programming skills across multiple parts of the stack: backend (Python and/or Go) and frontend (TypeScript, React, or similar)
- Demonstrated experience making and owning architectural decisions, including tradeoffs around data systems, APIs, and service reliability
- Experience with high-throughput or mission-critical systems, and a proven ability to optimize for performance and reliability
- Depth in operationalizing technical work: you've taken systems from prototype to production and kept them running well at scale
- Demonstrated track record of mentoring engineers and raising the technical quality of a team, not just the codebase
- Strong communication skills and comfort operating cross-functionally with product, design, and engineering leadership
- Customer centricity and an ownership mentality: you care how the product lands, not just how the code reads
- You exemplify our operating principles (https://www.langchain.com/careers)

NICE TO HAVE
- Experience with database systems (Postgres, Redis, ClickHouse) and cloud platforms (AWS, GCP, or Azure)
- Familiarity with observability tooling, evaluation frameworks, or AI/LLM infrastructure

Salary Range: $230,000 - $270,000

Compensation Philosophy: We offer competitive compensation that includes base salary, variable compensation for relevant roles, meaningful equity, benefits, and perks. Actual compensation and offerings will vary based on role, level, and location.
