Machine Learning Infra Engineer

Reducto · San Francisco, CA
full-time mid Posted 1 day ago

About this role

ABOUT REDUCTO Reducto helps AI teams ingest real world enterprise data with state of the art accuracy. The vast majority of enterprise data — from financial statements to health records — is locked in unstructured file formats like PDFs and spreadsheets. We train vision models to read those documents the way a human would, and make it possible to build products, train models, and automate processes at scale. We’ve grown incredibly quickly, growing revenue by 7x YOY, and now work with hundreds of companies ranging from leading AI teams (Harvey, Vanta, Scale), through to enterprise (FAANG, top 3 trading firm). We're raised over 100M from world class investors like A16z, Benchmark, and First Round Capital, and are hiring a Machine Learning Engineer to help us train and deploy the models critical to the performance of our core product. THE OPPORTUNITY As an ML Infra Engineer, you’ll play a key role in building the inference and training frameworks that make it possible to deliver results at scale. You’ll collaborate closely with our ML and Platform teams to scale training across nodes, develop faster and more efficient serving, and create observability across the stack. This is a high-impact role where you’ll help define what high performance ML training and inference look like at Reducto.   WHAT YOU’LL DO - Build, and maintain our training and inference stack with an emphasis for fast iteration on training + flexibility for exploring new methods and high performance in inference. - Develop benchmarks for both sets of stacks to identify bottlenecks. - Explore SOTA advances in training and inference and work to apply them. - Design systems for scaling model training across multi-node, multi-GPU environments with strong reliability and observability. - Scale distributed training and inference workloads across large GPU clusters while improving utilization, reliability, and cost efficiency. - Build the tooling, abstractions, and observability that help ML engineers move faster from experiment to production.   YOU’LL THRIVE HERE IF YOU: - Hold yourself to a high bar for quality and precision. - Enjoy solving complex problems and building from first principles. - Have strong Python skills + a background in systems engineering. - Are comfortable with Kubernetes and distributed training frameworks. - Love getting your hands dirty with real-world implementation challenges. - Operate well in fast-changing, high-growth environments. - Collaborate effectively across technical and non-technical teams. - Take full ownership from strategy through execution.   BONUS POINTS IF YOU: - Have experience at an early-stage or high-growth startup. - Have developed in open source training/inference stacks in a meaningful way. - Are excited to set up distributed inference across 100s-1000s of GPUs. - Care deeply about combining technical excellence with business impact. This is an in person role at our office in SF. We’re an early stage company which means that the role requires working hard and moving quickly. Please only apply if that excites you.   MORE ABOUT REDUCTO Nearly 80% of enterprise data is in unstructured formats like PDFs   PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week https://www.reducto.ai/blog/the-real-cost-of-manual-document-processing.   Traditional approaches fail at reliably extracting information in complex PDFs   OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns are jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.   Reducto https://www.reducto.ai/ breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you: - Accurately extract text and tables even with nonstandard layouts - Automatically convert graphs to tabular data and summarize images in documents - Extract important fields from complex forms with simple, natural language instructions - Build powerful retrieval pipelines using Reducto’s document metadata - Intelligently chunk information using the document’s layout data   BENEFITS AT REDUCTO At Reducto, we’re invested in the well-being and growth of our team. Here’s what we currently offer: - Unlimited PTO: We believe great work requires

Similar Jobs

Related searches:

On-site Jobs Mid-Level Jobs On-site Mid-Level Jobs Mid-Level AI InfrastructureMid-Level Backend & SystemsMid-Level Data ScienceMid-Level Machine LearningMid-Level NLP & Language AI AI Jobs in San Francisco AI Infrastructure in San FranciscoBackend & Systems in San FranciscoData Science in San FranciscoMachine Learning in San FranciscoNLP & Language AI in San Francisco llmgpudistributed-systemsmachine-learninginfrastructure

Get jobs like this delivered weekly

Free AI jobs newsletter. No spam.