Staff Machine Learning Engineer (Research Scientist) - DFAI

Plaid · DEU

full-time lead Posted 1 month ago

Apply Now Stand out: build a proof-of-work pitch →

Free GitHub-based preview. Direct apply stays one click away.

Get weekly job alerts like this →

Hiring for this role?

AI Market Demand Pack · $29 one-time

Compare this role's skills with the full AI hiring market. Get ranked demand, salary bands, leading companies, public source URLs, and a decision brief.

See the live sample →

pre-training payments distributed-systems generative-ai llm fine-tuning machine-learning research

About this role

We believe that the way people interact with their finances will drastically improve in the next few years. We’re dedicated to empowering this transformation by building the tools and experiences that thousands of developers use to create their own products. Plaid powers the tools millions of people rely on to live a healthier financial life. We work with thousands of companies like Venmo, SoFi, several of the Fortune 500, and many of the largest banks to make it easy for people to connect their financial accounts to the apps and services they want to use. Plaid’s network covers 12,000 financial institutions across the US, Canada, UK and Europe. Founded in 2013, the company is headquartered in San Francisco with offices in New York, Washington D.C., London and Amsterdam. We are the Data Foundation & AI team within Plaid’s Data organization. Our mission is to build the shared ML and AI infrastructure that powers intelligent capabilities across Plaid’s product suite. We develop the foundational systems, models, and data assets that transform Plaid’s unique financial network data into scalable, general-purpose representations that teams across the company can leverage. Our work spans the full ML lifecycle — from large-scale data curation and model pretraining to production serving, evaluation, and monitoring. As part of the team, you’ll work at the intersection of machine learning infrastructure, applied AI, and distributed systems, helping establish the core AI platform that enables innovation across Plaid. As a Staff Machine Learning Engineer, you will lead the technical strategy and development of Plaid’s foundation models, driving key decisions across pretraining objectives, model architecture, and fine-tuning approaches that power a wide range of downstream product applications. You will serve as the technical lead for the full machine learning lifecycle, overseeing everything from data curation and experimentation to production deployment, feature management, and observability. In this role, you will establish rigorous evaluation frameworks to measure model performance across diverse use cases and build scalable, repeatable pipelines that translate research into production impact. You will also partner closely with teams across the organization to define how products integrate with and adapt foundation models, enabling reusable ML infrastructure and reducing duplicated modeling efforts. As a senior technical leader, you will mentor engineers across experience levels, elevate engineering and experimentation standards, and communicate technical advancements both internally and externally as a representative of Plaid’s AI and machine learning capabilities. Responsibilities: - Owning the end-to-end technical strategy for a foundation model built on one of the world's richest financial datasets, from pretraining architecture to production serving. - Doing research that ships: driving decisions from experimentation through production systems that serve real customers and power multiple product teams. - Working across the full ML stack, including pretraining objectives, architecture design, distributed training, serving infrastructure, monitoring, and cross-team integration. - Setting technical direction and mentoring a high-caliber team, with your work amplifying the capabilities of engineers and product teams across Plaid. - Helping hundreds of millions of consumers achieve greater financial freedom through the ML capabilities you build and ship. Qualifications: - MS: 7–12+ years of industry experience with a demonstrated track record of technical leadership and production delivery. - PhD: 5–9+ years of industry experience with evidence of technical leadership (tech lead, principal/staff-equivalent roles) and end-to-end production ownership. - Prior technical leadership experience (tech lead, principal, or staff) with demonstrated cross-team influence and mentorship. - Deep expertise in Transformers/LLMs/Foundation Models, including large-scale training or domain adaptation. - End-to-end production ownership; proven track record shipping models through training, serving, monitoring, and iteration in live environments. - Distributed training experience and strong Python + software engineering fundamentals at a staff level. - Ability to drive technical alignment across teams: setting standards, defining integration patterns, and influencing beyond your immediate scope. - Fintech / financial data domain experience - Nice to have - External publications or open-source contributions - Nice to have - Experience defining ML platform capabilities (serving infra, feature stores) used across multiple teams. - Nice to have Our mission at Plaid is to unlock financial freedom for everyone. To support that mission, we seek to build a diverse team of driven individuals who care deeply about making the financial ecosystem more equitable. We recognize that strong quali