Young Investigator, FlexOlmo

Allen Institute for AI · Berkeley, CA

full-time junior Posted 7 months ago

Apply Now Stand out: build a proof-of-work pitch →

Free GitHub-based preview. Direct apply stays one click away.

Get weekly job alerts like this →

Hiring for this role?

AI Market Demand Pack · $29 one-time

Compare this role's skills with the full AI hiring market. Get ranked demand, salary bands, leading companies, public source URLs, and a decision brief.

See the live sample →

pytorch deep-learning robotics llm healthcare nlp research

About this role

Persons in these roles are welcome to work remotely from Berkeley, CA. Compensation: $159,650 Who You Are: Ai2 is seeking talented and motivated Postdoctoral Young Investigator to join the FlexOlmo team , working on a series of large language models designed for flexible data use , with a focus on Mixture-of-Experts (MoE) , long-context language models (LCLMs) , and retrieval . Postdoctoral Young Investigators will be based in Berkeley, California . This opportunity offers a unique opportunity to contribute to cutting-edge research in natural language processing and machine learning in an exciting, fast-paced research environment. You have the opportunity to: Define and lead a high-impact research project. Train and release leading models. Collaborate with and learn from team members across Ai2. Build open-source software for the research community. Author scientific papers for publication in a high-profile conference or journal. The Young Investigator position: Duration: 1-3 years Start date: Flexible Candidates: Are within one year of completing their PhD, or already have a PhD The Allen AI Young Investigator is a postdoctoral program offering unique benefits. The program will enable you to balance working collaboratively on an Ai2 project while having opportunities to mentor junior researchers. Who We Are: We design new architectures and training methods that help models use data more effectively—through improved training, inference-time conditioning, and retrieval—broadening the types of data they can leverage and ultimately enhancing performance. We also develop scientific methodologies for evaluating and understanding these systems. Our team produces high-impact research and expertly engineered open-source tools that accelerate NLP research worldwide. We lead the FlexOlmo project, whose first release in July 2025 focused on a new Mixture-of-Experts architecture. Looking ahead, we plan to pursue creative, groundbreaking research that delivers scientific insights and practical solutions for building architectures and training methods that unlock the use of large and diverse data sources. Your Next Challenge: Why FlexOlmo? We are building the foundation for research into the next generation of language models designed for flexible data use. FlexOlmo is a small, tightly knit team, giving you the unique opportunity to work closely with team members toward one high-impact project. We encourage open collaboration projects, even with researchers at external institutions. Team member will be based in Berkeley, with opportunities to engage actively with the University of California, Berkeley, and the BAIR lab. Our pay is competitive, and visa sponsorship is available. We are committed to open science and support students freely publishing papers, as exemplified by our first release: FlexOlmo: Open Language Models for Flexible Data Use . The essential functions include, but are not limited to the following: Dedicated Ai2 mentor who is also a faculty member at the University of California, Berkeley. 50% work on leading and collaborating on an Ai2 project as an independent contributor (IC). 50% work on mentoring junior researchers (PhD students/interns, predoctoral students/interns) as well as opportunities to receive mentorship in academic activities such as grant writing and teaching, if desired. What You’ll Need: Qualifications: Are within one year of completing their PhD, or already have a PhD, in Computer Science or similar field with research experience in machine learning, natural language processing, language and vision, or related areas. Outstanding individual contributor (IC) skills , especially with deep learning frameworks (e.g. PyTorch). An outstanding publication record at AI-related venues, such as NeurIPS, ICLR, ICML, COLM, ACL, EMNLP. We will specifically evaluate the quality of publications in terms of rigor and impact , not the quantity. Extensive research experience in areas such as large language models, training dynamics, scaling laws, and data curation. Experience with mixture-of-experts, long-context language models, and retrieval is preferred but not required. Located [or willing to relocate] in Berkeley, CA. Physical Demands and Work Environment: The physical demands described here are representative of those that must be met by a team member to successfully perform the essential functions of this position. Reasonable accommodations may be made to enable individuals with disabilities to perform the functions. Must be able to remain in a stationary position for long periods of time. The ability to communicate information and ideas so others will understand. Must be able to exchange accurate information in these situations. The ability to observe details at close range. Can work under deadlines. A Little More About Ai2: Ai2 is a Seattle based non-profit AI research institute founded in 2014 by the la