Assessment Designer & Learning Analyst

Mercor · Mercor HQ, 181 Fremont Street

full-time junior Posted 15 hours ago

Apply Now Stand out: build a proof-of-work pitch →

Free GitHub-based preview. Direct apply stays one click away.

Get weekly job alerts like this →

Hiring for this role?

About this role

ABOUT MERCOR Mercor's mission is to organize human intelligence to power the AI economy. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development. Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $3 million a day. Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society. Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our San Francisco, NYC, or London offices. We're looking for an Assessment Designer & Learning Analyst who can build rigorous measurement systems and use data to understand what actually drives expert performance. This is not an instructional design role. You won't be building courses or writing training materials. You will be designing the assessments and certification frameworks that measure whether our talent experts and internal teams are genuinely skilled — and then doing the analytical work to understand what those assessments reveal, what predicts expert effectiveness, and how our programs should evolve based on evidence. You will be working closely with the Learning & Development team to understand the relationship between materials and assessments, and making recommendations to the team based on your analysis. If you've come from an ed school background, taught in a high-accountability environment, and completed quantitative projects or theses, and are energized by the measurement and data side of education — this role is for you. WHAT YOU'LL DO Assessment Design - Design and continuously improve assessments and certification frameworks that validly and reliably measure expert readiness for specific project types - Build assessments and measurements of skills that are consistent, interpretable, and actually predictive of on-the-job performance — not just checklists. - Develop item banks, scoring guides, and inter-rater reliability protocols for evaluating complex human judgment tasks. - Run validity studies: do our assessments measure what we think they measure? Learning Analytics & Impact Analysis - Analyze the relationship between instructional materials, assessments, and expert performance — identifying what's working and what isn't and make recommendations accordingly. - Analyze assessment data at the item level — difficulty, discrimination, reliability — and iterate based on findings. - Investigate the relationship between assessment performance and real-world expert effectiveness: who performs well on our assessments, and does that predict quality outcomes? - Build reports and dashboards that surface actionable insights to program and operations teams. - Design and analyze quasi-experimental, quantitative and qualitative (mixed methods) studies to understand what interventions actually move the needle on expert quality. Ongoing Measurement & Improvement - Track certification and assessment outcomes over time and flag when programs need revision - Partner with learning designers and project teams to translate your findings into program improvements - Bring a continuous improvement mindset — ship, measure, learn, iterate WHAT WE'RE LOOKING FOR Education - Master's degree in Learning Sciences, Educational Psychology, Educational Measurement, Psychometrics, or a closely related field — required - Coursework in quantitative research methods, psychometrics, and educational statistics — required - Familiarity with classical test theory (CTT) and ideally item response theory (IRT) Quantitative Skills — Required This role requires genuine comfort with numbers. We're looking for someone who can do the following and show their work: - Item-level analysis: difficulty index, discrimination index, inter-rater reliability (Cohen's kappa, Krippendorff's alpha, ICC) - Assess and report on assessment validity and reliability — and know what to do when results look off - Analyze relationships between variables: correlation, regression, and basic predictive modeling - Work fluently in Excel or Google Sheets for data cleaning and summaries - Use Python, STATA or R for deeper analysis (basic proficiency expected; we'll grow this with you) - Translate quantitative findings into plain-language recommendations for non-technical stakeholders We will ask you to demonstrate this. Finalists will complete a short take-home exercise involving a real assessment dataset — you'll analyze item performance, identify problems, and recommend improvements. Experience - 1–2 years o