International Programme on AI Evaluation: Capabilities & Safety

AI is advancing faster than our ability to evaluate it. We are changing that.

This programme brings together 40 exceptional students and professionals from around the world for a 150-hour hybrid course that blends lectures, hands-on labs, and a capstone project week in Valencia.

Fully funded through Open Philanthropy and certified by ValgrAI, it is the first step toward establishing AI Evaluation & Safety as a formal academic discipline.

Format at a Glance

150 hours hybrid programme. From February to May 2026

  • 90 hours of online lectures, seminars, and activities.

  • 20 hours of hands-on courses to apply evaluation methods to real-world AI systems.

  • 40 hours in-person capstone week in Valencia, Spain. Collaborative project work with academic mentorship, guest keynotes, and practical labs.

Student Experience

  • Community: Cohort of 40 top international participants, fostering collaboration and peer learning.

  • Mentorship: Direct guidance from leading researchers in academia, industry, and government.

  • Credential: 15 ECTS Expert Diploma from ValgrAI.

  • Scholarship: All students receive full funding, including travel and accommodation support for the in-person week.

Abstract collage with black and white photos, geometric shapes, text, and digital elements.

Curriculum Highlights

Participants will explore the following modules, each with an evaluation activity designed to provide hands-on experience.

  • Introduction to AI Evaluation

  • AI Architectures: Large Language Models and Beyond

  • Metrics and Experimental Methodology

  • Benchmarks, Leaderboards, and Competitions

  • Red-teaming Evaluations

  • Construct-Based Evaluation

  • Mechanistic Interpretability

  • AI Alignment and Control Evaluations

  • Governance, Policy, and Regulation of AI

  • Real-world Evaluations: Societal Impacts of AI

The programme culminates in a Capstone Project, where teams design and carry out an original evaluation study under the guidance of expert mentors, to produce publishable-quality work.

What You’ll Gain

By the end of the programme, you will:

  • Build core expertise in AI evaluation methods, from benchmarks and interpretability to red-teaming, governance, and societal impact.

  • Apply these skills in practice, evaluating real-world systems and completing a mentored capstone project.

  • Position yourself for impact in AI Safety Institutes, frontier labs, and policy bodies, or as researchers continuing into PhD and postdoctoral work.

  • Join a global network of peers and faculty shaping the future of AI evaluation and safety.

How We Are Different

This is not a bootcamp, an online course, nor a focused research program.

This is the first global academic programme focused on AI evaluations.

We offer you a comprehensive overview of the tools, and practical frameworks used to test, interpret, and govern advanced AI systems.

We show you the map, so you can connect the dots across disciplines: learning how technical evaluation, interpretability, governance, and policy interact.

A digital collage featuring a partial view of the moon, abstract geometric shapes, digital textures, and black-and-white photographic elements.

Shape the future of AI evaluation.

Applications are open now!

Learn More