Updates

Red-teaming and security suggestions regarding proposed rule by the Bureau of Industry and Security, “Establishment of Reporting Requirements for the Development of Advanced Artificial Intelligence Models and Computing Clusters.”

New Support Through The Audacious Project

9 October 2024

Funding for Canary will enable research and implementation at scale.

Response to U.S. AISI Draft “Managing Misuse Risk for Dual-Use Foundation Models”

8 September 2024

Suggestions for expanded guidance on capability elicitation and robust model safeguards in the U.S. AI Safety Institute’s draft document “Managing Misuse Risk for Dual-Use Foundation Models” (NIST AI 800-1).

Response to NIST Draft Generative AI Profile

2 June 2024

Comments on NIST’s draft document “AI Risk Management Framework: Generative AI Profile.”

ML Engineers Needed for New AI R&D Evals Project

16 May 2024

METR is hiring ML engineers and researchers.

Emma Abele is METR’s new Executive Director

26 April 2024

Emma moves from President to Executive Director; Beth moves to Head of Research.

2023 Year In Review

7 February 2024

A summary of what METR accomplished in 2023 – our first full year of operation.

Bounty: Diverse hard tasks for LLM agents

16 December 2023

METR (formerly ARC Evals) is looking for (1) ideas, (2) detailed specifications, and (3) well-tested implementations for tasks to measure the performance of autonomous LLM agents.

ARC Evals is now METR

4 December 2023

ARC Evals is wrapping up our incubation period at ARC and spinning off into our own standalone nonprofit.

Responsible Scaling Policies (RSPs)

26 September 2023

We describe the basic components of Responsible Scaling Policies (RSPs) as well as why we find them promising for reducing catastrophic risks from AI.

ARC Evals is spinning out from ARC

19 September 2023

ARC Evals plans to spin out from the Alignment Research Center (ARC) in the coming months and become its own standalone organization.

Response to RfC on AI Accountability Policy

11 June 2023

Input to NTIA’s AI Accountability Policy Request for Comment.