Amy Deng (former)

Amy Deng

Former Technical Staff

Amy was interested in eliciting and evaluating the capabilities of frontier models. Prior to joining METR, she was a founding engineer at an ML Infra startup and studied Electrical Engineering & Computer Science at UC Berkeley.

By Amy Deng

Analyzing coding agent transcripts to upper bound productivity gains from AI agents

February 17, 2026

Amy Deng investigates whether coding agent transcripts could serve as an alternative for estimating AI productivity uplift, using 5305 Claude Code transcripts from METR technical staff.

CoT May Be Highly Informative Despite “Unfaithfulness”

August 8, 2025

Recent work from Anthropic and others claims that LLMs' chains of thoughts can be “unfaithful”. These papers make an important point: you can't take everything in the CoT at face value. As a result, people often use these results to conclude the CoT is useless for analyzing and monitoring AIs. Here, instead of asking whether the CoT always contains all information relevant to a model's decision-making in all problems, we ask if it contains enough information to allow developers to monitor models in practice. Our experiments suggest that it might.

HCAST: Human-Calibrated Autonomy Software Tasks

March 17, 2025

Sharing details about HCAST (Human-Calibrated Autonomy Software Tasks), a benchmark we’ve been developing to measure the abilities of frontier AI systems to complete diverse software tasks autonomously.