AI Coding Agents Autonomously Program and Train Physical Robots Without Human Supervision
What
Two distinct systems for autonomous robot training emerged in the same week in June 2026. NVIDIA's ENPIRE framework, built with Carnegie Mellon University and UC Berkeley, uses AI coding agents to run robot training experiments overnight without human supervision — demonstrating robotic arms learning to cut zip ties and install GPUs into motherboards [1]. Separately, Anthropic's Project Fetch Phase 2 showed Claude Opus 4.7 independently programming a robot dog in 12 minutes and 7 seconds, approximately 20x faster than a human-assisted team completed the same task in the prior year [3][2].
Why it matters
If AI agents can autonomously run the trial-and-error loops that robot training requires — writing code, interpreting sensor feedback, adjusting policies, and repeating — the pace at which robots acquire new skills could increase without proportional human labor. Both NVIDIA and Anthropic demonstrate this on real hardware, not simulation.
Open questions
How well do ENPIRE-trained robot policies generalize to tasks or hardware configurations not seen during the autonomous overnight training runs? [1]
With no human check during overnight runs, what happens when an AI coding agent designs and iterates on a flawed experiment for hours before a researcher reviews the morning report? [1]
Project Fetch Phase 2 completed hardware integration in 12 minutes — but how does Claude Opus 4.7 perform on tasks requiring recovery from unexpected hardware states or sensor failures? [3][2]
One social observer argues that every major AI lab except OpenAI and Anthropic is building physical AI [9] — though Project Fetch Phase 2 contradicts the Anthropic half of this claim. Will either lab announce a sustained robotics research program rather than periodic experiments?
Narrative
NVIDIA's GEAR lab, working with Carnegie Mellon University and UC Berkeley, released ENPIRE — a framework that wraps AI coding agents with memory, context management, constraint handling, and feedback loops to run robot training experiments autonomously [1]. Researchers set tasks, then let the system operate overnight: by morning, a report describes what the agents tried and how robot performance changed. The system has been used to train robotic arms on dexterous manipulation including cutting zip ties and inserting GPUs into motherboard sockets. NVIDIA's own framing of the result: 'A part of our NVIDIA GEAR lab now self-improves tirelessly overnight. We just read the reports in the morning' [1].
On the same day this research attracted broad attention, Anthropic published results from Project Fetch Phase 2 [2]. In the original 2024 experiment, Project Fetch asked human employees — aided by Claude — to program an off-the-shelf robot dog from scratch. In Phase 2, Claude Opus 4.7 was given the task alone: connect real robot hardware, read camera and lidar sensor feeds, write movement code, and track the robot's location. The model completed the full sequence in 12 minutes and 7 seconds [3], roughly 20x faster than the human-assisted team in the prior year [3][4].
A parallel body of academic work covers the methods that underlie both systems: language-instructed skill acquisition, continual robot learning, and LLM-guided reinforcement learning for real-world manipulation [5][6][7][8]. These papers situate ENPIRE and Project Fetch within wider research on using LLMs to reduce the human effort in robot learning pipelines. The NVIDIA and Anthropic work is distinctive for running against real hardware rather than simulation, and for framing the result as a production capability rather than an ablation study.
One notable gap in the coverage: a social media post asserts that every major AI lab except OpenAI and Anthropic is building physical AI [9]. Project Fetch Phase 2 directly contradicts the Anthropic half of this claim, but neither company has announced a sustained robotics research division comparable to efforts at Google DeepMind or dedicated robotics firms. Whether these experiments represent ongoing strategic bets or periodic capability probes is not addressed in the available sources.
Timeline
- 2024: Anthropic Project Fetch Phase 1: human employees aided by Claude attempt to program an off-the-shelf robot dog from scratch, establishing the baseline for future comparison. [2][4]
- 2026-06-17: Ars Technica reports on NVIDIA ENPIRE: AI coding agents autonomously train robotic arms overnight on dexterous tasks including GPU installation and zip-tie cutting, in a collaboration between NVIDIA GEAR, CMU, and UC Berkeley. [1]
- 2026-06-18: Anthropic releases Project Fetch Phase 2: Claude Opus 4.7 programs a robot dog in 12 minutes 7 seconds without human assistance, approximately 20x faster than the 2024 human-assisted effort. [3][2][13][14]
Perspectives
NVIDIA GEAR Lab
Frames ENPIRE as enabling a genuinely self-improving research lab where agents run experiments overnight and researchers review reports in the morning, with no human required during training.
Evolution: Consistent with NVIDIA's broader push into physical AI infrastructure; ENPIRE is the public research instantiation of that direction.
Anthropic
Project Fetch Phase 2 demonstrates that a frontier LLM can independently handle robot hardware integration, sensor reading, code writing, and navigation — far faster than a human-assisted team.
Evolution: Phase 2 directly follows Phase 1 (2024), showing substantially expanded autonomous capability by removing the human from the loop entirely.
Rohan Paul (AI commentator)
The 20x speed improvement in Project Fetch is striking evidence that autonomous AI capability in physical systems is advancing rapidly.
Evolution: Consistent framing as a significant milestone; no skepticism expressed.
Jeremy Hsu / Ars Technica
Reports ENPIRE as a significant step toward fully autonomous robot skill acquisition pipelines, amplifying NVIDIA's 'self-improving lab' framing without notable skepticism.
Evolution: Consistent neutral-to-positive technology reporting.
Academic research community (CMU, UC Berkeley, USC RASC, AAAI)
A body of concurrent work confirms LLMs can guide robot skill acquisition in unfamiliar environments, providing methodological grounding for what ENPIRE and Project Fetch demonstrate on real hardware.
Evolution: Ongoing; the papers predate or run parallel to the industry announcements.
Social commentator (thehype.)
Argues that every major AI lab except OpenAI and Anthropic is investing in physical AI, positioning both as absent from embodied AI development.
Evolution: A reactive framing published the same day Anthropic released Project Fetch Phase 2, which directly contradicts the Anthropic half of the claim.
Tensions
Status: active and growing
Sources
- [1] AI coding agents taught robots how to install GPUs and cut zip ties — Ars Technica AI (2026-06-17)
- [2] Project Fetch: Can Claude train a robot dog? \ Anthropic — reactive:ai-coding-agents-robot-training
- [3] Anthropic just showed Claude Opus 4.7 program a robodog in 12:07 mint, about 20x faster than last year’s Claude-aided hu… — Rohan Paul Twitter (2026-06-18)
- [4] Anthropic reran Project Fetch from 2024, their robodog experiment where random employees tried to make an off the shelf,... — reactive:ai-coding-agents-robot-training (2026-06-18)
- [5] Continual Robot Learning via Language-Guided Skill Acquisition | OpenReview — reactive:ai-coding-agents-robot-training
- [6] LLMs can help robots learn new tasks in unfamiliar places – Robotics and Autonomous Systems Center — reactive:ai-coding-agents-robot-training
- [7] Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models — reactive:ai-coding-agents-robot-training
- [8] [PDF] Efficient Language-instructed Skill Acquisition via Reward-Policy Co ... — reactive:ai-coding-agents-robot-training
- [9] every big ai lab is now building physical ai. except openai and anthropic. why? — reactive:ai-coding-agents-robot-training (2026-06-16)
- [10] ENPIRE: Agentic Robot Policy Self-Improvement in the Real World — reactive:ai-coding-agents-robot-training
- [11] Read the full write-up of Project Fetch: — reactive:ai-coding-agents-robot-training
- [12] Anthropic's Project Fetch: How AI models like Claude can control robots | Anthropic posted on the topic | LinkedIn — reactive:ai-coding-agents-robot-training
- [13] Anthropic just released Phase 2 of Project Fetch. They gave their latest AI model a robotic dog and told it to figure ou... — reactive:ai-coding-agents-robot-training (2026-06-18)
- [14] AnthropicAI just released Phase 2 of Project Fetch. They gave their latest AI model a robotic dog and told it to figure ... — reactive:ai-coding-agents-robot-training (2026-06-18)