Autoresearch, Agent Loops and the Future of Work
Episode Date: March 9, 2026Andrej Karpathy released autoresearch this weekend — a system where an AI agent runs experiments to improve a language model overnight, keeping what w...
A daily news analysis show on all things artificial intelligence. NLW looks at AI from multiple angles, from the explosion of creativity brought on by new tools like Midjourney and ChatGPT to the potential disruptions to work and industries as we know them to the great philosophical, ethical and practical questions of advanced general intelligence, alignment and x-risk.
948 episodes transcribedAndrej Karpathy released autoresearch this weekend — a system where an AI agent runs experiments to improve a language model overnight, keeping what w...
OpenClaw has now been in the wild for a little over a month, and builders are starting to converge on what actually works. The early experiments are r...
GPT 5.4 just dropped and the early consensus is clear — this is the most substantial OpenAI release in recent memory, with massive jumps in computer u...
AI has crossed the line from tech story to political battleground as the Anthropic–Pentagon dispute, Dario Amodei’s leaked memo attacking OpenAI and t...
Anthropic’s surge and OpenAI’s latest updates highlight how the consumer AI race is becoming about far more than model benchmarks. This episode explor...
A new wave of experiments is testing whether AI agents can build and run companies without human employees, with projects like FelixCraft generating r...
February 2026 was the month that AI's transformation stopped being an insider story and cascaded across groups — from developers embracing a new e...
This week, the global AI conversation hit a new level. From investor memos to viral economic doomsday scenarios, the debate is shifting from “Is AI re...
The standoff between Anthropic and the Pentagon exploded this week when President Trump directed every federal agency to cease using Anthropic's t...
Block just cut 40% of its workforce in one move, with Jack Dorsey arguing that new intelligence tools and smaller, flatter teams fundamentally change...
Anthropic rolls out Claude Code Remote Control and Scheduled Tasks, Perplexity launches Perplexity Computer, Notion unveils Custom Agents, and suddenl...
Public skepticism toward AI is rising, and it’s not just media hype. From job displacement fears and artist backlash to data center protests, child de...
As METR releases the results of their long-horizon test for Claude Opus 4.6, the benchmark shows just how fast things are moving. In fact, one recent...
AI is reshaping the economy—but not always in the way most leaders expect. This episode explores why AI could matter more for plumbers than programmer...
Gemini 3.1 Pro arrives with big benchmark gains and a sharp jump in reasoning, coding, and efficiency—but in a world where the frontier rotates weekly...
A new Anthropic study shows that AI agents are being used far more conservatively than their capabilities suggest, with short sessions, heavy human ov...
Anthropic drops Sonnet 4.6 with a million-token context window and major gains in computer use, coding, and agentic workflows at a dramatically lower...
For years, AI felt transformative in anecdotes but invisible in macroeconomic data. That may be changing. Revised labor statistics suggest stronger-th...
OpenClaw’s meteoric rise—from a weekend Claude experiment to the fastest-growing open source AI project in the world—just culminated in Peter Steinber...
An 80-million-view post by Matt Schumer ignited one of the most important AI debates of 2026—are we underestimating how fast AI is transforming work,...