
Anthropic似乎已势不可挡。该公司接连发布了多款重磅模型,据称其正在洽谈的新一轮融资估值接近1万亿美元,公司年化收入规模呈现近乎指数级增长。如今,这家公司又将死对头OpenAI旗下最知名的前成员之一收入麾下。
安德烈·卡帕西周二在社交平台X发帖称:“个人近况更新:我已经加入Anthropic。”这条帖子发布仅一小时便获得近300万次浏览。他表示,未来几年将是大语言模型前沿发展的“关键成型期”,而他也迫切渴望重新回到研究工作。本周起,他已加入Anthropic的预训练团队。
这一决定意味着,在AI竞赛不断升温之际,在X平台上拥有近200万粉丝的AI大V卡帕西已经选择了自己的阵营。2015年,他是OpenAI创始成员之一;随后离开OpenAI,前往特斯拉(Tesla)负责AI业务;2023年重返OpenAI,但仅一年后再次离开,创办了自己的教育科技公司Eureka Labs。
过去十年来,卡帕西在AI领域一直颇具影响力。但真正让他跻身AI传奇行列的,却是他去年发布的一条推文。2025年2月,他发文称出现了一种“新的编程方式”,并将其命名为“氛围编程”:只需要直白地描述自己的需求,剩下的工作交给模型去完成。
这一说法很快突破行业圈层,席卷整个商业世界。越来越多企业开始绕开传统软件公司,争相开发专属于自己的AI智能体,并由此引发备受争议的“SaaS末日”。(毫无争议的是,在企业纷纷尝试通过“氛围编程”打造自有解决方案的过程中,软件公司的市值蒸发了数百亿美元。)“Vibe coding”这个英文词组后来还被《柯林斯词典》(Collins Dictionary)评选为年度词汇。而卡帕西在那条原始推文中提到的模型,正是Anthropic的产品。
加入Anthropic后,卡帕西将在另一篇爆款帖子的基础之上继续深化研究。今年3月,他搭建了一个AI编程智能体,并给它配备了一个小语言模型,让其在无人干预的情况下连续运行两天,让其自主测试并微调训练代码。经过700次实验和20项自主优化后,他表示,将同样的方法应用于更大规模模型时,训练时间可缩短11%。他将这一过程称为“自动研究”,并在帖子中配上笑脸写道:“一部分是代码,一部分像科幻小说,再加上一点点精神错乱。”这种方法后来被业内称为“卡帕西循环”。
而教会AI掌握这种方法,某种程度上正是卡帕西新工作的内容。Anthropic表示,卡帕西将组建一支团队,专注于利用Claude加速预训练研究,即那些赋予Claude核心知识与能力的大规模训练过程。他所在的团队由尼克·约瑟夫领导。
不过,在这之前,让卡帕西声名鹊起的其实是另一项技能——玩魔方。他曾运营一个名为“badmephisto”的YouTube频道,整整一代竞技魔方玩家都通过他的视频学会了“速拧”技巧。在他的教学体系里,魔方并非由54个彩色贴纸组成,而是由26个独立的“小方块”构成。只要掌握这些最小单元的结构与规律,就能驾驭整个魔方。他最快能在约17秒内还原一个魔方。
当然,从神经网络到大语言模型,卡帕西面对的谜题变得越来越复杂。但他的思维方式始终没有改变:只要能够彻底掌控一个足够小的系统,你就能驱动一个更庞大的系统。(财富中文网)
译者:刘进龙
审校:汪皓
Anthropic似乎已势不可挡。该公司接连发布了多款重磅模型,据称其正在洽谈的新一轮融资估值接近1万亿美元,公司年化收入规模呈现近乎指数级增长。如今,这家公司又将死对头OpenAI旗下最知名的前成员之一收入麾下。
安德烈·卡帕西周二在社交平台X发帖称:“个人近况更新:我已经加入Anthropic。”这条帖子发布仅一小时便获得近300万次浏览。他表示,未来几年将是大语言模型前沿发展的“关键成型期”,而他也迫切渴望重新回到研究工作。本周起,他已加入Anthropic的预训练团队。
这一决定意味着,在AI竞赛不断升温之际,在X平台上拥有近200万粉丝的AI大V卡帕西已经选择了自己的阵营。2015年,他是OpenAI创始成员之一;随后离开OpenAI,前往特斯拉(Tesla)负责AI业务;2023年重返OpenAI,但仅一年后再次离开,创办了自己的教育科技公司Eureka Labs。
过去十年来,卡帕西在AI领域一直颇具影响力。但真正让他跻身AI传奇行列的,却是他去年发布的一条推文。2025年2月,他发文称出现了一种“新的编程方式”,并将其命名为“氛围编程”:只需要直白地描述自己的需求,剩下的工作交给模型去完成。
这一说法很快突破行业圈层,席卷整个商业世界。越来越多企业开始绕开传统软件公司,争相开发专属于自己的AI智能体,并由此引发备受争议的“SaaS末日”。(毫无争议的是,在企业纷纷尝试通过“氛围编程”打造自有解决方案的过程中,软件公司的市值蒸发了数百亿美元。)“Vibe coding”这个英文词组后来还被《柯林斯词典》(Collins Dictionary)评选为年度词汇。而卡帕西在那条原始推文中提到的模型,正是Anthropic的产品。
加入Anthropic后,卡帕西将在另一篇爆款帖子的基础之上继续深化研究。今年3月,他搭建了一个AI编程智能体,并给它配备了一个小语言模型,让其在无人干预的情况下连续运行两天,让其自主测试并微调训练代码。经过700次实验和20项自主优化后,他表示,将同样的方法应用于更大规模模型时,训练时间可缩短11%。他将这一过程称为“自动研究”,并在帖子中配上笑脸写道:“一部分是代码,一部分像科幻小说,再加上一点点精神错乱。”这种方法后来被业内称为“卡帕西循环”。
而教会AI掌握这种方法,某种程度上正是卡帕西新工作的内容。Anthropic表示,卡帕西将组建一支团队,专注于利用Claude加速预训练研究,即那些赋予Claude核心知识与能力的大规模训练过程。他所在的团队由尼克·约瑟夫领导。
不过,在这之前,让卡帕西声名鹊起的其实是另一项技能——玩魔方。他曾运营一个名为“badmephisto”的YouTube频道,整整一代竞技魔方玩家都通过他的视频学会了“速拧”技巧。在他的教学体系里,魔方并非由54个彩色贴纸组成,而是由26个独立的“小方块”构成。只要掌握这些最小单元的结构与规律,就能驾驭整个魔方。他最快能在约17秒内还原一个魔方。
当然,从神经网络到大语言模型,卡帕西面对的谜题变得越来越复杂。但他的思维方式始终没有改变:只要能够彻底掌控一个足够小的系统,你就能驱动一个更庞大的系统。(财富中文网)
译者:刘进龙
审校:汪皓
Anthropic can’t seem to stop winning. After a string of blockbuster model releases, a new funding round reportedly in talks at a valuation approaching $1 trillion, and an annual run rate that’s nearly parabolic, it has now hired one of OpenAI’s—its bitter competitor—most famous alumni.
“Personal update: I’ve joined Anthropic,” Andrej Karpathy wrote on X on Tuesday, in a post that drew nearly 3 million views within one hour. He said the next few years at the frontier of LLMs would be “especially formative” and that he was eager to get back to research. He started this week on the pretraining team.
The decision suggests that Karpathy, whose writing on AI is followed by nearly 2 million people on X, has decided to put his stake in the AI race. He was first a founding member of OpenAI in 2015, left to run AI at Tesla, came back in 2023, and left only a year later to start his own education company, Eureka Labs.
Karpathy has been well-known in AI for a decade, but the thing—or phrase—that etched him into AI legend was a tweet from last year. In February 2025, he posted that there was a “new kind of coding” that he called vibe coding: Describe what you want plainly and let the model do the work.
The phrase escaped the industry and infected the business world, which turned its back against software companies and raced to develop its own bespoke agents, touching off the much-debated “SaaSpocalypse” in its wake. (What wasn’t debated is that tens of billions of dollars in stock valuations evaporated as firms tried to vibe code their own solutions.) Collins Dictionary named it Word of the Year. The model he cited in that original tweet was Anthropic’s.
As an Anthropic employee, Karpathy will be building on the work from another viral post. In March, he wired up an AI coding agent, handed it a single small language model, and let it run unsupervised for two days, testing and tweaking the training code on its own. After 700 experiments and 20 self-found optimizations, he said the same tweaks applied to a larger model cut training time by 11%. This, he said, was called autoresearch: “Part code, part sci-fi, and a pinch of psychosis” he wrote with a smiley face. The method would come to be known as “the Karpathy Loop.”
Teaching that method looks, more or less, like his new job. According to Anthropic, Karpathy will be starting a team focused on using Claude to accelerate pretraining research, the large-scale training runs that give Claude its core knowledge and capabilities. He sits on a team led by Nick Joseph.
Long before any of this, though, Karpathy was famous for something else: his Rubik’s Cube skills. He ran a YouTube channel called “badmephisto,” where a generation of competitive cubers learned to “speedcube” by seeing the cube as 26 individual “cubies” rather than 54 colored stickers. By sticking to the small structure, he could move the whole thing. He solved a Rubik’s Cube in about 17 seconds.
Certainly the puzzles for Karpathy got harder—neural nets and then language models—but the method never really changed. Get a small enough system fully under control, and you can move something much bigger.