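Cowork debuted only last month, and earlier this month Anthropic also released Claude Opus 4.6 and Sonnet 4.6. Cowork is still in research preview and is available to paying Pro, Max, Team, and Enterprise users. All of this firepower is aimed at a single target: taking over work.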
The real annoying thing about Opus 4.6/Codex 5.3 is that it’s impossible to publicly say “Opus 4.5 (and the models that came after it) are an order of magnitude better than coding LLMs released just months before it” without sounding like an AI hype booster clickbaiting, but it’s the counterintuitive truth to my personal frustration. I have been trying to break this damn model by giving it complex tasks that would take me months to do by myself despite my coding pedigree but Opus and Codex keep doing them correctly. On Hacker News I was accused of said clickbaiting when making a similar statement with accusations of “I haven’t had success with Opus 4.5 so you must be lying.” The remedy to this skepticism is to provide more evidence in addition to greater checks and balances, but what can you do if people refuse to believe your evidence?
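Article 18. Administrative law enforcement supervision bodies shall, through reviews of enforcement case files, examine whether administrative enforcement decisions are lawful; whether they are proportionate to the facts, nature, and circumstances of the violation and to the degree of social harm; whether enforcement documents meet the required standards; and whether the evidence is authentic and complete.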
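"The current SAE levels draw a line in law and liability, not a technical divide. L3 is essentially L4 restricted to a limited ODD (operational design domain), so 'skipping' it is, technically speaking, a false proposition." In an article on the WeChat public account 电厂, Yi Qiang (易强), former head of Bosch's in-vehicle product line, argued that L3 is "L4 with a narrowed scope" and that the difference lies mainly in laws and regulations. The law artificially restricts where L3 may be used. That is the biggest difference between L3 and L4 today.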