Anlife: what does an unusual evolution simulator have to say about AI?

2026年1月22日 · 马琳 · 来源：cache资讯

l00777 0 0 0 /lib - usr/lib

Hand-coded models can go much smaller (36 vs 311 trained) since they don't need to be discoverable by SGD

2026-02-26 00:00:00:0新华社记者 ——习近平总书记引领全党树立和践行正确政绩观

The real annoying thing about Opus 4.6/Codex 5.3 is that it’s impossible to publicly say “Opus 4.5 (and the models that came after it) are an order of magnitude better than coding LLMs released just months before it” without sounding like an AI hype booster clickbaiting, but it’s the counterintuitive truth to my personal frustration. I have been trying to break this damn model by giving it complex tasks that would take me months to do by myself despite my coding pedigree but Opus and Codex keep doing them correctly. On Hacker News I was accused of said clickbaiting when making a similar statement with accusations of “I haven’t had success with Opus 4.5 so you must be lying.” The remedy to this skepticism is to provide more evidence in addition to greater checks and balances, but what can you do if people refuse to believe your evidence?

Раскрыт не