Claude Opus 4.8: Anthropic makes a more honest AI

0
60

How to try Claude Opus 4.8, the 'honest' Anthropic AI

Anthropic isn't ready to let regular users look at its supposedly super-powerful Claude Mythos AI model just yet. But the AI company has just released an upgrade to its flagship product, Claude Opus — now in its 4.8 version.

"It builds on Opus 4.7 with improvements across benchmarks, and is a more effective collaborator," Anthropic promised in a press release Thursday. Indeed, the benchmark numbers, below, show very minor improvements across the board.

One major improvement, allegedly, is in the area of hallucinations. Claude Opus 4.8 won't lie to users as much. "Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims," Anthropic said, touting the model's "honesty."

Claude Opus 4.8 has 'better judgment'

"Claude Opus 4.8 has noticeably better judgment," an engineer at Shopify, Tom Pritchard, told Anthropic. The coding version of the model "asks the right questions, catches its own mistakes, and pushes back when a plan isn't sound."

Given the increasing number of horror stories about AI agents deleting entire corporate databases, that promise may be music to the ears of vibe coders everywhere.

Mashable Light Speed

To please power users, Anthropic is offering a significant discount on "fast mode," where Claude will work at 2.5 times regular speed. Fast mode "is now three times cheaper than it was for previous models," the company said.

Users on Reddit weren't buying it, however. Many feared a loss of access to a more popular model, Claude Opus 4.6. "Nobody trusts the benchmark charts," wrote one redditor in summary, noting that Opus 4.7 also seemed to have some pretty good numbers when it was released.

Whether or not we can trust the benchmarks — and to be clear, Mashable hasn't independently verified these numbers — here's what Anthropic is claiming.

A list of benchmark numbers for Claude Opus 4.8

Credit: Anthropic

Claude Opus 4.8 is available now via Anthropic's website, Claude.AI, as well as via the Claude API, plus Anthropic partners like Microsoft Foundry.

The new model is priced exactly the same as its predecessors, which is to say models going all the way back to Claude Opus 4.5. All of them will cost you $5 per million input tokens and $25 per million output tokens.

Given that Anthropic is promising Claude Mythos within a matter of weeks, however, you may want to hang back and wait to see whether that model can be even more "honest" about its hallucinations.

Αναζήτηση
Κατηγορίες
Διαβάζω περισσότερα
Παιχνίδια
New World of Warcraft Midnight update throws Hunters an essential lifeline, with Warriors and Shamans also on the rise
New World of Warcraft Midnight update throws Hunters an essential lifeline, with Warriors and...
από Test Blogger6 2026-05-03 11:00:14 0 523
Technology
The EcoVacs Deebot X11 robot vacuum is down to its best-ever price at Amazon — save $200
Best robot vacuum deal: Save $200 on EcoVacs Deebot X11...
από Test Blogger7 2026-05-06 10:00:14 0 502
άλλο
Asia-Pacific Emerges as Key Growth Hub for Explosion-Proof Lighting
Introduction The explosion-proof lighting market is witnessing significant growth as industries...
από Akshay Patil 2026-02-26 10:57:46 0 2χλμ.
Technology
The Samsung QD-OLED Odyssey G9 gaming monitor is over $300 off at Amazon — save vs. Walmart
Best gaming monitor deal: Samsung Odyssey G9 QD-OLED drops to $899.99 with free code of Resident...
από Test Blogger7 2026-03-05 12:00:25 0 1χλμ.
Technology
Bluetti just launched the FridgePower portable power station — preorders save at least $540
Bluetti FridgePower launch: Preorder the new FridgePower and take up to 44% off...
από Test Blogger7 2026-04-16 16:00:14 0 835