DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole

For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same. OpenAI's GPT-5 family, Anthropic's Claude Opus, and Google's Gemini Pro have clustered within a narrow band on Scale AI's SWE-Bench Pro leaderboard, making it nearly impossible for engineering leaders to determine which agent will actually perform bes

Source

VentureBeat

Read full article at VentureBeat

Opens original article in a new tab

Did the Pope use AI to write about the dangers of AI?

It's possible that AI was used to write parts of Pope Leo XIV's latest encyclical about AI's impact on humanity. An analysis by Linch Zhang posted on the forum LessWrong found certain paragraphs of Magnifica Humanitas to be between 40 percent and 100 percent written by AI, according to the popular AI detector Pangram. The document includes known traits that appear in AI-generated writing, such as a higher use of the word "genuinely" - which crops up in writing by Anthropic's Claude - than previo

about 1 hour agoRead more →

TechCrunch

UK Visa Portal spilled thousands of applicants’ passports and selfies online — and hasn’t fixed the leak

The third-party website exposed applicants' sensitive documents as part of the U.K. visa application process. Instead of fixing the issue, the company sent attorneys.

about 2 hours agoRead more →

Wired

Pope Leo Schooled the Tech Bros on Tolkien

The Holy Father referenced The Lord of the Rings in his encyclical about AI—an expert (if unintentional) troll of tech billionaires who keep misinterpreting the series.

about 2 hours agoRead more →

TechCrunch3.0 · Center

What we’re looking for in Startup Battlefield 2026, and how to apply in time for the May 27 deadline

Startup Battlefield applications are due tomorrow, so now's the time to put the finishing touches on your submission!

about 3 hours agoRead more →

Related Tech Stories

Did the Pope use AI to write about the dangers of AI?

UK Visa Portal spilled thousands of applicants’ passports and selfies online — and hasn’t fixed the leak

Pope Leo Schooled the Tech Bros on Tolkien

What we’re looking for in Startup Battlefield 2026, and how to apply in time for the May 27 deadline