claude ai news - Search News

36m

DeepSeek R1 : Open Source AI Competing with Big Tech Giants

DeepSeek R1 sets a new standard in open-source AI with competitive performance, model distillation, and groundbreaking ...

50m

CAIS and Scale AI Unveil Results of "Humanity's Last Exam," a Groundbreaking New Benchmark

The Center for AI Safety (CAIS) and Scale AI today announced the results of a groundbreaking new AI benchmark that was designed to test the limits of AI knowledge and whether the models are capable of ...

50m

When A.I. Passes This Test, Look Out

The creators of a new test called “Humanity’s Last Exam” argue we may soon lose the ability to create tests hard enough for A ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Trending now