Ars Technica - All News (RSS/Atom feed) 11 months ago

How does DeepSeek R1 really fare against OpenAI’s best reasoning models?

We run the LLMs through a gauntlet of tests, from creative writing to complex instruction.
