How does DeepSeek R1 really fare against OpenAI’s best reasoning models? We run the LLMs through a gauntlet of tests, from creative writing to complex instruction.