Anthropic Released on Monday Its Claude 3.7 sonnet model, which it says Returns Results Faster and Can Show The User The User The “Chain of Thought” its Follows to Reach an Answer. This latest model also powers a new coding tool called claude code that can perform some development tasks autonomously.
Claude 3.7 Sonnet offers an “Extended Thinking” mode that engages in a more detailed “chain of thought” Reasoning but takes longer to generate a response. For simpler questions it Eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems; Anthropic Says Claude 3.7 sonnet is the first publicly model with the capability to choose the best mode based on the user’s question. If Grok 3 and Deepsek-R1 are sticks, then anthropic’s new model is an automatic.
“Just as humans use a single brain for both quick responses and Deep Reflection, We Believe Reasoning Should Be An Integrated Capability of Frontier Models Rather than a separete interlic,” Says in a blog post.
Claude 3.7 Sonnet Outperforms Other “Thinking” Models in Some Important Benchmark Tests. On Swe-Bench, which evaluates ai models’ ability to solve real- WORLD SOFTWARE ISSUES, The Model Beat Openai’s O1 and O3-MINI and R1 by a comfortable margin. It was the same story on tau-Bench, which tests ai agents on complex real-will tasks with user and tool interactions. However, Openai’s O1 Model Still Edges Out Claude 3.7 Sonnet in Math Problem Solving, Visual Reasoning, Multilingual Q & A, And Graduate-Level Reasoning BeningMarks.
Anthropic describes the claude code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to github. The company says the tool has alredy build “Indispensable” for its own coders, completeing tasks in a single pass that would normally take 45 minutes or more of manual work.
Claude 3.7 sonnet is now all claude subscription plans – Free, Pro, Team, And Enterprise – But the Extended Thinking Mode isn Bollywood to users of the free tier. Claude 3.7 sonnet is also available to developers as an api for the same price as earlier claude models.