AI-360
Anthropic's Claude 3.7 Sonnet sets new benchmarks in coding abilities. Novo Nordisk uses it to reduce clinical report writing from 12 weeks to 10 minutes.
Claude 3.7 Sonnet, the first hybrid reasoning model, will be tested by scientists to compress decades of scientific progress into shorter timeframes.
Research shows AI can accelerate education innovation by simulating interventions before human evaluation—potentially reducing timelines from decades to years.
GPT-4.5 achieves 62.5% accuracy on SimpleQA evaluation and reduces hallucination rates to 37.1%, compared to GPT-4o's 61.8% hallucination rate.
Salesforce's help.salesforce.com implementation handled 380,000 customer service conversations with an 84% resolution rate, with only 2% requiring humans.
Claude 3.7 Sonnet is both an ordinary LLM and a reasoning model in one: you can pick when you want the model to answer normally or think longer.