AI-360
"AI systems went from solving 2% of real coding problems to 49% in one year. Anthropic warns the window for safe regulation is closing fast."
"Stanford's fellowship places tech experts in government roles. One fellow helped write AI laws, while others improved public services and training."
OpenAI's SimpleQA tests 4,326 factual questions with 3% error rate. GPT-4o scores under 40%, showing larger models excel while deeper thinking ones opt to decline.
New MIT postdoc programme pairs AI experts with mentors in biology, physics, music & more, funded by $20M Tayebati gift to advance cross-disciplinary AI applications.
Google's Trillium TPU achieves 4.7x peak compute vs prior gen with 67% better energy efficiency, now powering Search, YouTube & DeepMind's LLMs in preview.
Using Edit & Bash tools, newest Claude 3.5 Sonnet hits 49% on SWE-bench's GitHub issue tests, despite challenges like hidden tests & high token costs.