Large Language Models
OpenAI's o1 model excels in math, coding, and science, outperforming humans on GPQA Diamond. Uses internal "chain of thought" for complex reasoning.
Google's DataGemma uses Data Commons to reduce LLM hallucinations. RIG and RAG approaches improve factual accuracy in AI responses.
Claude Enterprise offers 500K token context, GitHub integration, and enhanced security. Early users report significant productivity gains across tasks.
Llama models near 350M downloads, with 20M in the past month. Usage doubled from May to July 2024, and major firms are integrating Llama-based AI.
Anthropic releases Artifacts for Claude.ai across all user tiers, enabling visualisation and iteration of AI-generated work like code diagrams and dashboards.
MIT's SigLLM framework uses large language models to detect anomalies in time-series data without extensive training, showing promise for complex systems.