Large Language Models
Anthropic launches Message Batches API, allowing processing of up to 10,000 queries within 24 hours at half the cost of standard API calls.
MIT study: AI models in home surveillance inconsistently detect crime, recommend police calls, and show demographic biases in decision-making
Co-LLM allows general AI to collaborate with expert models, improving factual accuracy. It uses a "switch variable" to defer to specialists at word level, enhancing efficiency.
OpenAI's new 'o1' models excel in science, coding, and math. They achieved 83% success on IMO qualifying exams, up from GPT-4's 13%.
OpenAI's o1 model excels in math, coding, and science, outperforming humans on GPQA Diamond. Uses internal "chain of thought" for complex reasoning.
Google's DataGemma uses Data Commons to reduce LLM hallucinations. RIG and RAG approaches improve factual accuracy in AI responses.