AI-360
Salesforce launches Agentforce, autonomous AI agents for customer interaction. Atlas Reasoning Engine simulates human thought for complex tasks.
ChatGPT integrated with AVs interprets direct and indirect passenger commands. Tests showed lower discomfort rates and good performance even with novel instructions.
Co-LLM allows general AI to collaborate with expert models, improving factual accuracy. It uses a "switch variable" to defer to specialists at word level, enhancing efficiency.
Stanford HAI held workshop for ASEAN on AI governance. Topics: bias, fairness, trust. ASEAN plans new AI initiatives, including ethics guide and working group.
OpenAI's o1-mini matches larger models in STEM tasks at 80% lower cost. It scores 70% on AIME, reaches 86th percentile on Codeforces, and is 3-5x faster than GPT-4o.
OpenAI's new 'o1' models excel in science, coding, and math. They achieved 83% success on IMO qualifying exams, up from GPT-4's 13%.