15 posts

OpenAI

Latest posts
New SimpleQA Benchmark Aims to Test Language Models' Factual Accuracy
New SimpleQA Benchmark Aims to Test Language Models' Factual Accuracy

OpenAI's SimpleQA tests 4,326 factual questions with 3% error rate. GPT-4o scores under 40%, showing larger models excel while deeper thinking ones opt to decline.

by AI-360
OpenAI Models Power Decagon's High-Performance Customer Support Platform
OpenAI Models Power Decagon's High-Performance Customer Support Platform

"Decagon combines GPT-3.5 for query rewriting in RAG workflows with GPT-4 for complex decisions, achieving 91% automation in customer support"

by AI-360
AI Transcription: Why Human Oversight Remains Essential in the Age of Language Models
AI Transcription: Why Human Oversight Remains Essential in the Age of Language Models

"While AI tools save time, human oversight remains essential. As AI-360 notes: 'HUMAN IN THE LOOP is very much ESSENTIAL in largely, every step.'"

by Stewart Tinson
AI Medical Transcription Tool Invents Fake Patient Information, AP Investigation Finds
AI Medical Transcription Tool Invents Fake Patient Information, AP Investigation Finds

"Studies found fabrications in 80% of public meetings, 50% of 100+ hours of audio, and nearly all 26,000 transcripts analyzed by developers"

by AI-360
OpenAI Announces Major Speed Breakthrough in AI Image Generation Technology
OpenAI Announces Major Speed Breakthrough in AI Image Generation Technology

sCM system reduces image generation to 0.11 seconds using two processing steps vs hundreds, and scales to 1.5B parameters while matching quality of slower methods.

by AI-360
The IP Paradox: OpenAI's Defensive Patent Pledge Raises Eyebrows Amid Copyright Battles
The IP Paradox: OpenAI's Defensive Patent Pledge Raises Eyebrows Amid Copyright Battles

"OpenAI's defensive patent pledge faces skepticism as they block competitor access to training data while defending against copyright infringement suits."

by Stewart Tinson
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Great! You've successfully signed up.
Great! You've successfully signed up.
Welcome back! You've successfully signed in.
Success! You now have access to additional content.