15 posts

Anthropic

Latest posts
Anthropic Launches Initiative to Fund Third-Party AI Model Evaluations
Anthropic Launches Initiative to Fund Third-Party AI Model Evaluations

Anthropic launched a programme to fund third-party AI evaluations, focusing on safety and advanced capabilities. The initiative covers areas like cybersecurity, multilingual skills, and societal impacts, aiming to improve AI safety across the industry.

by AI-360
Claude 3.5 Sonnet: Advancing AI Intelligence and Accessibility
Claude 3.5 Sonnet: Advancing AI Intelligence and Accessibility

Claude 3.5 Sonnet: Faster, smarter AI with enhanced vision. New Artifacts feature enables collaboration. Anthropic prioritizes safety and privacy.

by Stewart Tinson
Anthropic Reveals Insights on AI Red Teaming Challenges and Methods
Anthropic Reveals Insights on AI Red Teaming Challenges and Methods

Anthropic share their AI red teaming practices, covering various methods and their pros and cons. They propose steps for industry-wide standardisation, including funding for technical standards and supporting independent red teaming bodies.

by AI-360
Safeguarding Election Integrity in the Age of AI
Safeguarding Election Integrity in the Age of AI

Anthropic safeguards elections with AI testing: Policy Vulnerability Testing and automated evaluations to address risks in AI models.

by AI-360
A  Look Inside a Large Language Model
A Look Inside a Large Language Model

Anthropic maps Claude Sonnet's inner workings, revealing features linked to concepts. The breakthrough could enhance AI safety and reliability.

by Stewart Tinson
The Inner Workings of Claude
The Inner Workings of Claude

Anthropic maps Claude's inner workings, identifying "features" in its neural network. Researchers can tune concept activation, impacting behaviour.

by AI-360
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Great! You've successfully signed up.
Great! You've successfully signed up.
Welcome back! You've successfully signed in.
Success! You now have access to additional content.