Laughing Hyena
  • Home
  • Hyena Games
  • Esports
  • NFT Gaming
  • Crypto Trends
  • Game Reviews
  • Game Updates
  • GameFi Guides
  • Shop
Tag:

evaluations

DAAPrivacyRightIcon
Product Reviews

OpenAI and Anthropic conducted safety evaluations of each other’s AI systems

by admin August 27, 2025


Most of the time, AI companies are locked in a race to the top, treating each other as rivals and competitors. Today, OpenAI and Anthropic revealed that they agreed to evaluate the alignment of each other’s publicly available systems and shared the results of their analyses. The full reports get pretty technical, but are worth a read for anyone who’s following the nuts and bolts of AI development. A broad summary showed some flaws with each company’s offerings, as well as revealing pointers for how to improve future safety tests.

Anthropic said it evaluated OpenAI models for “sycophancy, whistleblowing, self-preservation, and supporting human misuse, as well as capabilities related to undermining AI safety evaluations and oversight.” Its review found that o3 and o4-mini models from OpenAI fell in line with results for its own models, but raised concerns about possible misuse with the ​​GPT-4o and GPT-4.1 general-purpose models. The company also said sycophancy was an issue to some degree with all tested models except for o3.

Anthropic’s tests did not include OpenAI’s most recent release. GPT-5 has a feature called Safe Completions, which is meant to protect users and the public against potentially dangerous queries. OpenAI recently faced its first wrongful death lawsuit after a tragic case where a teenager discussed attempts and plans for suicide with ChatGPT for months before taking his own life.

On the flip side, OpenAI ran tests on Anthropic models for instruction hierarchy, jailbreaking, hallucinations and scheming. The Claude models generally performed well in instruction hierarchy tests, and had a high refusal rate in hallucination tests, meaning they were less likely to offer answers in cases where uncertainty meant their responses could be wrong.

The move for these companies to conduct a joint assessment is intriguing, particularly since OpenAI allegedly violated Anthropic’s terms of service by having programmers use Claude in the process of building new GPT models, which led to Anthropic barring OpenAI’s access to its tools earlier this month. But safety with AI tools has become a bigger issue as more critics and legal experts seek guidelines to protect users, particularly minors.



Source link

August 27, 2025 0 comments
0 FacebookTwitterPinterestEmail

Categories

  • Crypto Trends (239)
  • Esports (173)
  • Game Reviews (145)
  • Game Updates (198)
  • GameFi Guides (232)
  • Gaming Gear (212)
  • NFT Gaming (239)
  • Product Reviews (209)

Recent Posts

  • Buzzy Ethereum Game Football.fun Has Soccer Fans Scoring Crypto Gains
  • US Trading App Webull Launches Crypto Service in Australia to Challenge Incumbents
  • Nearly Every Whale Shark at This Tourist Destination Bears Human-Made Scars
  • Ohtani takes big leap, earns first win of season for Dodgers
  • Crypto Trader Boosts MEXC Bounty to $2.5M Over KYC Demand

Recent Posts

  • Buzzy Ethereum Game Football.fun Has Soccer Fans Scoring Crypto Gains

    August 28, 2025
  • US Trading App Webull Launches Crypto Service in Australia to Challenge Incumbents

    August 28, 2025
  • Nearly Every Whale Shark at This Tourist Destination Bears Human-Made Scars

    August 28, 2025
  • Ohtani takes big leap, earns first win of season for Dodgers

    August 28, 2025
  • Crypto Trader Boosts MEXC Bounty to $2.5M Over KYC Demand

    August 28, 2025

Newsletter

Subscribe my Newsletter for new blog posts, tips & new photos. Let's stay updated!

About me

Welcome to Laughinghyena.io, your ultimate destination for the latest in blockchain gaming and gaming products. We’re passionate about the future of gaming, where decentralized technology empowers players to own, trade, and thrive in virtual worlds.

Recent Posts

  • Buzzy Ethereum Game Football.fun Has Soccer Fans Scoring Crypto Gains

    August 28, 2025
  • US Trading App Webull Launches Crypto Service in Australia to Challenge Incumbents

    August 28, 2025

Newsletter

Subscribe my Newsletter for new blog posts, tips & new photos. Let's stay updated!

@2025 laughinghyena- All Right Reserved. Designed and Developed by Pro


Back To Top
Laughing Hyena
  • Home
  • Hyena Games
  • Esports
  • NFT Gaming
  • Crypto Trends
  • Game Reviews
  • Game Updates
  • GameFi Guides
  • Shop

Shopping Cart

Close

No products in the cart.

Close