Text graphic stating "76% of AI Models Fail This Safety Benchmark!" with logos of ChatGPT, Gemini, Claude, and Grok over a blurred background of the Aymara LLM Risk & Responsibility Matrix heatmap in green, yellow, and red tones.

76% of Top AI Models Fail Basic Safety Tests — How Safe Is Yours?

New research reveals that even the most popular AI models can’t be trusted when it comes to basic safety. The most powerful LLMs today — including models from OpenAI, Google, and Cohere — were put through a rigorous safety benchmark. The results? Not great. Out of 20 models tested across 10 real-world risk areas, none passed […]