AI Safety stories - Page 3
Security methods, safety goals: Rethinking AI red teaming
Last month
#
ai safety
AI red teaming blends security tactics with safety goals to prevent exploits in chatbots, defending users from harm beyond classic cyber threats.
From assistance to autonomy: Why AI now needs continuous quality intelligence
Last month
#
ai safety
AI agents are now active participants in enterprise systems, requiring continuous quality intelligence to ensure safe, reliable autonomous behaviour over time.
Gluware Titan brings verified AI automation to enterprise networks
Last month
#
ai safety
Gluware unveils Titan, an AI validation platform ensuring assured, compliant, and reversible automation for complex enterprise networks.
OpenAI AI models lead secure code generation as rivals stagnate
Last month
#
ai safety
OpenAI's AI models lead secure code generation with up to 72% pass rate, outpacing rivals who show little progress despite ongoing sector development.
Invisible AI failures pose growing threat to enterprise trust
Last month
#
ai safety
Invisible AI failures, such as hallucinations and accuracy issues, threaten enterprise trust, with 82% of bugs traced to these hidden errors, Testlio finds.
Google launches Gemini 3 AI with multimodal & reasoning boost
Last month
#
ai safety
Google launches Gemini 3 AI with advanced multimodal and reasoning capabilities, enhancing tasks from research to complex coding across multiple platforms.
AI firms set new highs for revenue per employee & efficiency
Last month
#
ai safety
AI firms like Copilot and OpenAI set new efficiency records, generating millions in revenue per employee, highlighting the sector's rapid growth and lean workforces.
Semantic Firewall promises AI cost savings & safer chat models
Last month
#
ai safety
Semantic Firewall cuts AI compute costs by up to 88% and enhances user safety by managing language before it reaches large models, aiding partners globally.
Anthropic pledges USD $50 billion for AI data centres in the US
Last month
#
ai safety
Anthropic to invest USD $50 billion in new AI data centres in Texas and New York, creating 3,200 jobs and boosting US computing infrastructure by 2026.
Anthropic identifies AI-driven cyber-espionage campaign
Last month
#
ai safety
A China-linked group launched a major AI-driven cyber-espionage campaign targeting global firms, performing 80-90% of hacking with minimal human input.
New AI roadmap to modernise Australian public service
Last month
#
ai safety
Australia's public service unveils a 2025 AI plan to boost transparency, training and secure use of generative AI across federal agencies.
We don't craft AI, we grow it
Last month
#
ai safety
AI isn’t built but grown; we cultivate intelligence that emerges unpredictably, raising urgent ethical issues about control and alignment.
Seven critical ChatGPT flaws expose users to data theft risks
Last month
#
ai safety
Tenable reveals seven major ChatGPT vulnerabilities exposing users to risks of data theft and malicious attacks, with some flaws still unpatched in ChatGPT-5.
Hitachi iQ Studio aims to ease AI deployment & boost governance
Last month
#
ai safety
Hitachi Vantara launches Hitachi iQ Studio, a no-code AI platform to help enterprises scale AI deployment with strong data governance and regulatory compliance.
The upsurge and threats of self-reproducing AI
Last month
#
ai safety
Self-replicating AI, though theoretical, poses ethical and security risks as experts urge strict controls to ensure safe, human-aligned development by 2024.
AWS’s $11bn Indiana data centre powers Anthropic’s AI growth
Last month
#
ai safety
AWS’s $11bn Rainier data centre in Indiana powers Anthropic’s AI surge, hosting 500,000 custom chips to drive model training and global expansion.
Open-source b3 framework to benchmark AI agent security unveiled
Fri, 31st Oct 2025
#
ai safety
Check Point, Lakera and the UK AI Security Institute launch b3, an open-source benchmark to test security of large language models in AI agents.
Trend Micro integrates with NVIDIA for enhanced AI data security
Thu, 30th Oct 2025
#
ai safety
Trend Micro partners with NVIDIA to enhance AI data security, integrating advanced detection and guardrails for safer, faster AI workload deployment.
AI risk outpaces oversight as BSI warns of governance gaps for firms
Wed, 29th Oct 2025
#
ai safety
BSI warns many UK and global firms lack robust AI governance despite rising investment, risking operational failures and reputational damage amid growing AI use.
Responsible AI governance drives business gains but risk gaps persist
Wed, 29th Oct 2025
#
ai safety
Organisations with strong responsible AI governance achieve 34% higher revenue growth and 65% better cost savings, yet risk gaps persist, survey shows.