UK agency releases tools to test AI model safety

By techtost.com | 12 May 2024 | 3 Mins Read

The UK AI Safety Institute, the UK’s recently established AI safety body, has released a toolset designed to “strengthen AI safety” by making it easier for industry, research organizations and academia to develop AI evaluations.

Called Inspect, the toolset is available under an open source license, specifically the MIT License. It aims to assess certain capabilities of AI models, including their core knowledge and ability to reason, and to generate a score based on the results.

In a press release announcing the news on Friday, the Safety Institute claimed that Inspect marks “the first time that an AI safety testing platform which has been spearheaded by a state-backed body has been released for wider use.”

A look at the Inspect dashboard.

“Successful collaboration on AI safety testing means having a shared, accessible approach to evaluations, and we hope Inspect can be a building block,” said Safety Institute chair Ian Hogarth. “We hope to see the global AI community using Inspect to not only carry out their own model safety tests, but to help adapt and build upon the open source platform so we can produce high-quality evaluations across the board.”

As we’ve written before, AI benchmarks are hard, not least because the most sophisticated AI models today are black boxes whose infrastructure, training data and other key details are kept under wraps by the companies that build them. So how does Inspect tackle the challenge? Mainly by being extensible to new testing techniques.

Inspect consists of three main components: datasets, solvers and scorers. Datasets provide samples for evaluation tests. Solvers do the work of running the tests. And scorers evaluate the solvers’ work and aggregate test scores into metrics.

Inspect’s built-in components can be augmented by third-party packages written in Python.
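
To make the division of labor concrete, here is a minimal sketch in plain Python of how a dataset, a solver and a scorer might fit together. It is an illustration of the pattern only and does not use Inspect’s actual API; the names `Sample`, `toy_model`, `solver`, `scorer` and `evaluate` are hypothetical, and the "model" is a canned lookup table standing in for a real AI model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Sample:
    """One evaluation item: the prompt and the answer we expect."""
    input: str
    target: str


def toy_model(prompt: str) -> str:
    """Hypothetical stand-in for a model call (assumption: a real
    evaluation would query an actual AI model here)."""
    canned = {"What is 2 + 2?": "4", "Capital of France?": "Paris"}
    return canned.get(prompt, "I don't know")


def solver(sample: Sample, model: Callable[[str], str]) -> str:
    """Run the test: send the sample's input to the model, return its output."""
    return model(sample.input)


def scorer(output: str, sample: Sample) -> float:
    """Grade one output: 1.0 on a case-insensitive exact match, else 0.0."""
    return 1.0 if output.strip().lower() == sample.target.strip().lower() else 0.0


def evaluate(dataset: List[Sample], model: Callable[[str], str]) -> dict:
    """Solve and score every sample, then aggregate the scores into a metric."""
    scores = [scorer(solver(s, model), s) for s in dataset]
    accuracy = sum(scores) / len(scores) if scores else 0.0
    return {"samples": len(scores), "accuracy": accuracy}


# The dataset: a small list of samples for the evaluation.
DATASET = [
    Sample("What is 2 + 2?", "4"),
    Sample("Capital of France?", "Paris"),
    Sample("Largest planet?", "Jupiter"),
]

if __name__ == "__main__":
    print(evaluate(DATASET, toy_model))
```

In this toy setup, swapping in a different scorer (say, one that accepts partial matches) or a different solver (one that adds a system prompt before calling the model) would not require touching the dataset, which is the kind of extensibility the three-part design is meant to allow.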

In a post on X, Deborah Raji, a research fellow at Mozilla and noted AI ethicist, called Inspect “a testament to the power of public investment in open source AI accountability tools.”

Clément Delangue, CEO of AI startup Hugging Face, floated the idea of integrating Inspect with Hugging Face’s model library or creating a public leaderboard with the results of the toolset’s evaluations.

The release of Inspect comes after a US government agency, the National Institute of Standards and Technology (NIST), launched NIST GenAI, a program to evaluate various generative AI technologies, including AI that generates text and images. NIST GenAI plans to release benchmarks, help build content authenticity detection systems, and encourage the development of software to spot fake or misleading AI-generated information.

In April, the US and UK announced a partnership to jointly develop advanced AI model testing, following commitments announced at the UK’s AI Safety Summit at Bletchley Park in November last year. As part of the collaboration, the US plans to create its own AI safety institute, which will be broadly tasked with assessing risks from AI and generative AI.
