OpenAI says GPT-5 stacks up to humans in a wide range of jobs

By techtost.com | 28 September 2025 | 4 Mins Read

OpenAI released a new benchmark on Thursday that tests how its AI models perform compared to human professionals across a wide range of industries and jobs. The test, GDPval, is an early attempt to understand how close OpenAI’s systems are to outperforming humans at economically valuable work, a key part of the company’s founding mission to develop artificial general intelligence, or AGI.

OpenAI says it found that its GPT-5 model and Anthropic’s Claude Opus 4.1 are already approaching the quality of work produced by industry experts.

This does not mean that OpenAI’s models are going to start replacing people in their jobs immediately. Despite some CEOs’ predictions that AI will take people’s jobs within just a few years, OpenAI admits that GDPval today covers a very limited number of the tasks people do in their real jobs. However, it is one of the latest ways in which the company is measuring AI’s progress toward this milestone.

GDPval is based on the nine industries that contribute the most to America’s gross domestic product, including sectors such as healthcare, finance, manufacturing, and government. The benchmark tests an AI model’s performance in 44 occupations across these industries, ranging from software engineers to nurses to journalists.

For the first version of the test, GDPval-v0, OpenAI asked experienced professionals to compare AI-generated reports with those produced by other professionals and then choose the better one. For example, one prompt asked investment bankers to create a landscape of the last-mile delivery industry and compare it with AI-generated reports. OpenAI then averages an AI model’s “win rate” against the human reports across all 44 occupations.
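The averaging step described above can be illustrated with a minimal sketch. It assumes, hypothetically, that each grader comparison is recorded as an (occupation, verdict) pair and that ties count alongside wins, as the article's later "victories and ties" phrasing suggests. The data, the function name, and the choice to average per occupation first are illustrative assumptions, not OpenAI's published method.

```python
from collections import defaultdict

# Hypothetical grader verdicts: (occupation, verdict), where verdict is
# "win", "tie", or "loss" for the AI report versus the human report.
verdicts = [
    ("nurse", "win"),
    ("nurse", "loss"),
    ("journalist", "tie"),
    ("journalist", "loss"),
    ("journalist", "loss"),
    ("software engineer", "win"),
]


def win_or_tie_rate(verdicts):
    """Average the per-occupation share of comparisons the model won or tied."""
    counts = defaultdict(lambda: [0, 0])  # occupation -> [wins_or_ties, total]
    for occupation, verdict in verdicts:
        counts[occupation][1] += 1
        if verdict in ("win", "tie"):
            counts[occupation][0] += 1
    # Compute each occupation's rate, then take the unweighted mean so that
    # occupations with more comparisons do not dominate the overall score.
    per_occupation = {occ: good / total for occ, (good, total) in counts.items()}
    overall = sum(per_occupation.values()) / len(per_occupation)
    return overall, per_occupation


overall, per_occupation = win_or_tie_rate(verdicts)
print(per_occupation)
print(f"overall win-or-tie rate: {overall:.1%}")
```

Averaging per occupation first is only one possible design; pooling all comparisons into a single ratio would weight heavily sampled occupations more.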

For GPT-5-high, a version of GPT-5 with additional computing power, the company says the AI model was ranked as better than or on par with industry experts 40.6% of the time.

OpenAI also tested Anthropic’s Claude Opus 4.1, which was ranked as better than or on par with industry experts in 49% of tasks. OpenAI says it believes Claude scored so high because of its tendency to make pleasing graphics, rather than because of sheer performance.

It is worth noting that most working professionals do much more than submit research reports to their boss, which is all that GDPval-v0 tests for. OpenAI acknowledges this and says it plans to create more robust tests in the future that can account for more industries and interactive workflows.

Nevertheless, the company sees the progress on GDPval as notable.

In an interview with TechCrunch, OpenAI’s chief economist, Dr. Aaron Chatterji, said the results of GDPval suggest that people in these jobs can now use AI models to spend time on more meaningful tasks.

“[Because] the model is good at some of these things,” says Chatterji, “people in these jobs can now use the model, increasingly as capabilities improve, to offload part of their work and potentially do higher-value things.”

OpenAI’s evaluations lead, Tejal Patwardhan, tells TechCrunch that the rate of progress on GDPval is encouraging. OpenAI’s GPT-4o model, released about 15 months ago, scored just 13.7% (wins and ties against humans). Now GPT-5 scores nearly triple that, a trend that Patwardhan expects to continue.
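As a quick check of the "nearly triple" claim, the arithmetic can be sketched as follows. It assumes the comparison uses the 40.6% figure the article reports for GPT-5-high; the variable names are illustrative.

```python
gpt_4o_score = 13.7  # % wins and ties against humans, per the article
gpt_5_score = 40.6   # % reported for GPT-5-high, per the article

ratio = gpt_5_score / gpt_4o_score        # how many times higher the newer score is
point_gain = gpt_5_score - gpt_4o_score   # absolute gain in percentage points

print(f"ratio: {ratio:.2f}x, gain: {point_gain:.1f} percentage points")
```

Under that assumption the ratio comes out just under 3, consistent with the article's wording.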

Silicon Valley has a wide range of benchmarks it uses to measure the progress of AI models and assess whether a given model is state-of-the-art. Among the most popular are AIME 2025 (a test of competitive math problems) and GPQA Diamond (a test of PhD-level science questions). However, several AI models are nearing saturation on some of these benchmarks, and many AI researchers have cited the need for better tests that can measure AI’s proficiency on real-world tasks.

Benchmarks such as GDPval could become increasingly important in that conversation, as OpenAI makes the case that its AI models are valuable for a wide range of industries. But OpenAI may need a more comprehensive version of the test to say definitively that its AI models can outperform humans.

bhanuprakash.cg
techtost.com
© 2026 TechTost. All Rights Reserved