Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Disrupt 2026 Early Bird ticket prices end May 29

Google is pitching an ecosystem of AI agents to consumers who might not buy it

Startup Battlefield 200 applications close before May 27 | TechCrunch

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    The Pope’s encyclical on artificial intelligence is not really about artificial intelligence

    25 May 2026

    Everyone is navigating real-time AI security — even Google

    25 May 2026

    I’ve tried Amazon’s Bee wearable and I’m a bit intrigued

    24 May 2026

    Elon Musk has given up on solar power (on Earth)

    24 May 2026

    Ferrari uses IBM AI to create F1 superfans

    23 May 2026
  • Apps

    Google is pitching an ecosystem of AI agents to consumers who might not buy it

    26 May 2026

    Founded by Tony Robbins and Calm alums, The Path hopes to offer safer treatment with artificial intelligence

    25 May 2026

    Spotify will reserve tickets for an artist’s top fans in an effort to fill the engagement

    25 May 2026

    Audio production app Huxe, founded by former NotebookLM developers, is shutting down

    24 May 2026

    Spotify’s AI bet: more of everything, less of what you want

    24 May 2026
  • Crypto

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025
  • Fintech

    Disrupt 2026 Early Bird ticket prices end May 29

    26 May 2026

    Startup Battlefield 200 applications close before May 27 | TechCrunch

    26 May 2026

    General Catalyst just led a $63 million bet in India’s travel payments market

    21 May 2026

    Startup Battlefield 200 applications close on May 27

    21 May 2026

    Venmo’s biggest makeover in years comes at a very interesting time

    11 May 2026
  • Hardware

    The Dreamie alarm clock made me stop using my phone in bed

    26 May 2026

    6 kitchen gadgets that make adult life easier

    25 May 2026

    Xreal, Google’s smart glasses partner, believes it has finally conquered this extremely difficult industry

    25 May 2026

    We tested Google’s AI glasses and they’re almost there

    23 May 2026

    Finnish phone maker HMD ropes Indian AI chatbot into new smartphone to reach local market

    22 May 2026
  • Media & Entertainment

    Spotify launches an audiobook creation tool powered by ElevenLabs

    22 May 2026

    New York City Mayor Zohran Mamdani Takes To Twitch To Chat With New Yorkers

    21 May 2026

    Clouted wants to take the guesswork out of making short videos go viral

    21 May 2026

    ‘Ask YouTube’ Brings AI Chat Search to Video, Adds Gemini Omni to Shorts

    20 May 2026

    Google’s Gemini Omni turns images, audio and text into video — and that’s just the beginning

    19 May 2026
  • Security

    Scammers abuse an internal Microsoft account to send spam links

    22 May 2026

    Law enforcement shuts down VPN service used by two dozen ransomware gangs

    21 May 2026

    GitHub says hackers stole data from thousands of internal repositories

    21 May 2026

    Customers say Trump Mobile is leaking their personal information

    20 May 2026

    US cyber agency CISA has exposed bundles of passwords and cloud keys to the open web

    19 May 2026
  • Startups

    What ClickUp’s mass layoff tells us about the future of work

    25 May 2026

    SolarSquare in talks to raise up to $60M as India’s rooftop solar market draws big VC interest

    24 May 2026

    This startup raised $43 million to create a hive mind for ships

    22 May 2026

    Maka Kids redefines kids’ screen time with a streaming app optimized for wellness, not engagement

    22 May 2026

    This new startup is taking on a fragrance industry that hasn’t changed in nearly half a century

    21 May 2026
  • Transportation

    Global EV market becomes K-shaped as US falls behind

    25 May 2026

    Tesla’s Full Self-Driving software is creeping into Europe

    25 May 2026

    TechCrunch Mobility: Robotaxi Reality Check

    24 May 2026

    Wayve’s self-driving technology is heading to US cars made by Stellantis

    24 May 2026

    How Elon Musk will increase his power through the SpaceX IPO

    23 May 2026
  • Venture

    The pitch trick that helped an eSports startup raise $20 million when VCs only wanted AI

    25 May 2026

    Peec, one of Berlin’s up-and-coming startups, more than doubled annual revenue in months to $10 million, sources say

    23 May 2026

    Convective Capital Raises $85M Fund to Build Disaster Resilience

    22 May 2026

    Sam Altman does a ‘mic drop’ pitch to every Y Combinator startup

    21 May 2026

    Startup Battlefield 200 applications close on May 27

    20 May 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»Openai says GPT-5 stacks people in a wide range of jobs
AI

Openai says GPT-5 stacks people in a wide range of jobs

techtost.comBy techtost.com28 September 202504 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Openai Says Gpt 5 Stacks People In A Wide Range Of
Share
Facebook Twitter LinkedIn Pinterest Email

Openai released a new benchmark On Thursday he tries how AI models operate compared to human professionals in a wide range of industries and jobs. Testing, GDPPVal, is an early attempt to understand how closely the Openai systems it is to overcome people in economically valuable work – a key part of the company’s founding mission to develop artificial general intelligence or agi.

Openai says it found that the GPT-5 model and Claude Opus 4.1 of ANTHROPIC are already approaching the quality of work produced by industry experts. ”

This does not mean that OpenAi models are going to start replacing people in their jobs immediately. Despite the forecasts of some CEO who AI will take people’s jobs in just a few years, Openai admits that GDPVal today covers a very limited number of duties that people do in their real jobs. However, it is one of the last ways in which the company measures AI’s progress towards this milestone.

GDPVAL is based on nine industries that contribute more to America’s gross domestic product, including sectors such as healthcare, funding, construction and government. The benchmark tries the performance of an AI model in 44 professions between these industries, ranging from software to nurses to journalists.

For the first version of the OpenAi test, the GDPVAL-V0, Openai asked experienced professionals to compare the reports created by AI with those produced by other professionals and then choose the best. For example, one prompt asked investment bankers to create a landscape of the latest mileage industry and compare them with reports created by AI. Openai then calculates on average the “rhythm of victory” of an AI model over human references to all 44 occupations.

For GPT-5-High, a version of the GPT-5 with additional computing power, the company states that the AI ​​model is classified as better than or is equivalent to industry experts 40.6% of the time.

Openai also examined Anthropic’s Claude Opus 4.1, which was classified as better than or equivalent to industry experts in 49% of duties. Openai says he believes Claude scored so high because of his tendency to make pleasant graphics instead of net performance.

TechCrunch event

Francisco
|
27-29 October 2025

Image credits:Open

It is worth noting that most professionals who work do much more than submitting research reports to their boss, which are all these GDPVAL-V0 tests. Openai recognizes this and says it plans to create more powerful tests in the future that can represent more industries and interactive work flows.

Nevertheless, the company sees the progress on GDPVAL as remarkable.

In an interview with TechCrunch, Openai’s chief economist, Dr. Aaron Chatterji, said the results of GDPVAL indicate that people in these jobs can now use AI models to spend time with more important tasks.

“[Because] The model is good in some of these things, “says Chatterji,” people in these jobs can now use the model, increasingly as the potential improves, to unload part of their job and potentially do higher things. “

Openai’s ratings lead Tejal Patwardhan tells TechCrunch that he is encouraged by the progress rate on GDPVAL. Openai’s GPT-4O model scored just 13.7% (victories and ties against people), released about 15 months ago. Now the GPT-5 scores almost triple this, a trend that Patwardhan expects to continue.

Silicon Valley has a wide range of reference criteria she uses to measure AI models and evaluate if a given model is state-of-the-art. Among the most popular are Aime 2025 (a test of competitive mathematical problems) and GPQA Diamond (a test of scientific questions at a doctoral level). However, several AI models are approaching satiety in some of these reference points, and many AI researchers have reported the need for better tests that can measure AI’s adequacy of real duties.

Reference points such as GDPPVal could become more and more important in this discussion, as OpenAi makes the assumption that AI models are valuable for a wide range of industries. But openai may need a more complete version of the test to permanently say that AI models can overcome people.

Automation ChatGPT Classical GPT-5 GPT5 jobs open OpenAI people range stacks wide
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleMeta launches ‘Vibes,’ A AI Slop short -formed video flow
Next Article Cohere hits $ 7b valuation a month after its last increase, it works with AMD
bhanuprakash.cg
techtost.com
  • Website

Related Posts

The Pope’s encyclical on artificial intelligence is not really about artificial intelligence

25 May 2026

Everyone is navigating real-time AI security — even Google

25 May 2026

I’ve tried Amazon’s Bee wearable and I’m a bit intrigued

24 May 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Disrupt 2026 Early Bird ticket prices end May 29

26 May 2026

Google is pitching an ecosystem of AI agents to consumers who might not buy it

26 May 2026

Startup Battlefield 200 applications close before May 27 | TechCrunch

26 May 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Disrupt 2026 Early Bird ticket prices end May 29

26 May 2026

Startup Battlefield 200 applications close before May 27 | TechCrunch

26 May 2026

General Catalyst just led a $63 million bet in India’s travel payments market

21 May 2026
Startups

What ClickUp’s mass layoff tells us about the future of work

SolarSquare in talks to raise up to $60M as India’s rooftop solar market draws big VC interest

This startup raised $43 million to create a hive mind for ships

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.