Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Apple and Netflix team up to stream Formula 1 Canadian Grand Prix

CISA replaces deputy director after a difficult year on the job

Superhuman bets on redesigned smart ring to win back US market after Oura controversy

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Anthropic CEO stands firm as Pentagon deadline looms

    27 February 2026

    Jack Dorsey just halved the size of Block’s employee base — and he says your company is next

    27 February 2026

    Salesforce CEO Marc Benioff: This isn’t our first SaaSpocalypse

    26 February 2026

    Gushwork is betting on AI prospecting for leads — and the first results are showing

    26 February 2026

    India’s AI boom prompts companies to trade short-term revenue for users

    25 February 2026
  • Apps

    Bumble adds AI photo feedback and profile guidance tools

    27 February 2026

    Threads is testing a shortcut to quickly start DM conversations

    27 February 2026

    Instagram now alerts parents if their teen is looking for suicide or self-harm content

    26 February 2026

    Snapchat announces ‘The Snappys’, its first creator awards show

    26 February 2026

    Discord delays global rollout of age verification after backlash

    25 February 2026
  • Crypto

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025

    MoviePass opens Mogul fantasy league game to the public

    29 October 2025
  • Fintech

    3 days left: Save up to $680 on your ticket to Disrupt 2026

    25 February 2026

    More startups surpass $10M ARR in 3 months than ever before

    24 February 2026

    Stripe, PayPal Ventures Bet on India’s Xflow to Fix Cross-Border B2B Payments

    24 February 2026

    InScope raises $14.5M to solve financial reporting pain

    20 February 2026

    OpenAI deepens India push with Pine Labs fintech partnership

    19 February 2026
  • Hardware

    Everything announced at Samsung’s Galaxy Unpacked event, including S26 smartphones, privacy screen and more

    26 February 2026

    Samsung introduces new display technology that adds a privacy screen to apps and notifications

    25 February 2026

    Oura launches a proprietary AI model focused on women’s health

    25 February 2026

    Spotify and Liquid Death are releasing a limited-edition speaker shaped like a … container?

    24 February 2026

    5 days left to lock in the lowest Disrupt 2026 rates

    23 February 2026
  • Media & Entertainment

    Apple and Netflix team up to stream Formula 1 Canadian Grand Prix

    27 February 2026

    Netflix pulls out of bid for Warner Bros. Discovery, giving studios, HBO and CNN to Ellison-owned Paramount

    27 February 2026

    Book the best deals for Disrupt 2026 | TechCrunch

    26 February 2026

    Americans now listen to podcasts more often than talk radio, study shows

    25 February 2026

    Music producer ProducerAI joins Google Labs

    25 February 2026
  • Security

    CISA replaces deputy director after a difficult year on the job

    27 February 2026

    Cisco Says Hackers Are Exploiting Critical Flaw To Break Into Large Customer Networks By 2023

    26 February 2026

    US cybersecurity agency CISA reportedly in dire straits amid Trump cuts and layoffs

    26 February 2026

    Treasury sanctions Russian zero-day broker accused of buying holdings stolen from US defense contractor

    25 February 2026

    Former L3Harris Trenchant boss jailed for selling hacking tools to Russian broker

    25 February 2026
  • Startups

    Superhuman bets on redesigned smart ring to win back US market after Oura controversy

    27 February 2026

    Trace raises $3 million to solve AI agent adoption in the enterprise

    27 February 2026

    How to avoid bad hires in early stage startups

    26 February 2026

    Apply to take the stage at Founder Summit 2026

    26 February 2026

    Ukrainian startups continue to build | TechCrunch

    25 February 2026
  • Transportation

    Self-driving truck startup Einride raises $113M PIPE ahead of public debut

    27 February 2026

    It’s time to pull the plug on plug-in hybrids

    26 February 2026

    Harbinger acquires self-driving company Phantom AI

    26 February 2026

    Waymo robotaxis are now operating in 10 US cities

    25 February 2026

    Self-driving tech startup Wayve raises $1.2 billion from Nvidia, Uber and three automakers

    25 February 2026
  • Venture

    Dive into Boston’s startup ecosystem at Founder Summit 2026 | TechCrunch

    27 February 2026

    A VC and some big-name developers are trying to solve the open source funding problem, permanently

    27 February 2026

    Y Combinator grad and AI insurance brokerage Harper raises $47 million

    26 February 2026

    Anthropic acquires AI startup Vercept after Meta indicts one of its founders

    26 February 2026

    Last 4 days to save up to $680 on your Disrupt 2026 Pass

    25 February 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»Openai says GPT-5 stacks people in a wide range of jobs
AI

Openai says GPT-5 stacks people in a wide range of jobs

techtost.comBy techtost.com28 September 202504 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Openai Says Gpt 5 Stacks People In A Wide Range Of
Share
Facebook Twitter LinkedIn Pinterest Email

Openai released a new benchmark On Thursday he tries how AI models operate compared to human professionals in a wide range of industries and jobs. Testing, GDPPVal, is an early attempt to understand how closely the Openai systems it is to overcome people in economically valuable work – a key part of the company’s founding mission to develop artificial general intelligence or agi.

Openai says it found that the GPT-5 model and Claude Opus 4.1 of ANTHROPIC are already approaching the quality of work produced by industry experts. ”

This does not mean that OpenAi models are going to start replacing people in their jobs immediately. Despite the forecasts of some CEO who AI will take people’s jobs in just a few years, Openai admits that GDPVal today covers a very limited number of duties that people do in their real jobs. However, it is one of the last ways in which the company measures AI’s progress towards this milestone.

GDPVAL is based on nine industries that contribute more to America’s gross domestic product, including sectors such as healthcare, funding, construction and government. The benchmark tries the performance of an AI model in 44 professions between these industries, ranging from software to nurses to journalists.

For the first version of the OpenAi test, the GDPVAL-V0, Openai asked experienced professionals to compare the reports created by AI with those produced by other professionals and then choose the best. For example, one prompt asked investment bankers to create a landscape of the latest mileage industry and compare them with reports created by AI. Openai then calculates on average the “rhythm of victory” of an AI model over human references to all 44 occupations.

For GPT-5-High, a version of the GPT-5 with additional computing power, the company states that the AI ​​model is classified as better than or is equivalent to industry experts 40.6% of the time.

Openai also examined Anthropic’s Claude Opus 4.1, which was classified as better than or equivalent to industry experts in 49% of duties. Openai says he believes Claude scored so high because of his tendency to make pleasant graphics instead of net performance.

TechCrunch event

Francisco
|
27-29 October 2025

Image credits:Open

It is worth noting that most professionals who work do much more than submitting research reports to their boss, which are all these GDPVAL-V0 tests. Openai recognizes this and says it plans to create more powerful tests in the future that can represent more industries and interactive work flows.

Nevertheless, the company sees the progress on GDPVAL as remarkable.

In an interview with TechCrunch, Openai’s chief economist, Dr. Aaron Chatterji, said the results of GDPVAL indicate that people in these jobs can now use AI models to spend time with more important tasks.

“[Because] The model is good in some of these things, “says Chatterji,” people in these jobs can now use the model, increasingly as the potential improves, to unload part of their job and potentially do higher things. “

Openai’s ratings lead Tejal Patwardhan tells TechCrunch that he is encouraged by the progress rate on GDPVAL. Openai’s GPT-4O model scored just 13.7% (victories and ties against people), released about 15 months ago. Now the GPT-5 scores almost triple this, a trend that Patwardhan expects to continue.

Silicon Valley has a wide range of reference criteria she uses to measure AI models and evaluate if a given model is state-of-the-art. Among the most popular are Aime 2025 (a test of competitive mathematical problems) and GPQA Diamond (a test of scientific questions at a doctoral level). However, several AI models are approaching satiety in some of these reference points, and many AI researchers have reported the need for better tests that can measure AI’s adequacy of real duties.

Reference points such as GDPPVal could become more and more important in this discussion, as OpenAi makes the assumption that AI models are valuable for a wide range of industries. But openai may need a more complete version of the test to permanently say that AI models can overcome people.

Automation ChatGPT Classical GPT-5 GPT5 jobs open OpenAI people range stacks wide
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleMeta launches ‘Vibes,’ A AI Slop short -formed video flow
Next Article Cohere hits $ 7b valuation a month after its last increase, it works with AMD
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Anthropic CEO stands firm as Pentagon deadline looms

27 February 2026

A VC and some big-name developers are trying to solve the open source funding problem, permanently

27 February 2026

Jack Dorsey just halved the size of Block’s employee base — and he says your company is next

27 February 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Apple and Netflix team up to stream Formula 1 Canadian Grand Prix

27 February 2026

CISA replaces deputy director after a difficult year on the job

27 February 2026

Superhuman bets on redesigned smart ring to win back US market after Oura controversy

27 February 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

3 days left: Save up to $680 on your ticket to Disrupt 2026

25 February 2026

More startups surpass $10M ARR in 3 months than ever before

24 February 2026

Stripe, PayPal Ventures Bet on India’s Xflow to Fix Cross-Border B2B Payments

24 February 2026
Startups

Superhuman bets on redesigned smart ring to win back US market after Oura controversy

Trace raises $3 million to solve AI agent adoption in the enterprise

How to avoid bad hires in early stage startups

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.