Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Netflix expands revamped mobile app across Asia and doubles down on games for kids

North Koreans behind nearly half of US tech industry hacks, CrowdStrike says

Datadog veterans launch AI coding startup Niteshift in a bet against Big AI lock-in

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    How memory tools can make AI models worse

    10 June 2026

    Google just fired a warning shot in the AI ​​subscription price wars

    10 June 2026

    Sandstone raises $30M to bring AI to in-house legal teams

    9 June 2026

    Because Apple’s slow and steady AI bet is starting to look pretty smart

    9 June 2026

    Amazon now lets you design custom merchandise using AI

    8 June 2026
  • Apps

    Zest Launches Restaurant Discovery App Powered by Where People Really Eat

    10 June 2026

    iOS 27 features we didn’t see on stage

    10 June 2026

    Apple says it can remove some apps from the App Store if they don’t attract users

    9 June 2026

    Apple’s WWDC AI demos seemed more real after $250 million false ad settlement

    9 June 2026

    The new update of NotebookLM will help you to create source repository from chat

    8 June 2026
  • Crypto

    Startup Battlefield 200 applications close today

    27 May 2026

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026
  • Fintech

    Ramp raises $750M at $44B valuation as investors thirst for fintechs with AI history

    5 June 2026

    Last 24 hours to save up to $410 on your Disrupt 2026 ticket

    29 May 2026

    2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

    28 May 2026

    Robinhood now allows your AI agents to trade stocks

    28 May 2026

    Disrupt 2026 Early Bird ticket savings expire in 3 days

    27 May 2026
  • Hardware

    WWDC 2026: What to expect, from Siri’s long-awaited revamp to Apple Intelligence and iOS 27

    9 June 2026

    What to expect from WWDC 2026: The long-awaited Siri refresh and Apple Intelligence updates

    7 June 2026

    What to expect from WWDC 2026: The long-awaited Siri refresh and Apple Intelligence updates

    5 June 2026

    Oura Ring 5 review: Thinner, lighter, better

    4 June 2026

    Meta mercifully released the VR fitness game Supernatural instead of just killing it

    4 June 2026
  • Media & Entertainment

    Netflix expands revamped mobile app across Asia and doubles down on games for kids

    10 June 2026

    Plex adds new social features ahead of major price hike for its lifetime pass

    6 June 2026

    Startup Battlefield 200 applications officially close in 3 days

    5 June 2026

    Founders Fund Launches Series of Games Starring Sam Altman, Palmer Luckey and Other Tech Elites

    5 June 2026

    Meet Wander, a StumbleUpon-inspired tool for discovering the ‘small web’

    4 June 2026
  • Security

    North Koreans behind nearly half of US tech industry hacks, CrowdStrike says

    10 June 2026

    Massachusetts votes in favor of new privacy bill that bans sale of precise location data

    9 June 2026

    WhatsApp says it has detected new spyware attacks linked to the NSO group in violation of a court order

    9 June 2026

    Microsoft’s open source tools hacked to steal AI developers’ passwords

    8 June 2026

    Hacked, leaked and held for ransom: the worst breaches of 2026 so far

    7 June 2026
  • Startups

    Datadog veterans launch AI coding startup Niteshift in a bet against Big AI lock-in

    10 June 2026

    Evotrex raises $30 million to build RV that doesn’t need a charging station

    10 June 2026

    Zepto’s IPO filing reveals fast growth, bigger losses and a valuation question no one has yet answered

    9 June 2026

    How to apply to Startup Battlefield 2026, what you need before today’s June 8 deadline

    8 June 2026

    Sam Altman-backed fusion startup Helion raises $465M to build power plant for Microsoft

    6 June 2026
  • Transportation

    Because everyone is an energy company now

    10 June 2026

    Top Lucid Motors executive exits amid new CEO shakeup

    10 June 2026

    Rivian begins deliveries of its all-important R2 SUV

    9 June 2026

    Waymo bought Apple’s self-driving car for $220 million

    9 June 2026

    Uber, Wayve and Waymo are heading for a robot showdown in London

    8 June 2026
  • Venture

    Why business AI will be the focus of VivaTech 2026

    10 June 2026

    How Justin Ernest invested nearly $500 million in hot startups without a traditional VC fund

    10 June 2026

    Mercor’s Brendan Foody calls out Sequoia, accusing it of “double pricing” valuation tricks.

    9 June 2026

    Founders share VC horror stories and some name names

    6 June 2026

    Defense technology, artificial intelligence and fundraising take center stage at StrictlyVC Los Angeles

    5 June 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»OpenAI believes superhuman artificial intelligence is coming — and wants to build tools to control it
AI

OpenAI believes superhuman artificial intelligence is coming — and wants to build tools to control it

techtost.comBy techtost.com15 December 202308 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Openai Believes Superhuman Artificial Intelligence Is Coming And Wants
Share
Facebook Twitter LinkedIn Pinterest Email

While investors were getting ready to go nuclear after Sam Altman’s unceremonious departure from OpenAI and Altman planning his return to the company, members of OpenAI’s Superalignment team were busily dealing with the problem of how to control the artificial intelligence that is smarter than humans.

Or at least, that’s the impression they’d like to give.

This week, I got on the phone with three of the Superalignment team members — Collin Burns, Pavel Izmailov and Leopold Aschenbrenner — who were in New Orleans at NeurIPS, the annual machine learning conference, to present OpenAI’s newest project to ensure that AI systems behave as intended.

OpenAI formed the Superalignment team in July to develop ways to guide, regulate and govern “superintelligent” AI systems — that is, theoretical systems with intelligence that far exceeds that of humans.

“Today, we can basically align models that are dumber than us or maybe human-level maximumBurns said. “Aligning a model that is actually smarter than us is much, much less obvious — how can we do that?”

The Superalignment effort is led by OpenAI co-founder and chief scientist Ilya Sutskever, who didn’t disappoint in July — but certainly now, in light of the fact that Sutskever was among those who initially pushed for Altman’s firing. While some reference suggests that Sutskever is in a “vacuum state” after Altman’s return, OpenAI’s PR tells me that Sutskever is indeed – as of today, at least – still head of the Superalignment team.

Hyper-alignment is a bit of a touchy subject in the AI ​​research community. Some argue that the subfield is premature. others imply it is a red herring.

While Altman has invited comparisons between OpenAI and the Manhattan Project, going so far as to assemble a team to explore artificial intelligence models to protect against “catastrophic risks,” including chemical and nuclear threats, some experts say there is little evidence that indicate the startup’s technology will acquire vast, world-beating capabilities anytime soon — or ever. Claims of impending superintelligence, these experts add, only serve to deliberately distract and divert attention from the pressing AI regulatory issues of the day, such as algorithmic bias and AI’s propensity for toxicity.

For what it’s worth, Sutskever seems to think fervently that AI — not OpenAI per se, but some incarnation of it — could one day become an existential threat. According to information, he reached the spot supply and combustion a wooden dummy in an off-site company to demonstrate its commitment to preventing artificial intelligence from harming humanity, and orders a significant amount of OpenAI computing—20% of its existing computer chips—for the Superalignment team’s research.

“The progress of artificial intelligence recently has been extremely fast, and I can assure you that it is not slowing down,” Aschenbrenner said. “I think we’ll get to human-level systems very soon, but it won’t stop there — we’ll go straight to superhuman systems… So how do we align superhuman AI systems and make them safe? It really is a problem for all of humanity — perhaps the most important unsolved technical problem of our time.”

The Superalignment team is currently trying to build governance and control frameworks that could they apply well to future powerful AI systems. It’s no simple task, as the definition of “superintelligence” — and whether a particular AI system has achieved it — is hotly debated. But the approach the team has come up with for now involves using a weaker, less sophisticated AI model (eg GPT-2) to guide a more advanced, sophisticated model (GPT-4) in desired directions — and away from spam.

An image illustrating the AI-based Superalignment team’s analogy for the alignment of superintelligent systems. Image Credits: OpenAI

“A lot of what we’re trying to do is tell a model what to do and make sure it does it,” Burns said. “How can we get a model to follow instructions and get a model to only help with things that are real and not things that are made up? How can we get a model to tell us if the code it generated is safe or obscene? These are the types of tasks we want to be able to achieve with our research.”

But wait, you might say – what does AI that guides AI have to do with preventing AI that threatens humanity? Well, it’s an analogy: The weak model is meant to be a stand-in for human supervisors, while the strong model represents the super-intelligent AI. Similar to humans who may not be able to understand a super-intelligent AI system, the weak model cannot “get” all the complexities and nuances of the strong model – making the setup useful for proving hyper-alignment hypotheses, the team says Superalignment.

“You can think of a sixth grader trying to supervise a college student,” Izmailov explained. “Suppose the sixth grader tries to tell the student about a task that he somehow knows how to solve… Although the sixth grader’s supervision may have errors in the details, there is hope that the college student would understand the substance and would be able to do the work better than the supervisor.”

In the Superalignment group setting, a weak model fine-tuned to a particular task creates tags that are used to “communicate” the broad strokes of that task to the strong model. Given these labels, the strong model can more or less correctly generalize according to the weak model’s intent — even if the weak model’s labels contain errors and biases, the team found.

The weak-strong model approach may even lead to breakthroughs in the field of hallucinations, the team claims.

“Illusions are actually very interesting, because internally, the model actually knows whether what it’s saying is fact or fiction,” Aschenbrenner said. “But the way these models are trained today, the human supervisors reward them with ‘up’, ‘up down’ for saying things. So sometimes, inadvertently, people reward the model for saying things that are either false or that the model doesn’t actually know about, and so on. If “If we’re successful in our research, we should develop techniques where we can basically call the knowledge of the model, and we could apply that call to whether something is fact or fiction and use that to reduce hallucinations.”

But the analogy is not perfect. So OpenAI wants to gather ideas.

To that end, OpenAI is launching a $10 million grant program to support technical research on superintelligent alignment, portions of which will go to academic labs, nonprofits, individual researchers, and graduate students. OpenAI also plans to host an academic hyperalignment conference in early 2025, where it will share and promote the work of the hyperalignment prize finalists.

Interestingly, part of the funding for the grant will come from former Google CEO and chairman Eric Schmidt. Schmidt — a staunch supporter of Altman — is fast becoming a poster child for AI doom, arguing that the arrival of dangerous AI systems is imminent and that regulators aren’t doing enough to prepare. It’s not necessarily out of a sense of altruism — the petition Protocol and Wired Note that Schmidt, an active AI investor, stands to benefit enormously commercially if the US government implements his proposed plan to boost AI research.

Donating can be seen as virtue signaling through a cynical lens. Schmidt’s personal fortune is estimated at $24 billion, and he has poured hundreds of millions into other, arguably less focused on ethics AI ventures and capital — including his own.

Schmidt denies this is happening, of course.

“Artificial intelligence and other emerging technologies are reshaping our economy and society,” he said in an emailed statement. “Ensuring that they are aligned with human values ​​is critical and I am proud to support OpenAI’s new [grants] to develop and control artificial intelligence responsibly for public benefit”.

Indeed, the involvement of a figure with such transparent commercial motives begs the question: Will OpenAI’s superalignment research and the research it encourages the community to submit to its future conference be made available to anyone to use as they see fit?

The Superalignment team assured me that, yes, both OpenAI research — including code — and the work of others who receive OpenAI grants and awards for superalignment-related work will be publicly shared. We will keep the company on it.

“Contributing not only to the safety of our models, but also to the safety of other labs’ models and advanced artificial intelligence in general is part of our mission,” Aschenbrenner said. “It’s really the core of our mission to build [AI] for the benefit of all mankind, safely. And we believe that doing this research is absolutely necessary to make it beneficial and safe.”

All included artificial believes build coming control intelligence OpenAI Research superhuman tools
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleThree years after its refresh, the Firefox Android browser adds 450+ new extensions
Next Article What’s up with all these new venture funds?
bhanuprakash.cg
techtost.com
  • Website

Related Posts

How memory tools can make AI models worse

10 June 2026

Evotrex raises $30 million to build RV that doesn’t need a charging station

10 June 2026

Google just fired a warning shot in the AI ​​subscription price wars

10 June 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Netflix expands revamped mobile app across Asia and doubles down on games for kids

10 June 2026

North Koreans behind nearly half of US tech industry hacks, CrowdStrike says

10 June 2026

Datadog veterans launch AI coding startup Niteshift in a bet against Big AI lock-in

10 June 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Ramp raises $750M at $44B valuation as investors thirst for fintechs with AI history

5 June 2026

Last 24 hours to save up to $410 on your Disrupt 2026 ticket

29 May 2026

2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

28 May 2026
Startups

Datadog veterans launch AI coding startup Niteshift in a bet against Big AI lock-in

Evotrex raises $30 million to build RV that doesn’t need a charging station

Zepto’s IPO filing reveals fast growth, bigger losses and a valuation question no one has yet answered

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.