TechTost

OpenAI’s models “memorized” copyrighted content, new study suggests

By techtost.com, 4 April 2025, 3 Mins Read

A new study appears to lend credence to claims that OpenAI trained at least some of its AI models on copyrighted content.

OpenAI is embroiled in lawsuits brought by authors, programmers and other rights holders who accuse the company of using their works (books, codebases and so on) to develop its models without permission. OpenAI has long claimed a fair use defense, but the plaintiffs in these cases argue that there is no carve-out in US copyright law for training data.

The study, co-authored by researchers at the University of Washington, the University of Copenhagen and Stanford, proposes a new method for identifying training data “memorized” by models behind an API, such as OpenAI’s.

Models are prediction engines. Trained on large amounts of data, they learn patterns, which is how they are able to generate essays, photos and more. Most outputs are not verbatim copies of the training data, but because of the way models “learn,” some inevitably are. Image models have been found to regurgitate screenshots from movies they were trained on, while language models have been observed effectively plagiarizing news articles.

The study’s method relies on words that the co-authors call “high-surprisal,” that is, words that stand out as uncommon in the context of a larger body of work. For example, the word “radar” in the sentence “Jack and I sat perfectly still with the radar humming” would be considered high-surprisal because it is statistically less likely than words such as “engine” or “radio” to appear before “humming.”
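
To make the idea concrete, here is a minimal sketch that assumes “surprisal” means the standard information-theoretic quantity, the negative log of a word’s probability in context. The article does not spell out the study’s exact scoring, and the probabilities below are invented purely for illustration.

```python
import math

# Hypothetical next-word probabilities for the context
# "Jack and I sat perfectly still with the ___ humming".
# These numbers are made up for illustration; they are not from the study.
candidate_probs = {
    "engine": 0.30,
    "radio": 0.20,
    "radar": 0.001,
}


def surprisal_bits(prob: float) -> float:
    """Return surprisal in bits: the less likely a word, the higher the score."""
    if not 0.0 < prob <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(prob)


# Score every candidate and pick the one that stands out the most.
scores = {word: surprisal_bits(p) for word, p in candidate_probs.items()}
most_surprising = max(scores, key=scores.get)

for word, bits in sorted(scores.items(), key=lambda kv: kv[1]):
    print(f"{word:>6}: {bits:.2f} bits")
print("highest surprisal:", most_surprising)
```

Under these assumed probabilities, “radar” gets by far the highest score, which is what makes it a useful word to hide from a model.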

The co-authors probed several OpenAI models, including GPT-4 and GPT-3.5, for signs of memorization by removing high-surprisal words from snippets of fiction books and New York Times articles and having the models try to “guess” which words had been masked. If the models managed to guess correctly, they likely memorized the snippet during training, the co-authors concluded.

An example of having a model “guess” a high-surprisal word. Image credits: OpenAI
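
The masking test described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the researchers’ code: `mask_word`, `guess_accuracy` and `toy_model` are hypothetical names, the second example sentence is invented, and `toy_model` is a stand-in for a real model behind an API.

```python
from typing import Callable, List, Tuple

MASK = "[MASK]"


def mask_word(passage: str, target: str) -> str:
    """Replace the first occurrence of the target word with a mask token."""
    words = passage.split()
    if target not in words:
        raise ValueError(f"{target!r} not found in passage")
    words[words.index(target)] = MASK
    return " ".join(words)


def guess_accuracy(
    examples: List[Tuple[str, str]],
    guess_fn: Callable[[str], str],
) -> float:
    """Fraction of masked words that the model restores exactly."""
    if not examples:
        return 0.0
    hits = 0
    for passage, target in examples:
        guess = guess_fn(mask_word(passage, target))
        if guess.strip().lower() == target.lower():
            hits += 1
    return hits / len(examples)


def toy_model(masked_passage: str) -> str:
    """Stand-in model: it has 'memorized' one passage and otherwise
    falls back to a statistically common word."""
    memorized = {
        "Jack and I sat perfectly still with the [MASK] humming": "radar",
    }
    return memorized.get(masked_passage, "engine")


# (passage, high-surprisal word) pairs; the second pair is invented.
examples = [
    ("Jack and I sat perfectly still with the radar humming", "radar"),
    ("The keeper polished the brass sextant every night", "sextant"),
]

accuracy = guess_accuracy(examples, toy_model)
print(f"exact-match accuracy: {accuracy:.2f}")
```

The design point is that an unlikely word is hard to recover from context alone, so a high exact-match rate on such words hints at memorization rather than a lucky guess.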

According to the test results, GPT-4 showed signs of having memorized portions of popular fiction books, including books in a dataset of copyrighted e-book samples called BookMIA. The results also suggest that the model memorized portions of New York Times articles, though at a comparatively lower rate.

Abhilasha Ravichander, a doctoral student at the University of Washington and a co-author of the study, told TechCrunch that the findings shed light on the “contentious data” that models might have been trained on.

“In order to have large language models that are trustworthy, we need to have models that we can probe and audit and examine scientifically,” Ravichander said. “Our work aims to provide a tool to probe large language models, but there is a real need for greater data transparency in the whole ecosystem.”

OpenAI has long advocated for looser restrictions on developing models using copyrighted data. While the company has certain content licensing deals in place and offers opt-out mechanisms that allow copyright owners to flag content they would prefer the company not use for training purposes, it has lobbied several governments to codify “fair use” rules for AI training.

bhanuprakash.cg
techtost.com
© 2026 TechTost. All Rights Reserved