Whatfinger News Content
    What's Hot

    Is this Keir Starmer’s biggest week as PM?

    May 12, 2025

    Trump Infuriates India With Ceasefire Announcement | Insight with Haslinda Amin | 5/12/2025

    May 12, 2025

    ‘BUDGET HAWKS’: This member ‘needs to be watched’ for Trump’s budget bill, warns WSJ reporter

    May 12, 2025
    Whatfinger News Headlines

    Is this Keir Starmer’s biggest week as PM?

    May 12, 2025

    Trump Infuriates India With Ceasefire Announcement | Insight with Haslinda Amin | 5/12/2025

    May 12, 2025

    ‘BUDGET HAWKS’: This member ‘needs to be watched’ for Trump’s budget bill, warns WSJ reporter

    May 12, 2025

    Starmer announces sweeping migration reform at Downing Street news conference

    May 12, 2025

    Trump’s Executive Order on Drugs, Pharma Costs: What We Know

    May 12, 2025

    Indian Diplomat on the India-Pakistan Ceasefire

    May 12, 2025

    US Treasury Secretary speaks after China trade talks

    May 12, 2025

    Far-right Hindus vandalise Indian-owned bakery demanding owners change its name

    May 12, 2025
    Facebook Twitter Instagram
    Monday, May 12
    • Whatfinger®
    • Breaking
    • Videos
    • Fast Clips
    • Entertainment
    • Military
    • Sports
    • Humor
    • Money
    • Daily List
    • World
    • Daily Paper
    • Sci-Tech
    • Top 3
    • Chat GPT
    • Choice Clips
    • About
    • Retirement
    Whatfinger News ContentWhatfinger News Content
    CLICK HERE for all posts on this page
    Whatfinger News Content
    Home » Risks of Model Collapse in AI – Challenges of Training on AI-Generated Data

    Risks of Model Collapse in AI – Challenges of Training on AI-Generated Data

    July 25, 2024 Science & Tech 3 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Recent research by Ilia Shumailov and his team at Google DeepMind highlights a significant challenge for the future of AI language models (LLMs). They discovered that if LLMs are trained predominantly on AI-generated content, it leads to a phenomenon called “model collapse.” This occurs when new generations of models, using data produced by older AI models, start to misinterpret reality and degrade in performance.

    Researchers have discovered AI’s worst enemy — its data. https://t.co/OO0v6RH00R

    — Randy Kemp (@randylewiskemp) July 24, 2024

    The study, published in Nature, showed that LLMs trained on AI-generated data tend to forget less common elements from their original training sets. For instance, if a model is tasked with generating images of tourist landmarks, it might overly focus on popular sites like the Statue of Liberty, eventually ignoring other landmarks altogether. This repetitive focus can result in the models generating meaningless or repetitive phrases, such as “tailed jackrabbits,” as seen in their experiments.

    The issue of model collapse raises concerns about the future of machine learning advancements. The research suggests that while model collapse can affect any LLM, the severity depends on the model’s architecture, learning processes, and the quality of data it uses. This situation echoes past challenges faced by search engines, which had to adjust their algorithms due to content farms flooding the internet with low-quality articles.

    For the average user, this problem might not be immediately noticeable, as major chatbot creators conduct thorough evaluations to prevent such degradation. However, for AI companies, understanding and addressing model collapse is crucial. Using high-quality, human-generated content for training could be a solution, as it provides more reliable data than AI-generated content.

    AI models collapse when trained on recursively generated data

    “the value of data collected about genuine human interactions with systems will be increasingly valuable in the presence of LLM-generated content in data crawled from the Internet.”https://t.co/w8oqVaS5sl

    — Joshua Grubb (@entogrubb) July 25, 2024

    The research also points to the potential value of platforms like Reddit, where human interactions generate a wealth of content. Companies like Google and OpenAI have already made deals with such platforms, recognizing the importance of quality data in developing robust AI models.

    Key Points:

    • Research by Google DeepMind found that LLMs trained on AI-generated data can suffer “model collapse.”
    • Model collapse leads to repetitive and degraded responses, as models forget less common data elements.
    • The problem can slow machine learning advancements and requires high-quality data for training.
    • Major chatbot creators can detect and prevent degradation through evaluations.
    • Platforms like Reddit, with human-generated content, offer valuable training data for AI models.

    RM Tomi – Reprinted with permission of Whatfinger News

    Keep Reading

    Discovery of Giant Black Hole Jets Reshapes Our View of the Universe

    Mars Unveils “Freya Castle”: A Mysterious Striped Rock Discovered by Perseverance Rover

    Soyuz Returns from ISS: Oleg Kononenko, Tracy Dyson, and Nikolai Chub Conclude Record-Breaking Space Mission

    SpaceX’s Crew-9 Mission Prepares for Unplanned ISS Rescue Amid Starliner Setbacks

    Falcon 9 Launch Expands Starlink’s Direct-to-Cell Network and Marks 94th SpaceX Mission of 2024

    SpaceX Crew-9 Mission Delayed to September 26, NASA Prioritizes Safety Amid Complexities

    Add A Comment

    Leave A Reply Cancel Reply

    Content Partners

    If you need to log in to transfer posts to your WordPress site – CLICK HERE

    If you need to contact us, please email us at editor@whatfinger.com

    or use our form HERE

    Our Landing Page for these Content and Traffic Services

    Categories & Posts Ready To Take For Your Wordpress Site

    Original News Content Daily

    • Business And Money (articles)
    • Entertainment (articles)
    • Science & Tech (articles)
    • Top News (articles)
    • World News (articles)

    Videos (YouTube and Twitter)

    • A.I. Vids
    • Business & Money (video)
    • Entertainment (Videos)
    • Humor (articles and vids)
    • Mainstream News Vids (ABC, CBS, Fox, Etc)
    • Religious Vids
    • Sports  (articles – small selection – not many interested)
    • Sports Videos (Large selection)
    • Top Vids
    • Tweets (Vids)
    • World News (Vids)
    IMPORTANT INFO on our new Content program

    Please Read – Important To All: Our content services are new as of June 3, 2024 – We are now adding as of today HUNDREDS of pages you can take as your own daily with just a click, including now 30 plus originally written news items per day in Entertainment, Top News, Science & Tech, Business & Money and World News.

    If there is a topic you would like to see more of please email us at editor@whatfinger.com.

    If you have favorite video channels from YouTube or Twitter (X) accounts you want to easily use the content from, please do not hesitate to let us know.

    If you are interested in getting a ton of traffic direct form Whatfinger News,  or want to add massive amounts of new content daily to your own site… CLICK HERE for our landing page on the services

    – Mal Antoni

    Whatfinger Content Services click below for all the details
    The Following is our widget. This one is for Top Political News, but widgets can be for World News, Entertainment, Sports, Or Business-Money to fit your site. As part of our deals for a deep discount on our Content, place our widget on your pages. You can also have a widget below designed for below the article.. That one has 6 smaller links.
    undefined
    titleDemocrats PIVOT as Trump numbers come in! Liberal Hivemind...
    titleGeneral Flynn: I’m shocked that Fox News aired this on Epstein...
    titleGerman advisor Merz hid a spoon used for cocaine, while French President Macron concealed a bag of i...
    titleDonald Trump reportedly FURIOUS with Amy Coney Barrett. - Liberal Hivemind...
    More news daily than any other news site on Earth. All sources, all on one page! BAM! There can be ONLY one… CLICK BELOW (This is NOT part of our widget)
    Empower Your Online Presence with Our Web Development Solutions!

    Looking to start from scratch or revamp your news, blog, business, or e-commerce website? Look no further! Our expert team specializes in creating stunning, high-performance WordPress websites tailored to your unique needs.

    🚀 Get Started Today! contact us at nephilainc@gmail.com to get a quote!
    Or
    Skype:info.mighty

    Don’t just build a website—create an experience!

    Type above and press Enter to search. Press Esc to cancel.