If the AI model repeatedly learns with self-generated data, the model’s performance

1
If the AI model repeatedly learns with self-generated data, the model’s performance may deteriorate and eventually collapse. It deteriorates gradually as if inbreeding, and the model loses diversity and produces extremely biased results.
It is said that the content created by AI is already overflowing, affecting Google’s search quality. Then, will this phenomenon continue to intensify in the future, leaving only trash on the web, and the AI model will collapse?

2
It is unlikely. Similar things have already happened in the history of the web. It is one of the backgrounds of the rapid emergence of early SNS such as Facebook and Twitter. Existing blogs and web pages were commercially abused, and the reliability of the information was greatly reduced. The existing web ecosystem has been greatly damaged by content abuse using SEO optimization and the flooding of advertising articles.
In this situation, early SNS was recognized as a reliable source of information for real people and connections. In a way, the recent popularity of influencer marketing is similar to this. Influencers build a kind of trust relationship with their followers. Advertisements or recommendations based on this work much more effectively than general advertisements.

3
In any case, a recent study found that maintaining about 10% of the original data could prevent the model from collapsing. In other words, the importance of high-quality human-generated data in AI learning has grown. The value of human-made “real” data increases as AI-generated data increases rapidly. Big technologies are already striving to secure high-quality text data from prestigious media and academic journals.

4
As AI-generated data increases rapidly, there will be more wasteful content, but that will not disrupt the web. Instead, artificial intelligence and foundation models will serve as tools to identify reliable information sources.
Just as early SNS secured the reliability of information through connections, AI can evaluate the reliability of information by analyzing relationships and patterns between data. Similar to selecting friends or followers on SNS, the foundation model can build a personalized trust network by learning the user’s interests, specialties, and trusted information sources.

5
Where can I get a large amount of human-made “real” data? It is big tech that has a platform that can collect a large amount of actual data from users. Data quality and diversity management become more important in AI model development, which leads to an increase in development costs. Only big techs can afford these costs.
The serious monopoly is getting worse. These are the things you really need to worry about.

tslaaftermarket

Share
Published by
tslaaftermarket

Recent Posts

Tesla News Recaps A Little Life From Hell

Tesla News Recaps A Little Life From Hell Tesla Expands Invitations to Robotaxi ServicesTesla is…

23시간 ago

Always a crisis is basically a life, Tesla news recap

Always a crisis is basically a life, Tesla news recap Grok 4 Releases This Wednesday…

2일 ago

Warren Buffett purchased a house in Omaha, Nebraska for $31,500 in 1958.

Warren Buffett purchased a house in Omaha, Nebraska for $31,500 in 1958. It's been a…

4일 ago

Tearful, Tesla News Summary

Tearful, Tesla News Summary U.S. House of Representatives Passes Legislation To End Electric Vehicle SubsidiesThe…

6일 ago

Satisfied EV Deliveries, Tesla News Summary

Satisfied EV Deliveries, Tesla News Summary Tesla Q2 Deliveries Earnings Meet Market ExpectationsTesla delivered a…

7일 ago

Tesla News Summary Now Immersed In Politics

Tesla News Summary Now Immersed In Politics Elon Warns Of Establishing A New Party"If this…

1주 ago