arxiv.org/abs/2305.17493

Preview meta tags from the arxiv.org website.

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

https://arxiv.org/abs/2305.17493

The Curse of Recursion: Training on Generated Data Makes Models Forget

Stable Diffusion revolutionised image creation from descriptive text. GPT-2, GPT-3(.5) and GPT-4 demonstrated astonishing performance across a variety of language tasks. ChatGPT introduced such language models to the general public. It is now clear that large language models (LLMs) are here to stay, and will bring about drastic change in the whole ecosystem of online text and images. In this paper we consider what the future might hold. What will happen to GPT-{n} once LLMs contribute much of the language found online? We find that use of model-generated content in training causes irreversible defects in the resulting models, where tails of the original content distribution disappear. We refer to this effect as Model Collapse and show that it can occur in Variational Autoencoders, Gaussian Mixture Models and LLMs. We build theoretical intuition behind the phenomenon and portray its ubiquity amongst all learned generative models. We demonstrate that it has to be taken seriously if we are to sustain the benefits of training from large-scale data scraped from the web. Indeed, the value of data collected about genuine human interactions with systems will be increasingly valuable in the presence of content generated by LLMs in data crawled from the Internet.

Bing

The Curse of Recursion: Training on Generated Data Makes Models Forget

https://arxiv.org/abs/2305.17493

DuckDuckGo

https://arxiv.org/abs/2305.17493

The Curse of Recursion: Training on Generated Data Makes Models Forget

General Meta Tags
20
- title
  [2305.17493] The Curse of Recursion: Training on Generated Data Makes Models Forget
- title
  open search
- title
  open navigation menu
- title
  contact arXiv
- title
  subscribe to arXiv mailings
Open Graph Meta Tags
10
- og:type
  website
- og:site_name
  arXiv.org
- og:title
  The Curse of Recursion: Training on Generated Data Makes Models Forget
- og:url
  https://arxiv.org/abs/2305.17493v3
- og:image
  /static/browse/0.3.4/images/arxiv-logo-fb.png
Twitter Meta Tags
6
- twitter:site
  @arxiv
- twitter:card
  summary
- twitter:title
  The Curse of Recursion: Training on Generated Data Makes Models Forget
- twitter:description
  Stable Diffusion revolutionised image creation from descriptive text. GPT-2, GPT-3(.5) and GPT-4 demonstrated astonishing performance across a variety of language tasks. ChatGPT introduced such...
- twitter:image
  https://static.arxiv.org/icons/twitter/arxiv-logo-twitter-square.png
Link Tags
12
- apple-touch-icon
  /static/browse/0.3.4/images/icons/apple-touch-icon.png
- canonical
  /abs/2305.17493
- icon
  /static/browse/0.3.4/images/icons/favicon-32x32.png
- icon
  /static/browse/0.3.4/images/icons/favicon-16x16.png
- manifest
  /static/browse/0.3.4/images/icons/site.webmanifest

arxiv.org/abs/2305.17493

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

The Curse of Recursion: Training on Generated Data Makes Models Forget

Bing

The Curse of Recursion: Training on Generated Data Makes Models Forget

DuckDuckGo

The Curse of Recursion: Training on Generated Data Makes Models Forget

General Meta Tags

Open Graph Meta Tags

Twitter Meta Tags

Link Tags

Links