builders.mozilla.org/announcing-common-corpus

Preview meta tags from the builders.mozilla.org website.

Linked Hostnames

11

Search Engine Appearance

Google

https://builders.mozilla.org/announcing-common-corpus

Announcing Common Corpus

We released Common Corpus, the largest fully open dataset of over 2 trillion tokens. Pleias is committed to training LLMs in the open. This means not only releasing our models but also being open about every aspect, from the training data to the training code. We define “open” strictly: all data must be both accessible […]



Bing

Announcing Common Corpus

https://builders.mozilla.org/announcing-common-corpus

We released Common Corpus, the largest fully open dataset of over 2 trillion tokens. Pleias is committed to training LLMs in the open. This means not only releasing our models but also being open about every aspect, from the training data to the training code. We define “open” strictly: all data must be both accessible […]



DuckDuckGo

https://builders.mozilla.org/announcing-common-corpus

Announcing Common Corpus

We released Common Corpus, the largest fully open dataset of over 2 trillion tokens. Pleias is committed to training LLMs in the open. This means not only releasing our models but also being open about every aspect, from the training data to the training code. We define “open” strictly: all data must be both accessible […]

  • General Meta Tags

    14
    • title
      Announcing Common Corpus - Mozilla Builders
    • charset
      utf-8
    • viewport
      width=device-width, initial-scale=1
    • application-name
      (empty)
    • msapplication-TileColor
      #FFFFFF
  • Open Graph Meta Tags

    10
    • og:locale
      en_US
    • og:type
      article
    • og:title
      Announcing Common Corpus
    • og:description
      We released Common Corpus, the largest fully open dataset of over 2 trillion tokens. Pleias is committed to training LLMs in the open. This means not only releasing our models but also being open about every aspect, from the training data to the training code. We define “open” strictly: all data must be both accessible […]
    • og:url
      https://builders.mozilla.org/announcing-common-corpus/
  • Twitter Meta Tags

    7
    • twitter:card
      summary_large_image
    • twitter:creator
      @mozillabuilders
    • twitter:site
      @mozillabuilders
    • twitter:label1
      Written by
    • twitter:data1
      Anastasia Stasenko, Pierre-Carl Langlais
  • Link Tags

    16
    • apple-touch-icon-precomposed
      https://builders.mozilla.org/wp-content/themes/mozilla-builders/static/img/icons/apple-touch-icon-57x57.png
    • apple-touch-icon-precomposed
      https://builders.mozilla.org/wp-content/themes/mozilla-builders/static/img/icons/apple-touch-icon-114x114.png
    • apple-touch-icon-precomposed
      https://builders.mozilla.org/wp-content/themes/mozilla-builders/static/img/icons/apple-touch-icon-72x72.png
    • apple-touch-icon-precomposed
      https://builders.mozilla.org/wp-content/themes/mozilla-builders/static/img/icons/apple-touch-icon-144x144.png
    • apple-touch-icon-precomposed
      https://builders.mozilla.org/wp-content/themes/mozilla-builders/static/img/icons/apple-touch-icon-60x60.png
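
A listing like the one above could be collected from a page's markup with a small parser. The following is a minimal sketch using Python's standard `html.parser`. The class name `MetaTagCollector` and the `SAMPLE_HTML` snippet are hypothetical: the snippet is assembled from a few values in the listing and is not the site's actual markup.

```python
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Collect <meta> key/content pairs and <link> rel/href pairs."""

    def __init__(self):
        super().__init__()
        self.meta = {}    # e.g. "og:title" -> "Announcing Common Corpus"
        self.links = []   # list of (rel, href) tuples, in document order

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            # <meta charset="..."> carries no name/property attribute.
            if "charset" in attrs:
                self.meta["charset"] = attrs["charset"]
            # Open Graph tags use "property"; most others use "name".
            key = attrs.get("property") or attrs.get("name")
            if key:
                self.meta[key] = attrs.get("content", "")
        elif tag == "link" and "rel" in attrs:
            self.links.append((attrs["rel"], attrs.get("href", "")))


# Hypothetical sample markup built from values shown in the listing.
SAMPLE_HTML = """
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta property="og:locale" content="en_US">
<meta property="og:type" content="article">
<meta property="og:title" content="Announcing Common Corpus">
<meta name="twitter:card" content="summary_large_image">
<link rel="apple-touch-icon-precomposed" href="https://builders.mozilla.org/wp-content/themes/mozilla-builders/static/img/icons/apple-touch-icon-57x57.png">
</head>
"""

collector = MetaTagCollector()
collector.feed(SAMPLE_HTML)

for key, value in collector.meta.items():
    print(f"{key}: {value}")
for rel, href in collector.links:
    print(f"{rel}: {href}")
```

Grouping the results by prefix (`og:`, `twitter:`) would reproduce the section split used in the listing.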

Links

31