doi.org/10.1007/s00778-019-00552-1

Preview meta tags from the doi.org website.

Linked Hostnames

36

Thumbnail

Search Engine Appearance

Google

https://doi.org/10.1007/s00778-019-00552-1

Snorkel: rapid training data creation with weak supervision - The VLDB Journal

Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that en



Bing

Snorkel: rapid training data creation with weak supervision - The VLDB Journal

https://doi.org/10.1007/s00778-019-00552-1

Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that en



DuckDuckGo

https://doi.org/10.1007/s00778-019-00552-1

Snorkel: rapid training data creation with weak supervision - The VLDB Journal

Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that en

  • General Meta Tags

    151
    • title
      Snorkel: rapid training data creation with weak supervision | The VLDB Journal
    • charset
      UTF-8
    • X-UA-Compatible
      IE=edge
    • applicable-device
      pc,mobile
    • viewport
      width=device-width, initial-scale=1
  • Open Graph Meta Tags

    6
    • og:url
      https://link.springer.com/article/10.1007/s00778-019-00552-1
    • og:type
      article
    • og:site_name
      SpringerLink
    • og:title
      Snorkel: rapid training data creation with weak supervision - The VLDB Journal
    • og:description
      Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-art models without hand labeling any training data. Instead, users write labeling functions that express arbitrary heuristics, which can have unknown accuracies and correlations. Snorkel denoises their outputs without access to ground truth by incorporating the first end-to-end implementation of our recently proposed machine learning paradigm, data programming. We present a flexible interface layer for writing labeling functions based on our experience over the past year collaborating with companies, agencies, and research laboratories. In a user study, subject matter experts build models 2.8× faster and increase predictive performance an average 45.5% versus seven hours of hand labeling. We study the modeling trade-offs in this new setting and propose an optimizer for automating trade-off decisions that gives up to 1.8× speedup per pipeline execution. In two collaborations, with the US Department of Veterans Affairs and the US Food and Drug Administration, and on four open-source text and image data sets representative of other deployments, Snorkel provides 132% average improvements to predictive performance over prior heuristic approaches and comes within an average 3.60% of the predictive performance of large hand-curated training sets.
  • Twitter Meta Tags

    6
    • twitter:site
      @SpringerLink
    • twitter:card
      summary_large_image
    • twitter:image:alt
      Content cover image
    • twitter:title
      Snorkel: rapid training data creation with weak supervision
    • twitter:description
      The VLDB Journal - Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that enables users to train...
  • Item Prop Meta Tags

    3
    • position
      1
    • position
      2
    • position
      3
  • Link Tags

    9
    • apple-touch-icon
      /oscar-static/img/favicons/darwin/apple-touch-icon-6ef0829b9c.png
    • canonical
      https://link.springer.com/article/10.1007/s00778-019-00552-1
    • icon
      /oscar-static/img/favicons/darwin/android-chrome-192x192.png
    • icon
      /oscar-static/img/favicons/darwin/favicon-32x32.png
    • icon
      /oscar-static/img/favicons/darwin/favicon-16x16.png

Emails

1

Links

257