
forums.fast.ai/t/super-convergence-ish-on-wikitext-2/17091
Preview meta tags from the forums.fast.ai website.
Linked Hostnames: 4

Search Engine Appearance
https://forums.fast.ai/t/super-convergence-ish-on-wikitext-2/17091
Super convergence(ish) on wikitext-2
After a month or so of experimenting on wikitext-2 and training a LM as fast as possible, I wanted to share a few findings. Here is the notebook with my current results: in 150 epochs (instead of 750) I get to a perplexi…
General Meta Tags (8)
- title: Super convergence(ish) on wikitext-2 - Part 2 & Alumni (2018) - fast.ai Course Forums
- charset: utf-8
- description: After a month or so of experimenting on wikitext-2 and training a LM as fast as possible, I wanted to share a few findings. Here is the notebook with my current results: in 150 epochs (instead of 750) I get to a perplexi…
- generator: Discourse 3.5.0.beta5-dev - https://github.com/discourse/discourse version eff31e0d42c535ee115317f5b63ff70d90097911
- theme-color: #fff
Open Graph Meta Tags (9)
- og:site_name: fast.ai Course Forums
- og:type: website
- og:image: https://forums.fast.ai/uploads/default/original/3X/b/3/b395de7a2ba00b82865031c97a8cadf3d80e71e5.png
- og:url: https://forums.fast.ai/t/super-convergence-ish-on-wikitext-2/17091
- og:title: Super convergence(ish) on wikitext-2
Twitter Meta Tags (9)
- twitter:card: summary
- twitter:image: https://forums.fast.ai/uploads/default/original/3X/b/3/b395de7a2ba00b82865031c97a8cadf3d80e71e5.png
- twitter:url: https://forums.fast.ai/t/super-convergence-ish-on-wikitext-2/17091
- twitter:title: Super convergence(ish) on wikitext-2
- twitter:description: After a month or so of experimenting on wikitext-2 and training a LM as fast as possible, I wanted to share a few findings. Here is the notebook with my current results: in 150 epochs (instead of 750) I get to a perplexity of 70.73 (vs 68 for the benchmark given by Stephen Merity) then 53.1 with the cache pointer (vs 52 for the benchmark given by Stephen Merity). Here is a list of things that helped: using the dropouts from Stephen Merity and not Jeremy’s (I think this part comes from the diff...
Item Prop Meta Tags (60)
- position: 1
- headline: Super convergence(ish) on wikitext-2
- datePublished: 2018-05-27T13:03:46Z
- articleSection: Part 2 & Alumni (2018)
- keywords
Link Tags (30)
- alternate nofollow: https://forums.fast.ai/t/super-convergence-ish-on-wikitext-2/17091.rss
- apple-touch-icon: https://forums.fast.ai/uploads/default/optimized/3X/b/3/b395de7a2ba00b82865031c97a8cadf3d80e71e5_2_180x180.png
- canonical: https://forums.fast.ai/t/super-convergence-ish-on-wikitext-2/17091
- icon: https://forums.fast.ai/uploads/default/optimized/3X/b/3/b395de7a2ba00b82865031c97a8cadf3d80e71e5_2_32x32.png
- search: https://forums.fast.ai/opensearch.xml
Links (21)
- http://forums.fast.ai/t/adamw-and-super-convergence-blog/18836/2
- https://ai.googleblog.com/2018/06/how-can-neural-network-similarity-help.html
- https://forums.fast.ai
- https://forums.fast.ai/c/part2-v2/15
- https://forums.fast.ai/categories