intelligentjello.substack.com/p/a-very-good-question/comment/95861210
Preview meta tags from the intelligentjello.substack.com website.
Linked Hostnames
2Thumbnail

Search Engine Appearance
Slicey Me Likey on Intelligent Jello
I don't fully buy this argument. First, most video models don't "predict the next frame", they generate full videos at once given a text prompt. That aside, there is no convincing argument that a model needs to understand 3D/physics in order to produce very convincing videos (models like Veo 2 are already getting close). Remember 12 months ago when image models kept generating 6 fingers on hands? People expected that something had to be done explicitly to handle this, or else it wouldn't be solved. All that mattered in the end was more data and more compute (problem solved). Read The Bitter Lesson by Sutton.
Bing
Slicey Me Likey on Intelligent Jello
I don't fully buy this argument. First, most video models don't "predict the next frame", they generate full videos at once given a text prompt. That aside, there is no convincing argument that a model needs to understand 3D/physics in order to produce very convincing videos (models like Veo 2 are already getting close). Remember 12 months ago when image models kept generating 6 fingers on hands? People expected that something had to be done explicitly to handle this, or else it wouldn't be solved. All that mattered in the end was more data and more compute (problem solved). Read The Bitter Lesson by Sutton.
DuckDuckGo
Slicey Me Likey on Intelligent Jello
I don't fully buy this argument. First, most video models don't "predict the next frame", they generate full videos at once given a text prompt. That aside, there is no convincing argument that a model needs to understand 3D/physics in order to produce very convincing videos (models like Veo 2 are already getting close). Remember 12 months ago when image models kept generating 6 fingers on hands? People expected that something had to be done explicitly to handle this, or else it wouldn't be solved. All that mattered in the end was more data and more compute (problem solved). Read The Bitter Lesson by Sutton.
General Meta Tags
16- titleComments - A Very Good Question - by Mike Gioia
- title
- title
- title
- title
Open Graph Meta Tags
7- og:urlhttps://intelligentjello.substack.com/p/a-very-good-question/comment/95861210
- og:imagehttps://substackcdn.com/image/fetch/$s_!KBn7!,f_auto,q_auto:best,fl_progressive:steep/https%3A%2F%2Fintelligentjello.substack.com%2Ftwitter%2Fsubscribe-card.jpg%3Fv%3D-1423682882%26version%3D9
- og:typearticle
- og:titleSlicey Me Likey on Intelligent Jello
- og:descriptionI don't fully buy this argument. First, most video models don't "predict the next frame", they generate full videos at once given a text prompt. That aside, there is no convincing argument that a model needs to understand 3D/physics in order to produce very convincing videos (models like Veo 2 are already getting close). Remember 12 months ago when image models kept generating 6 fingers on hands? People expected that something had to be done explicitly to handle this, or else it wouldn't be solved. All that mattered in the end was more data and more compute (problem solved). Read The Bitter Lesson by Sutton.
Twitter Meta Tags
8- twitter:imagehttps://substackcdn.com/image/fetch/$s_!KBn7!,f_auto,q_auto:best,fl_progressive:steep/https%3A%2F%2Fintelligentjello.substack.com%2Ftwitter%2Fsubscribe-card.jpg%3Fv%3D-1423682882%26version%3D9
- twitter:cardsummary_large_image
- twitter:label1Likes
- twitter:data10
- twitter:label2Replies
Link Tags
31- alternate/feed
- apple-touch-iconhttps://substackcdn.com/image/fetch/$s_!3lYh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41999577-5034-4050-9dff-e9a5001903cb%2Fapple-touch-icon-57x57.png
- apple-touch-iconhttps://substackcdn.com/image/fetch/$s_!jH2r!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41999577-5034-4050-9dff-e9a5001903cb%2Fapple-touch-icon-60x60.png
- apple-touch-iconhttps://substackcdn.com/image/fetch/$s_!vHJA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41999577-5034-4050-9dff-e9a5001903cb%2Fapple-touch-icon-72x72.png
- apple-touch-iconhttps://substackcdn.com/image/fetch/$s_!uue-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41999577-5034-4050-9dff-e9a5001903cb%2Fapple-touch-icon-76x76.png
Links
16- https://intelligentjello.substack.com
- https://intelligentjello.substack.com/p/a-very-good-question/comment/95861210
- https://intelligentjello.substack.com/p/a-very-good-question/comment/95865326
- https://intelligentjello.substack.com/p/a-very-good-question/comments#comment-95861210
- https://substack.com