blog.ai-futures.org/p/against-misalignment-as-self-fulfilling/comment/136884265
Preview meta tags from the blog.ai-futures.org website.

Search Engine Appearance
David Spies on AI Futures Project
The AI's choice to be sycophantic is clearly the result of post-training. I just meant the downstream out-of-distribution consequences of that choice arise from pre-training priors for "what other things do sycophantic characters tend to do"
General Meta Tags (17)
- title: Comments - Against Misalignment As "Self-Fulfilling Prophecy"
Open Graph Meta Tags (7)
- og:url: https://blog.ai-futures.org/p/against-misalignment-as-self-fulfilling/comment/136884265
- og:image: https://substackcdn.com/image/fetch/$s_!xB2j!,f_auto,q_auto:best,fl_progressive:steep/https%3A%2F%2Faifutures1.substack.com%2Ftwitter%2Fsubscribe-card.jpg%3Fv%3D-688204751%26version%3D9
- og:type: article
- og:title: David Spies on AI Futures Project
- og:description: The AI's choice to be sycophantic is clearly the result of post-training. I just meant the downstream out-of-distribution consequences of that choice arise from pre-training priors for "what other things do sycophantic characters tend to do"
Twitter Meta Tags (8)
- twitter:image: https://substackcdn.com/image/fetch/$s_!xB2j!,f_auto,q_auto:best,fl_progressive:steep/https%3A%2F%2Faifutures1.substack.com%2Ftwitter%2Fsubscribe-card.jpg%3Fv%3D-688204751%26version%3D9
- twitter:card: summary_large_image
- twitter:label1: Likes
- twitter:data1: 0
- twitter:label2: Replies
Link Tags (31)
- alternate: /feed
- apple-touch-icon: https://substackcdn.com/image/fetch/$s_!sC21!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc31e8a5-475f-4ac0-9697-f012e7030b43%2Fapple-touch-icon-57x57.png
- apple-touch-icon: https://substackcdn.com/image/fetch/$s_!XlU-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc31e8a5-475f-4ac0-9697-f012e7030b43%2Fapple-touch-icon-60x60.png
- apple-touch-icon: https://substackcdn.com/image/fetch/$s_!6aEK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc31e8a5-475f-4ac0-9697-f012e7030b43%2Fapple-touch-icon-72x72.png
- apple-touch-icon: https://substackcdn.com/image/fetch/$s_!E09L!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc31e8a5-475f-4ac0-9697-f012e7030b43%2Fapple-touch-icon-76x76.png
Links (18)
- https://blog.ai-futures.org
- https://blog.ai-futures.org/p/against-misalignment-as-self-fulfilling/comment/136884265
- https://blog.ai-futures.org/p/against-misalignment-as-self-fulfilling/comment/138459064
- https://blog.ai-futures.org/p/against-misalignment-as-self-fulfilling/comment/138464909
- https://blog.ai-futures.org/p/against-misalignment-as-self-fulfilling/comments#comment-136884265