
alignmentforum.org/posts/5spBue2z2tw4JuDCx/steering-gpt-2-xl-by-adding-an-activation-vector
Preview meta tags from the alignmentforum.org website.
Linked Hostnames (27)
- 89 links to alignmentforum.org
- 23 links to arxiv.org
- 7 links to colab.research.google.com
- 6 links to github.com
- 3 links to proceedings.neurips.cc
- 2 links to en.wikipedia.org
- 2 links to predictionbook.com
- 2 links to transformer-circuits.pub
Search Engine Appearance
https://alignmentforum.org/posts/5spBue2z2tw4JuDCx/steering-gpt-2-xl-by-adding-an-activation-vector
Steering GPT-2-XL by adding an activation vector — AI Alignment Forum
Alex Turner and collaborators show that you can modify GPT-2's behavior in surprising and interesting ways by just adding activation vectors to its f…
Bing
Steering GPT-2-XL by adding an activation vector — AI Alignment Forum
https://alignmentforum.org/posts/5spBue2z2tw4JuDCx/steering-gpt-2-xl-by-adding-an-activation-vector
Alex Turner and collaborators show that you can modify GPT-2's behavior in surprising and interesting ways by just adding activation vectors to its f…
DuckDuckGo
Steering GPT-2-XL by adding an activation vector — AI Alignment Forum
Alex Turner and collaborators show that you can modify GPT-2's behavior in surprising and interesting ways by just adding activation vectors to its f…
General Meta Tags (13)
- title: Steering GPT-2-XL by adding an activation vector — AI Alignment Forum
- title: spotify-podcast-badge-wht-blk-165x40
- charset: utf-8
- viewport: width=device-width, initial-scale=1
- Accept-CH: DPR, Viewport-Width, Width
Open Graph Meta Tags (5)
- og:title: Steering GPT-2-XL by adding an activation vector — AI Alignment Forum
- og:type: article
- og:url: https://www.alignmentforum.org/posts/5spBue2z2tw4JuDCx/steering-gpt-2-xl-by-adding-an-activation-vector
- og:image: https://res.cloudinary.com/lesswrong-2-0/image/upload/c_fill,ar_1.91,g_auto/SocialPreview/bj2r3jtoovu5gfcmpjrg
- og:description: Alex Turner and collaborators show that you can modify GPT-2's behavior in surprising and interesting ways by just adding activation vectors to its f…
Twitter Meta Tags (3)
- twitter:image:src: https://res.cloudinary.com/lesswrong-2-0/image/upload/c_fill,ar_1.91,g_auto/SocialPreview/bj2r3jtoovu5gfcmpjrg
- twitter:description: Alex Turner and collaborators show that you can modify GPT-2's behavior in surprising and interesting ways by just adding activation vectors to its f…
- twitter:card: summary
Link Tags (32)
- alternate: https://www.alignmentforum.org/feed.xml
- canonical: https://www.alignmentforum.org/posts/5spBue2z2tw4JuDCx/steering-gpt-2-xl-by-adding-an-activation-vector
- preload: https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Steering%20GPT-2-XL%20by%20adding%20an%20activation%20vector_a_compass_needle_adjusting_a_shi_0.26651304514537477/jqnevelzysz3adrrfefi
- preload: https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/FEGguDMzGbKojSQF9/pg9d78ympfbw0zygoqim
- preload: https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/FEGguDMzGbKojSQF9/eekt86qa7htfefs0sm5f
Links (153)
- http://yann.lecun.com/exdb/publis/pdf/gregor-icml-10.pdf
- https://academic.oup.com/edited-volume/40193/chapter-abstract/342292664?redirectedFrom=fulltext
- https://aclanthology.org/2022.findings-emnlp.336
- https://alignmentforum.org
- https://alignmentforum.org/bestoflesswrong?year=2023&category=all