alignmentforum.org/posts/DtkA5jysFZGv7W4qP/training-process-transparency-through-gradient

Preview meta tags from the alignmentforum.org website.

Linked Hostnames

9

Thumbnail

Search Engine Appearance

Google

https://alignmentforum.org/posts/DtkA5jysFZGv7W4qP/training-process-transparency-through-gradient

Training Process Transparency through Gradient Interpretability: Early experiments on toy language models — AI Alignment Forum

The work presented in this post was conducted during the SERI MATS 3.1 program. Thank you to Evan Hubinger for providing feedback on the outlined exp…



Bing

Training Process Transparency through Gradient Interpretability: Early experiments on toy language models — AI Alignment Forum

https://alignmentforum.org/posts/DtkA5jysFZGv7W4qP/training-process-transparency-through-gradient

The work presented in this post was conducted during the SERI MATS 3.1 program. Thank you to Evan Hubinger for providing feedback on the outlined exp…



DuckDuckGo

https://alignmentforum.org/posts/DtkA5jysFZGv7W4qP/training-process-transparency-through-gradient

Training Process Transparency through Gradient Interpretability: Early experiments on toy language models — AI Alignment Forum

The work presented in this post was conducted during the SERI MATS 3.1 program. Thank you to Evan Hubinger for providing feedback on the outlined exp…

  • General Meta Tags

    9
    • title
      Training Process Transparency through Gradient Interpretability: Early experiments on toy language models — AI Alignment Forum
    • charset
      utf-8
    • viewport
      width=device-width, initial-scale=1
    • Accept-CH
      DPR, Viewport-Width, Width
    • description
      The work presented in this post was conducted during the SERI MATS 3.1 program. Thank you to Evan Hubinger for providing feedback on the outlined exp…
  • Open Graph Meta Tags

    5
    • og:title
      Training Process Transparency through Gradient Interpretability: Early experiments on toy language models — AI Alignment Forum
    • og:type
      article
    • og:url
      https://www.alignmentforum.org/posts/DtkA5jysFZGv7W4qP/training-process-transparency-through-gradient
    • og:image
      https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DtkA5jysFZGv7W4qP/pb8hyfkvlcoc6349yybq
    • og:description
      The work presented in this post was conducted during the SERI MATS 3.1 program. Thank you to Evan Hubinger for providing feedback on the outlined exp…
  • Twitter Meta Tags

    3
    • twitter:image:src
      https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DtkA5jysFZGv7W4qP/pb8hyfkvlcoc6349yybq
    • twitter:description
      The work presented in this post was conducted during the SERI MATS 3.1 program. Thank you to Evan Hubinger for providing feedback on the outlined exp…
    • twitter:card
      summary
  • Link Tags

    9
    • alternate
      https://www.alignmentforum.org/feed.xml
    • canonical
      https://www.alignmentforum.org/posts/DtkA5jysFZGv7W4qP/training-process-transparency-through-gradient
    • preload
      https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DtkA5jysFZGv7W4qP/pb8hyfkvlcoc6349yybq
    • preload
      https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DtkA5jysFZGv7W4qP/gt9xfbbe4nfyncikyiub
    • preload
      https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DtkA5jysFZGv7W4qP/hbwxheyctwu87kaywoql

Links

36