海行（うみゆき）: "RT @npaka123: RLHF (人間のフィードバックからの強化学習) の図解｜npaka …" - Mastodon

海行（うみゆき） @umiyuki@mstdn.soysoftware.net

RT @npaka123: RLHF (人間のフィードバックからの強化学習) の図解｜npaka @npaka123 #note https://t.co/6C8n9NoTwl

Apr 28, 2023, 02:15 · From Twitter · · ·

Sign in to participate in the conversation