This reinforces my suspicion that alignment, and training in general, is closer to being a pedagogical problem than anything else. Given a finite amount of training input, how do we elicit the desired model behavior? I'm not sure that asking educators is the right answer, but it's one place to start.
Side note: Anthropic has done a good job of establishing an immediately recognizable art style.