Attach the SRT to your frozen model, Anthropic. Problem solved.

spacebacon • yesterday at 9:41 PM • 2 replies

Attach the SRT to your frozen model, Anthropic. Problem solved. https://github.com/space-bacon/SRT

Replies

I see your repository’s README says:

> Language models process signs (representamens) but are blind to when meaning forks — when the same word means different things to different communities.

But haven’t interpretability results shown that these models internally represent different meanings of the same word differently? If so, why wouldn’t they already do the same for words that different communities use differently?

spacebacon • yesterday at 10:16 PM

[dead]
