logoalt Hacker News

spacebaconyesterday at 9:41 PM2 repliesview on HN

Attach the SRT to your frozen model Anthropic. Problem solved. https://github.com/space-bacon/SRT.


Replies

drdecatoday at 12:31 AM

I see your repository’s README says

> Language models process signs (representamens) but are blind to when meaning forks — when the same word means different things to different communities.

But, haven’t interpretability results shown that these models internally represent several meanings of the same word, differently? In that case, why would they not already do the same for how words are used differently in different communities?

spacebaconyesterday at 10:16 PM

[dead]