Hacker News

Calavar · today at 1:34 AM

Sure, maybe it's tricky to coerce an LLM into spitting out a near-verbatim copy of prior data, but that's orthogonal to whether the data needed to produce a near-verbatim copy exists in the model weights.


Replies

D-Machine · today at 2:31 AM

Especially since the recall achieved in the paper reaches 96% (based on block-level longest-common-substring matching), the effort of extraction is utterly irrelevant.
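
For anyone curious what that kind of metric looks like in practice, here's a rough character-level sketch of a longest-common-substring recall. The function name and example strings are made up for illustration and aren't taken from the paper:

```python
from difflib import SequenceMatcher

def lcs_recall(training_text: str, generated_text: str) -> float:
    """Fraction of the training text covered by the single longest
    substring it shares with the model's output (a crude recall proxy)."""
    if not training_text:
        return 0.0
    matcher = SequenceMatcher(None, training_text, generated_text, autojunk=False)
    match = matcher.find_longest_match(0, len(training_text), 0, len(generated_text))
    return match.size / len(training_text)

# A near-verbatim reproduction scores close to 1.0.
original = "the quick brown fox jumps over the lazy dog"
output = "...and then the quick brown fox jumps over the lazy dog, apparently"
print(f"LCS recall: {lcs_recall(original, output):.2f}")  # 1.00
```

The paper's block-based approach is presumably more involved than this; the sketch just shows the basic idea of scoring how much of a source string resurfaces contiguously in model output.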