4,096 token context window is pretty limiting. That's roughly 3,000 words — fine for "summ...

kangraemin • today at 1:28 PM • 1 reply • view on HN

4,096 token context window is pretty limiting. That's roughly 3,000 words — fine for "summarize this paragraph" but not enough for anything that needs real context. Still, zero cost and fully local is hard to beat for quick throwaway tasks. Does it handle streaming or is it request-response only?

Replies

xandrius • today at 4:30 PM

Try it and see

alt Hacker News

Replies