Personally I use llama3.1:8b or mistral-nemo:latest, which have a decent context window (even if it's usually smaller than the commercial ones). I'm also working on a token calculator / content-splitting method, but it's still very early.
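The token calculator / content-splitting idea could look something like this minimal sketch. It assumes a rough heuristic of ~4 characters per token (the real counter would use the model's actual tokenizer) and greedily packs paragraphs into chunks that fit a token budget; the function names and the budget default are just illustrative:

```python
def estimate_tokens(text: str) -> int:
    # rough heuristic: ~4 characters per token for English text;
    # a real implementation would use the model's tokenizer instead
    return max(1, len(text) // 4)

def split_by_token_budget(text: str, budget: int = 2048) -> list[str]:
    # greedily pack paragraphs until the token budget would be exceeded,
    # then start a new chunk
    chunks, current, used = [], [], 0
    for para in text.split("\n\n"):
        cost = estimate_tokens(para)
        if current and used + cost > budget:
            chunks.append("\n\n".join(current))
            current, used = [], 0
        current.append(para)
        used += cost
    if current:
        chunks.append("\n\n".join(current))
    return chunks
```

Each chunk can then be sent to the model separately, staying within the context window.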
Why not llama3.2:3B? It has a fairly large context window too.