logoalt Hacker News

kramit1288today at 10:45 AM0 repliesview on HN

accurate memory estimation is key here. it will crash if that accurate and it cant be generic for all local llm. each local llm has different context estimates.