logoalt Hacker News

pjottoday at 2:56 AM1 replyview on HN

Re-reading your comment :) Yes, my demo has just a simple loop when loading the embeddings.

I was replying more towards the latency you mentioned. Because duckdb runs on device, you save yourself the additional round trip network time when comparing similarities.


Replies

jiggawattstoday at 3:14 AM

I was running SQL Server 2025 on my laptop. The source of latency is calling the Google Gemini API to compute the embedding of the query text.

I was hoping to make a demo that searches as you type, but the two second delay makes it more annoying than useful.

Looking at your sample you may be only grouping or categorising based on similarity between comments.

I was experimenting with a question -> answer tool for RAG applications.