Hacker News

zapnuk 08/01/2025

Good luck to anyone using it. We used it for embedding about 6k documents.

The API constantly gives you quota errors once you reach about 150 requests/min, even though the quota should allow about 50,000 requests/min.
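For anyone hitting the same wall, here is a minimal sketch of a client-side workaround: space requests out below the rate where the errors start, and retry with exponential backoff when a quota error still comes through. Everything named here is an assumption for illustration: `QuotaError` is a hypothetical stand-in for whatever exception your client raises on a quota/429 response, `Throttle` and `call_with_backoff` are made-up helpers, and the 100 requests/min figure is just a guess at a safe margin under the ~150 requests/min mentioned above, not a documented limit.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class QuotaError(Exception):
    """Hypothetical stand-in for the client's quota / HTTP 429 error."""


class Throttle:
    """Spaces calls roughly 60 / per_minute seconds apart."""

    def __init__(self, per_minute: int,
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        self.interval = 60.0 / per_minute
        self.clock = clock
        self.sleep = sleep
        self.next_slot = clock()  # earliest time the next call may start

    def wait(self) -> None:
        delay = self.next_slot - self.clock()
        if delay > 0:
            self.sleep(delay)
        # Schedule the following call one interval after this one.
        self.next_slot = max(self.clock(), self.next_slot) + self.interval


def call_with_backoff(fn: Callable[[], T], throttle: Throttle,
                      max_retries: int = 5, base_delay: float = 1.0,
                      sleep: Callable[[float], None] = time.sleep) -> T:
    """Runs fn() under the throttle, retrying on QuotaError with backoff."""
    for attempt in range(max_retries + 1):
        throttle.wait()
        try:
            return fn()
        except QuotaError:
            if attempt == max_retries:
                raise  # out of retries, surface the error to the caller
            # Exponential backoff plus jitter so retries do not line up.
            sleep(base_delay * 2 ** attempt + random.uniform(0, base_delay))
    raise RuntimeError("unreachable")


# Usage (embed_one is a hypothetical function that makes one API request):
#   throttle = Throttle(per_minute=100)  # assumed safe margin, not documented
#   vectors = [call_with_backoff(lambda d=d: embed_one(d), throttle)
#              for d in documents]
```

At 100 requests/min, about 6k documents would take roughly an hour if each request carries one document, so this is a stopgap rather than a fix for the quota behavior itself.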

We’d like to use the Batch API, but the model isn’t available through it yet.

Quite a nice model though. Being able to get embeddings for a specific task type [1] is very interesting. We used classification-specific embeddings and noticed a meaningful improvement when we used the embeddings as input for a classifier.

1: https://ai.google.dev/gemini-api/docs/embeddings#supported-t...


Replies

ofisboy 08/01/2025

Same here.

I tested the Gemini embeddings API on anywhere from 1 to roughly 5,000 social media comments. It used up the quota almost immediately.

Since then, I’ve just been using Qwen embeddings locally. Open source, free, and relatively comparable.