The Matryoshka embeddings seem interesting:
> The Gemini embedding model, gemini-embedding-001, is trained using the Matryoshka Representation Learning (MRL) technique which teaches a model to learn high-dimensional embeddings that have initial segments (or prefixes) which are also useful, simpler versions of the same data. Use the output_dimensionality parameter to control the size of the output embedding vector. Selecting a smaller output dimensionality can save storage space and increase computational efficiency for downstream applications, while sacrificing little in terms of quality. By default, it outputs a 3072-dimensional embedding, but you can truncate it to a smaller size without losing quality to save storage space. We recommend using 768, 1536, or 3072 output dimensions. [0]
Looks like even the 256-dimensional embeddings perform really well.
[0]: https://ai.google.dev/gemini-api/docs/embeddings#quality-for...
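The practical upshot of MRL is that a prefix of the full vector is itself a usable embedding, so you can slice and re-normalize client-side. A minimal numpy sketch (the helper name and the toy vector are mine, not from any SDK; in practice you'd pass `output_dimensionality` to the API instead):

```python
import numpy as np

def truncate_embedding(vec, dim):
    """Keep the first `dim` components of an MRL-trained embedding
    and re-normalize so cosine similarity still behaves."""
    v = np.asarray(vec, dtype=np.float64)[:dim]
    norm = np.linalg.norm(v)
    return v / norm if norm > 0 else v

# Toy 3072-dim vector standing in for a gemini-embedding-001 output.
rng = np.random.default_rng(0)
full = rng.normal(size=3072)
short = truncate_embedding(full, 256)
print(short.shape)  # (256,)
```

Note that re-normalizing after truncation matters: the prefix of a unit vector is no longer unit-length, so dot products would otherwise be skewed.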
Does OpenAI's text-embedding-3-large or text-embedding-3-small embedding model have the Matryoshka property?
It's interesting, but the improvement they're claiming isn't that groundbreaking.
Google teams seem to be in love with that Matryoshka tech. I wonder how far that scales.
The Matryoshka trick is really neat - there's a good explanation here: https://huggingface.co/blog/matryoshka
I've seen it in a few models now; Nomic Embed 1.5 was the first: https://www.nomic.ai/blog/posts/nomic-embed-matryoshka