I mean anything in the 0.5B-3B range that's available on Ollama (for example). Have you built any cool tooling that uses these models as part of your work flow?
I have tried Chinese-to-English translation, but it isn't good (none of them are), because Chinese words mean different things depending on context. So I'm stuck with large models, but sometimes even they leave Chinese text in the translation (like Google Gemini 2).
I really hope some amazing translation models come out this year.
I built auto-summarization and grouping in an experimental branch of my hobby-retrospective tool: https://github.com/Sebazzz/Return/tree/experiment/ai-integra...
I’m now just wondering if there is any way to build tests on the input+output of the LLM :D
I bought a tiny business in Brazil, and the database (an Excel file) I inherited with previous customer data *did not include gender*. I need gender to start my marketing campaigns and learn more about my future customers. I used Gemma-2B and Python to infer gender from the data, and it worked perfectly.
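A minimal sketch of how that kind of pipeline might look, assuming Gemma 2B served through Ollama's default local endpoint; the model tag, prompt wording, and helper names here are my own assumptions, not the commenter's actual code:

```python
# Hypothetical sketch: infer the likely gender for a first name using a
# local Gemma model served by Ollama. Endpoint and model tag are assumed.
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # default Ollama endpoint

def gender_prompt(first_name: str) -> str:
    # Keep the task tightly constrained so a 2B model stays on track.
    return (
        "Classify the most likely gender for the Brazilian first name "
        f"'{first_name}'. Answer with exactly one word: male, female, or unknown."
    )

def parse_gender(raw: str) -> str:
    # Small models sometimes add punctuation or extra words; normalize.
    word = raw.strip().lower().split()[0].strip(".,!") if raw.strip() else ""
    return word if word in ("male", "female") else "unknown"

def classify(first_name: str, model: str = "gemma:2b") -> str:
    payload = json.dumps({
        "model": model,
        "prompt": gender_prompt(first_name),
        "stream": False,
    }).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=payload,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return parse_gender(json.load(resp)["response"])
```

The normalization step matters more than the prompt: a 2B model will happily answer "Female." or "male, probably", so forcing the reply down to a fixed label set keeps the spreadsheet clean.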
Apple’s on-device models are around 3B, if I’m not mistaken, and they published some nice tech around them: they have just one model, but switchable fine-tunings of that model, so it can perform different functions depending on context.
I'm using ollama for parsing and categorizing scraped jobs for a local job board dashboard I check every day.
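A sketch of what that categorization step could look like with Ollama's JSON output mode; the category list, field names, and model tag are assumptions for illustration, not the commenter's setup:

```python
# Hypothetical sketch: categorize a scraped job posting with a small local
# model via Ollama's /api/generate endpoint, asking for JSON output.
import json
import urllib.request

CATEGORIES = ["engineering", "design", "marketing", "sales", "other"]

def build_prompt(title: str, description: str) -> str:
    return (
        "Categorize this job posting. Respond with JSON like "
        '{"category": "...", "remote": true}. '
        f"Allowed categories: {', '.join(CATEGORIES)}.\n\n"
        f"Title: {title}\nDescription: {description[:500]}"
    )

def parse_category(raw: str) -> dict:
    # Fall back to "other" if the model returns malformed or off-list JSON.
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return {"category": "other", "remote": False}
    if data.get("category") not in CATEGORIES:
        data["category"] = "other"
    data.setdefault("remote", False)
    return data

def categorize(title: str, description: str, model: str = "llama3.2:3b") -> dict:
    payload = json.dumps({
        "model": model,
        "prompt": build_prompt(title, description),
        "format": "json",   # Ollama constrains the output to valid JSON
        "stream": False,
    }).encode()
    req = urllib.request.Request(
        "http://localhost:11434/api/generate", data=payload,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return parse_category(json.load(resp)["response"])
```

Setting `"format": "json"` helps a lot with small models, but validating the category against a fixed list is still worth it, since they occasionally invent new labels.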
Although there are better ways to test, I used a 3B model to speed up replies from my local AI server when testing out an application I was developing. Yes I could have mocked up HTTP replies etc., but in this case the small model let me just plug in and go.
Are there any experiments with small models that do paraphrasing? I tried using some off-the-shelf models, but it didn't go well.
I was thinking of hooking them in RPGs with text-based dialogue, so that a character will say something slightly different every time you speak to them.
I'm using ollama, llama3.2 3b, and Python to shorten news article titles to 10 words or fewer. I have a 3-column web site with a list of news articles in the middle column. Some of the original titles are too long for this format, but the shortened titles fit fine.
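The title-shortening step above might be sketched like this, assuming the default Ollama endpoint and `llama3.2:3b` tag; the prompt wording and the truncation fallback are my own assumptions:

```python
# Hypothetical sketch: shorten a news headline to 10 words or fewer with a
# local llama3.2 3B model, falling back to plain truncation if the model
# rambles past the limit.
import json
import urllib.request

MAX_WORDS = 10

def shorten_prompt(title: str) -> str:
    return (
        f"Shorten this news headline to {MAX_WORDS} words or fewer. "
        "Keep the key facts. Reply with only the new headline.\n\n"
        f"Headline: {title}"
    )

def enforce_limit(candidate: str, original: str) -> str:
    # Small models occasionally ignore the limit or echo extra text,
    # so hard-truncate the original as a last resort.
    words = candidate.strip().strip('"').split()
    if 0 < len(words) <= MAX_WORDS:
        return " ".join(words)
    return " ".join(original.split()[:MAX_WORDS])

def shorten(title: str, model: str = "llama3.2:3b") -> str:
    payload = json.dumps({
        "model": model,
        "prompt": shorten_prompt(title),
        "stream": False,
    }).encode()
    req = urllib.request.Request(
        "http://localhost:11434/api/generate", data=payload,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return enforce_limit(json.load(resp)["response"], title)
```

The fallback means a bad generation degrades to a blunt-but-usable truncated title instead of breaking the column layout.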
I am building a private text editor that runs LLMs locally https://manzoni.app/
I'm interested in finding tiny models to create workflows stringing together several functions/tools and running them on device using mcp.run servlets on Android (disclaimer: I work on that).
I am moderating a playlist manager to restrict it to a range of genres, so it classifies song requests as accepted/rejected.
Many interesting projects, cool. I'm waiting for LLMs in games. That would make them much more fun. Any time now...
I'm working on using them for agentic voice commands of a limited scope.
My needs are narrow and limited but I want a bit of flexibility.
Pretty sure they are mostly used as fine tuning targets, rather than as-is.
I had an LLM create a playlist for me.
I’m tired of the bad playlists I get from algorithms, so I made a specific playlist with a Llama 2 model, based on several songs I like. I started with 50, removed any I didn’t like, and added more to fill in the gaps. The small models were pretty good at this. Now I have a decent fixed playlist. It does get “tired” after a few weeks and I need to add more to it. I’ve never been able to do this myself with more than a dozen songs.