isoprophlex • today at 10:01 AM • 1 reply

Cool project! That validation loss curve screams train set memorization without generalization ability.
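For readers who want to check their own curves for that signature, here is a minimal sketch. It assumes per-epoch losses are available as plain lists; the function name `find_overfit_onset` and the `patience` threshold are hypothetical choices for illustration, not anything taken from the project.

```python
from typing import Optional, Sequence


def find_overfit_onset(
    train_loss: Sequence[float],
    val_loss: Sequence[float],
    patience: int = 3,  # assumed threshold, tune for your run
) -> Optional[int]:
    """Return the epoch with the lowest validation loss if validation then
    failed to improve for `patience` epochs while training loss kept
    falling; return None if no such divergence is seen."""
    best_epoch = 0
    for epoch in range(1, min(len(train_loss), len(val_loss))):
        if val_loss[epoch] < val_loss[best_epoch]:
            # Validation is still improving, so move the reference point.
            best_epoch = epoch
        elif (
            epoch - best_epoch >= patience
            and train_loss[epoch] < train_loss[best_epoch]
        ):
            # Train loss dropped further while validation stalled or rose.
            return best_epoch
    return None
```

A non-None result only says the curves diverge. It does not say whether the cause is too little data, noisy labels, or an ill-posed task.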

Likely causes: too little training data and/or data of insufficient quality. Maybe let the robot run autonomously with an (expensive) VLM operating it, to bootstrap a larger training dataset without having to annotate it yourself.
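A rough sketch of what that bootstrapping loop could look like, under loud assumptions: the action set, the `Sample` record, and the `ask_vlm` callable are all hypothetical. `ask_vlm` stands in for whatever VLM client you end up using; no real vendor API is shown here.

```python
import json
from dataclasses import asdict, dataclass
from typing import Callable, Iterable, List

# Hypothetical discrete action set; replace with the robot's real commands.
ACTIONS = ("forward", "left", "right", "stop")


@dataclass
class Sample:
    frame_id: str  # identifier of the camera frame shown to the VLM
    action: str    # action the VLM chose for that frame


def bootstrap_labels(
    frame_ids: Iterable[str],
    ask_vlm: Callable[[str], str],  # assumed wrapper around a VLM call
) -> List[Sample]:
    """Collect (frame, action) pairs, keeping only replies that
    normalize to a known action."""
    samples: List[Sample] = []
    for frame_id in frame_ids:
        answer = ask_vlm(frame_id).strip().lower()
        if answer in ACTIONS:  # drop free-form or malformed replies
            samples.append(Sample(frame_id, answer))
    return samples


def to_jsonl(samples: Iterable[Sample]) -> str:
    """Serialize samples as JSON Lines for later training."""
    return "\n".join(json.dumps(asdict(s)) for s in samples)
```

Filtering out replies that are not a known action is a cheap first guard on label quality. Spot-checking a sample of the VLM's labels by hand would still be worthwhile before training on them.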

Or maybe the problem itself is poorly specified, or intractable with your chosen network architecture. But if you see that a vision LLM can pilot the bot, at least you know you have a fighting chance.

Replies

indraneelpatil • today at 3:18 PM

Thanks! It's probably both: too little training data and insufficient quality.

That's a cool idea. Is there any VLM you would suggest? I can think of Gemini, maybe? Or would any do?
