
wayneshng • today at 3:59 PM • 1 reply

Other cool features:

- Agents have cross-session memory: Your agents automatically write to memory when you tell them something worth remembering or when they discover something that might be useful later.

- Browser automation: Agents can operate a headless browser to navigate, fill in forms, click, read pages, and take screenshots. The browsers are managed by the host machine, so agents interact with them through a proxy.

- Human in the loop: Whenever there is a critical decision to make, the agent pauses and asks the human to choose from a set of options.

- Team knowledge base and skill system.
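To make the human-in-the-loop bullet concrete, here is a minimal sketch of the pause-and-ask pattern. This is not Valmis's actual API: `Decision`, `ask_human`, and the injectable `prompt` callable are hypothetical names made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class Decision:
    """A critical decision the agent will not make on its own (hypothetical type)."""
    question: str
    options: Sequence[str]


def ask_human(decision: Decision, prompt: Callable[[str], str] = input) -> str:
    """Block until the human picks one of the offered options.

    `prompt` defaults to input() but can be swapped out, e.g. for a chat
    channel or a test double. Invalid replies are asked again.
    """
    menu = "\n".join(f"{i}. {opt}" for i, opt in enumerate(decision.options, 1))
    while True:
        reply = prompt(f"{decision.question}\n{menu}\nChoose a number: ").strip()
        if reply.isdecimal() and 1 <= int(reply) <= len(decision.options):
            return decision.options[int(reply) - 1]
```

The agent only resumes once `ask_human` returns, so the choice always comes from the person and never from the model.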

Overall, Valmis is designed to run in the cloud and collaborate with humans to get work done (Valmis means "done" and "completed" in Estonian). If you have any questions about the project, please leave a comment and I'll reply to every one.

Replies

wayneshng • today at 4:08 PM

Also, here is something fun: Valmis is probably the first AI agent that can play real chess with legal moves. LLMs are notoriously bad at chess and frequently hallucinate moves. So we gave the agent a tool called chess-engine, which requires it not to rely on text generation to produce moves, but to take each move strictly from the calculation of a lightweight built-in chess engine. With that in place, an AI can be a great (and sportsmanlike!) chess player.

This is an example I created to show how LLMs can do rigorous work. We cannot always trust the output LLMs generate (which is often hallucinated), but if we add a deterministic tool layer and instruct the model to rely only on the tool's output, we can get more accurate results. In this case, the tool is a lightweight chess engine.
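A rough sketch of that deterministic tool layer, for anyone curious. It is not the real chess-engine tool: the function names are hypothetical, and the "engine" is a stub that only knows White's legal moves in the standard starting position, standing in for a real engine that would generate and search moves for any position.

```python
from typing import Dict, List


def opening_legal_moves() -> List[str]:
    """Stub engine: White's 20 legal moves in the starting position (UCI notation)."""
    pawn_moves = [f"{f}2{f}{r}" for f in "abcdefgh" for r in "34"]
    knight_moves = ["b1a3", "b1c3", "g1f3", "g1h3"]
    return pawn_moves + knight_moves


def engine_choose(legal: List[str]) -> str:
    """Stub policy: a fixed preference order instead of a real search."""
    for move in ("e2e4", "d2d4", "g1f3"):
        if move in legal:
            return move
    return sorted(legal)[0]


def agent_turn(model_guess: str) -> Dict[str, object]:
    """The move played always comes from the engine, never from model text.

    The model's free-text guess is only checked so we can log whether it
    would have been legal; it has no influence on the move.
    """
    legal = opening_legal_moves()
    return {
        "move": engine_choose(legal),
        "model_guess_was_legal": model_guess in legal,
    }
```

Calling `agent_turn("e2e5")` still plays a legal move, because the hallucinated guess is only recorded and never executed.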
