Hacker News

koolala today at 4:01 AM

Unless there is a major change in administration, how do things not get worse and worse from here? LLMs will only get more intelligent and be seen as more of a national security risk. This brings the surveillance state deeper into every web-connected device.


Replies

gamedevo37s today at 5:06 AM

First I want to see them play video games at a high skill level, preferably without any access to game state beyond the same visual output that humans have access to, like a raster frame X number of times per second.

One LLM played Factorio, albeit very, very poorly, as can be seen if you slow the video to 0.25x playback speed and pause frequently.

https://old.reddit.com/r/factorio/comments/1u1blr6/claude_fa...

There have been streams of other games, where LLMs and AIs have likewise performed very poorly.

I recognize that LLMs might be better at language processing than at these sorts of tasks. But being able to play video games is part of general capability. And this kind of hardcore video game playing, with no access to game state, is also a general task where feigning skill is harder. If LLMs excel at pretending to be competent without actually being competent, which is arguably what this AI training approach is about:

https://en.wikipedia.org/wiki/Generative_adversarial_network

then some AIs might be trained and designed to deceive humans instead of actually being competent and capable. One response is to meet them with more difficult tests.

Basically, make tests that AIs or LLMs cannot easily cheat. Hopefully, that will encourage research into greater LLM/AI competence rather than greater ability to cheat or deceive, whether by LLM/AI researchers and companies or by the LLMs/AIs themselves.

teaearlgraycold today at 4:30 AM

You think the AI boys are going to let the administration keep this up for long?

coliveira today at 4:15 AM

Solution: get as far away from these models as you can. It is curiosity that kills the cat. If you stay away and use only open models, they cannot control your work.
