hawkice • yesterday at 5:36 PM • 8 replies

They have developed an LLM, so they are an AI lab, but the quality of that model suggests they're not a frontier anything.

Replies

leetharris • yesterday at 5:53 PM

I have the pro account for ChatGPT, Claude, Gemini, and Grok.

They all have various strengths and weaknesses. My favorite is still ChatGPT, then Gemini/Claude, then Grok.

Grok often feels 1-2 generations behind the competition in general use, but it has three things that I love:

1. It seems to be the best at understanding current events. Maybe due to X integration, or some other tool call optimization in the backend? I don't know, but I often ask about things going on, and the other models have outdated info, give unhelpful answers, etc.

2. It is generally the least sycophantic for personal things. Anthropic is getting there too. ChatGPT and Gemini are working on this, but previous models in those families would almost never say anything negative about what I am doing. Sometimes I need career advice, personal advice, etc., and I like the tone of how it responds. I think Claude will be caught up soon.

3. For professional work, there are certain topics that other models would refuse to engage with. At my last company we had an enormous number of legal users. When a deposition would need a summary on certain topics, most models would refuse. Grok would not. I understand the need for safety and I don't blame the other model providers, but for some professional use cases you NEED a model that is capable of handling sensitive subjects.

fooker • yesterday at 5:56 PM

> the quality of that model

I guess the benchmarks disagree, but whenever I need to find specific information that does not easily show up with a web search, I try ChatGPT, Gemini, and Grok. Grok surfaces what I was looking for more often than the others.

Things like "find the GitHub repo from 2017 that does $vague_thing".

beepbopboopp • yesterday at 5:39 PM

Or the model was a marketing expense to capitalize the data center model. I'm not saying it was intentionally that, but it's been an effective "that."

mbesto • yesterday at 5:56 PM

And they are planning (well, "planning" if you believe Elon) to start building their LLM over from scratch, which means they need a HUGE-ass training data center to do so, i.e. not a data center for inference.

bottlepalm • yesterday at 5:47 PM

Grok isn't at the front of the frontier, but they are there for sure.

harrall • yesterday at 6:52 PM

But supposedly they’re the cheapest for certain workloads, especially ones that have high token counts and can make use of caching.

So they’re cutting edge in that way.

gowld • yesterday at 6:31 PM

I am also an "AI lab", but I look more like a corporate cog, because that's where most of my revenue comes from and how I spend most of my time.

throwaway67678 • yesterday at 5:54 PM

[flagged]
