> Pre-training allows organizations to build domain-aware models by learning from large internal ...

upghost • today at 3:07 AM • 5 replies • view on HN

> Pre-training allows organizations to build domain-aware models by learning from large internal datasets.

> Post-training methods allow teams to refine model behavior for specific tasks and environments.

How do you suppose this works? They say "pretraining" but I'm certain that the amount of clean data available in proper dataset format is not nearly enough to make a "foundation model". Do you suppose what they are calling "pretraining" is actually SFT and then "post-training" is ... more SFT?

There's no way they mean "start from scratch". Maybe they do something like generate a heckin bunch of synthetic data seeded from company data using one of their SOA models -- which is basically equivalent to low resolution distillation, I would imagine. Hmm.

Replies

lelanthran • today at 7:47 AM

I would guess:

Pre-training: refining the weights in an existing model using more training data.

Post-training: Adding some training data to the prompt (RAG, basically).

mirekrusin • today at 5:07 AM

Probably marketing speak for full fine-tuning vs PEFT/LoRA.

gunalx • today at 7:05 AM

Probably just means SFT fine-tuning a base model, vs behavioural dpo and/or SFT fine-tuning a instruction model.

stingraycharles • today at 3:20 AM

I can imagine that, as usual, you start with a few examples and then instruct an LLM to synthesize more examples out of that, and train using that. Sounds horrible, but actually works fairly well in practice.

anon373839 • today at 3:38 AM

I think they are referring to “continued pretraining”.

alt Hacker News

Replies