For the life of me I will never understand the thought process that leads you to say "we don't really know who developed this LLM but I'm going to feed all of my business's data to it"
> I'm going to feed all of my business's data to it
Your business data is probably worthless, even considered harmful for the pretrain corpus.
Your interactions and decision making process are most valuable parts of the whole business.
You don't need to know who developed the LLM - whether it was Google or OpenAI.
What you need to know is who is the provider for the LLM, and whether their endpoints are zero data retention enabled and opted out of training. OpenRouter gives you an easy way to control this.
what can it do ? it's just a big set of numbers, if you trust the host that's good enough
If you Code open source projects anyway, might give it a spin.
How do you “feed data into a model” ? Use the correct terminology and concepts please. It is important.
It's from Tencent, says it in the article:
https://hy.tencent.com/research/hy3