you would have to train it three times for two colors. each by itself, they with both interactions...

cyanydeez • today at 10:20 AM • 2 replies • view on HN

you would have to train it three times for two colors.

each by itself, they with both interactions.

Replies

The models are already massively over trained. Perhaps you could do something like initialise the 2 new token sets based on the shared data, then use existing chat logs to train it to understand the difference between input and output content? That's only a single extra phase.

vanviegen • today at 10:24 AM

You should be able to first train it on generic text once, then duplicate the input layer and fine-tune on conversation.

alt Hacker News

Replies