logoalt Hacker News

pwythonyesterday at 4:20 PM3 repliesview on HN

For those that have homebrewed a base model, does your output have the same AI-isms like overusing em dashes? If so/not, what dataset did you use?


Replies

itissidyesterday at 4:56 PM

Does yours also use the oxford comma and generally more commas?

miki123211yesterday at 5:05 PM

AFAIK, those are mostly a consequence of posttraining.

whimsicalismyesterday at 7:12 PM

that is a post-training artifact