Hacker News

gustavoaca1997 yesterday at 7:49 PM

I have this book. It was a real lifesaver, helping me catch up a few months ago when my team decided to use LLMs in our systems.


Replies

qoez yesterday at 8:26 PM

Don't really see why you'd need to understand how the transformer works to use LLMs at work. An LLM is just a synthetic human performing reasoning, with some failure modes that in-depth knowledge of transformer internals won't help you predict (you just have to rely on experience with the output, or on other people's experiments, to get a sense of them).
