Top
Ask
Show
Best
New
LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
534 points •
gpjt
• 9 days ago •
114 comments