Hacker News
MegaTrain Full Precision Training of 100B+ Parameter LLMs on a Single GPU (github.com/dlyuangod)
1 point by adulau 27 days ago | past
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones (github.com/dlyuangod)
237 points by T-A on Jan 3, 2024 | past | 37 comments
