Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I answered the question directly. IQ4_X_S is smaller, but slower and less accurate than Q4_0. The parent comment specifically asked about the QAT version. That's literally what this thread is about. The context-length mention was relevant to show how it's only barely usable.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: