Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It’s possible they’re using some new architecture to get more up-to-date data, but I think that’d be even more of a headline.

My hunch is that this is the same 5.1 post-training on a new pretrained base.

Likely rushed out the door faster than they initially expected/planned.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: