👨‍💻
myHN
Top
New
Best
Ask
Show
Job
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
by timhigins |
View on Hacker News