N
Hacker Next
new
show
ask
jobs
submit
login
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
arxiv.org
205 points by
timhigins
9 hours ago
|
84 comments
add comment