SM
Starmorph News
new
show
ask
jobs
submit
Implementing DeepSeek R1's GRPO algorithm from scratch
github.com
192 points by
xcodevn
6 days ago
|
3 comments
add comment