Destide@feddit.uk to Programming@programming.devEnglish · 4 个月前Open-R1: a fully open reproduction of DeepSeek-R1huggingface.coexternal-linkmessage-square9fedilinkarrow-up1109arrow-down15
arrow-up1104arrow-down1external-linkOpen-R1: a fully open reproduction of DeepSeek-R1huggingface.coDestide@feddit.uk to Programming@programming.devEnglish · 4 个月前message-square9fedilink
minus-squaremanicdave@feddit.uklinkfedilinkarrow-up4arrow-down1·4 个月前All I want is a 3gb model for the raspberry pi. 7b is too big and 1.5b is too stupid.
minus-squareTomasEkeli@programming.devcakelinkfedilinkarrow-up5·4 个月前honestly both 7b and 8b are pretty dumb as well.
minus-squareMadhuGururajan@programming.devlinkfedilinkEnglisharrow-up1·4 个月前we could add so much deterministic code at 1.5GB that would start religions…
All I want is a 3gb model for the raspberry pi. 7b is too big and 1.5b is too stupid.
3B is probably also pretty dumb
honestly both 7b and 8b are pretty dumb as well.
True
we could add so much deterministic code at 1.5GB that would start religions…