inari@piefed.zip to People Twitter@sh.itjust.worksEnglish · 1 month agoManagersmedia.piefed.zipimagemessage-square176linkfedilinkarrow-up1953
arrow-up1953imageManagersmedia.piefed.zipinari@piefed.zip to People Twitter@sh.itjust.worksEnglish · 1 month agomessage-square176linkfedilink
minus-squaretheunknownmuncher@lemmy.worldlinkfedilinkarrow-up2·edit-21 month agoEven so, your numbers are still a tiny fraction of GPU units compared to concurrent users, and the limit you “imagine” is just that, imagined. And you do need to remember that the majority of the compute at these companies is used for model training and not used for inference.
Even so, your numbers are still a tiny fraction of GPU units compared to concurrent users, and the limit you “imagine” is just that, imagined.
And you do need to remember that the majority of the compute at these companies is used for model training and not used for inference.