Neo-Liberalism: Let’s hire anyone from anywhere, just the best candidates no matter what, no other values but who we can extract the most value from. Let’s also take money from the government and lobby them to defund themselves and the country’s services to give us even more money.
Oh wow! How does China steal our tech?! Why wasn’t the government funding education and security to protect us?!?
DeepSeek is not stolen tech; it was trained using novel techniques that Western companies were not using.
I thought the innovative part was using more efficient code, not what it’s trained on.
https://arxiv.org/abs/2405.20304 They invented their own reinforcement learning framework called Group Relative Policy Optimization (GRPO).
EDIT: DeepSeek publicly released the model and published its methods for the global community, and there is now an open effort by researchers to reproduce them (https://github.com/huggingface/open-r1). It is like the opposite of stealing.
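For anyone wondering what "group relative" means in practice, here is a rough sketch of the core idea as I understand it: sample several answers to the same prompt, score each one, and judge every answer against the average of its own group rather than against a separately trained value model. The function name, the `eps` term, and the example rewards are my own assumptions for illustration, and this only covers the advantage calculation, not the full training loop.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the other samples for the same prompt.

    Hypothetical helper for illustration; using the population standard
    deviation and a small eps to avoid division by zero are assumptions.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four sampled answers to one prompt (1.0 = correct).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Answers that beat their group's average get a positive advantage and are reinforced; the rest get a negative one.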
Yeah the original comment in this chain more describes US Telcos and shit, not this particular instance.
That’s capitalism’s dark secret: it’s only innovative when it has to be.
@deranger @theunknownmuncher The US trying to stifle Chinese progress and stop chip exports has had exactly the effect anyone could have foreseen: China is making leaps and bounds in all sorts of tech areas, innovating around obstacles.
That’s what they said basically.
Like. You can compile better or more diverse datasets to train a model on. But you can also have better code training on the same dataset.
The model is what the code poops out after it’s eaten the dataset. I haven’t read the paper, so I have no idea if the better training had to do with some super unique spin on their dataset, but I’m assuming it’s better code.
I’m guessing you have never done business in China.
Do you want my boss to ask me who I voted for and who I pray for? Are you crazy? That’s not their business. They HAVE to hire based on the value they’ll give to the company
You’ve distilled your own ignorance, not reality lol
It’s only ignorance if it comes from the Ignoramus region of France. Otherwise it’s just called sparkling stupidity.
Downvoted for misinformation, Ignoramus is in Italy