The guy replying to me is (as far as I can tell) the sole owner and moderator of .wtf, the instance I’ve been using up until this point. I kinda already knew they allowed AI slop, since there’s nothing in the rules that says otherwise, but this interaction really sealed my decision.

“Hey, person who makes music. If you don’t like another musician using the fascist plagiarism machine, how about you offer to create art for them? After all, if people simply donated their time and effort, maybe they wouldn’t have to resort to pissing in the face of their fellow artists of a different medium. Think about it.”
Also, I think you can donate to the instance in crypto?
Fuck right off with that.
On another note: PeerTube itself uses Whisper for automatic subtitle generation. It’s something I don’t LIKE, but I approached the devs about it and they responded very thoughtfully. I’ll admit I don’t know all the differences between locally run, open source models that are used for accessibility and the horrible plagiarism machines we all despise the most. I suspect they’re still built on exploitative tech / trained on stolen data and whatnot, and Whisper being a product of OpenAI doesn’t inspire confidence, but Framasoft only uses it to detect speech, not create it. That’s hardly “generative” at all, is it? It’s just creating subtitles. Now, that doesn’t mean the program itself is ethical given how it was likely created (as the devs acknowledge), and we SHOULD push for ethical, FLOSS methods of doing these sorts of things. I’m sure it can be done; speech recognition wasn’t exploitative before the AI boom, right? This is where my knowledge ends and I ask for feedback. Any thoughts?


Fwiw 99.9% of the time someone talks about an “open source” generative AI model, what they really mean is “open weight”.
— a random Reddit post I found when looking for a good definition to share
Some people (including the author of that definition) reject the idea that an open source model should require an open source dataset. It’s also not clear to me whether such a requirement is even supposed to mean the dataset is actually public domain, or just clearly documented (e.g. “we trained on all top 100 best-selling books from the period 2000–2020”). The former would obviously be meaningfully different from closed models as far as accusations of ethical problems in the training process go.
“Open weight” basically just means you can download the model, make some slight tweaks, and run it at home. It means the big AI companies aren’t benefiting financially from your use and can’t train on what you feed them for their next model, and because these models are typically designed to run locally rather than in a data centre, the environmental impacts are lessened. But in terms of the training process, an open-weight model is no better than a closed one.