- cross-posted to:
- technology@lemmy.zip
Microsoft’s GitHub next month plans to begin using customer interaction data – “specifically inputs, outputs, code snippets, and associated context” – to train its AI models.
No shit. GitHub is owned by Microslop. It was only a matter of time.
Surely AI training was the top bullet point on the buy pitch.
Nah, they bought it way before LLMs were a mainstream / realistic thing
There’s really not much locking us in to GitHub. Even moving an existing repo is not that hard. I started using Codeberg a few months ago and have yet to see the downside
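For anyone wondering what "not that hard" looks like in practice, here is a minimal sketch of moving the git history with a mirror clone and a mirror push. The helper names (`mirror_commands`, `migrate`) and the `repo.git` directory name are made up for illustration, and both URLs are placeholders you would supply yourself. It only covers the git data (branches, tags, other refs), not issues, pull requests, or CI settings. A mirror clone from GitHub also includes pull-request refs, which the target host may reject or ignore.

```python
import subprocess
import tempfile
from pathlib import Path


def mirror_commands(source_url: str, target_url: str, workdir: str) -> list[list[str]]:
    """Build the two git commands that copy all refs from one remote to another."""
    # Hypothetical directory name for the bare mirror clone.
    mirror_dir = str(Path(workdir) / "repo.git")
    return [
        # Bare clone that fetches every ref, not just the default branch.
        ["git", "clone", "--mirror", source_url, mirror_dir],
        # Push every ref in the mirror to the new host.
        ["git", "-C", mirror_dir, "push", "--mirror", target_url],
    ]


def migrate(source_url: str, target_url: str) -> None:
    """Run the mirror clone and push inside a throwaway directory."""
    with tempfile.TemporaryDirectory() as workdir:
        for cmd in mirror_commands(source_url, target_url, workdir):
            # check=True stops at the first failing git command.
            subprocess.run(cmd, check=True)
```

Calling `migrate(old_url, new_url)` assumes `git` is installed, an empty repository already exists at the target, and credentials for both hosts are configured.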
Bro, I don’t dig this either, but the title is a bit misleading. What they said (and they have been pretty transparent about it: a banner on the site plus an email if you have an account) is that they will train their Copilot models on user interactions with Copilot, and that you can opt out.
Now, I know the importance of defaults, but we are talking about GitHub, a platform for developers. I would REALLY assume these are the people who REALLY are able to toggle a setting to their preference, especially when they have been properly informed about it.
Let’s save the indignation for when it is justified. This was not executed in a shady way, and I would much rather Microsoft make any policy change this way.
At least that’s my opinion lol
It should be opt in, not opt out.
Date
As of April 24 you’ll be feeding the Octocat unless you opt out
Current scope
The code locker’s revised policy applies to Copilot Free, Pro, and Pro+ customers, as of April 24. Copilot Business and Copilot Enterprise users are exempt thanks to the terms of their contracts. Students and teachers who access Copilot will also be spared.
To opt out (link edited by me to make it clickable)
Those affected have the option to opt out in accordance with “established industry practices” – meaning according to US norms as opposed to European norms where opt-in is commonly required. To opt out, GitHub users should visit github.com/settings/copilot/features and disable “Allow GitHub to use my data for AI model training” under the Privacy heading.
Thank you!
Done.
Also, go Team Codeberg.
Strange, I was already opted out; must be a European thing. We are “opt-out” to a lot of things going on in the world lately.
How long until that magically reenables itself
Interestingly, mine was still enabled from the last time I must have toggled that setting.
If they do screw around, they could just train on everything without asking anyone
I would bet literally any amount of money that the button doesn’t stop the AI from training on your data.
Federated Forgejo can’t come soon enough.
GitHub: the best advertisement for Codeberg out there!
Joke’s on them. All my GitHub code is written by AI.
I’m not surprised, companies are starting to realise that AI is only as useful as the data it’s trained on. If you blast it with all the internet slop we have, completely unfiltered, it’s going to start fucking up all its responses. It’s not just about the volume of data, it’s about the quality of that data. Sites like GitHub, and academic journals, contain the exact data that companies need to create well-rounded LLMs that don’t go off on racist rants and declare themselves “MechaHitler”. That makes data like GitHub’s pure gold.
Counterpoint, I’ve poisoned it with absolute dumb shit and the worst code you’ve ever seen
Thank you for your service o7
Intentionally, right? Right?
I’m already in the process of leaving, not to Codeberg, but to a self-hosted instance of Forgejo.