AIAgreement: clarify Forgejo project's current understanding wrt/ LLM/AI copyright #381
No reviewers
Labels
No labels
[Decision] Building proposal(s)
[Decision] Collecting stakeholders
[Decision] Documenting and implementing
[Decision] Gathering advice
[Decision] Gathering criteria
[Decision] Integrating concerns
[Decision] Phrasing shared motive
Membership
Moderation
Money
Scheduled chores
User research - Accessibility
User research - Blocked
User research - Community
User research - Config (instance)
User research - Errors
User research - Filters
User research - Future backlog
User research - Git workflow
User research - Labels
User research - Moderation
User research - Needs input
User research - Notifications/Dashboard
User research - Rendering
User research - Repo creation
User research - Repo units
User research - Security
User research - Settings (in-app)
No milestone
No project
No assignees
8 participants
Notifications
Due date
Reference
forgejo/governance!381
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "mfenniak/forgejo-governance:aiagreement-copyright"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
When the AI Agreement was authored, it was not clear whether there would be a way to satisfy point 2 in the AI Agreement in the near-future. This has led to some confusion over how a contributor can satisfy this point in the agreement. This PR amends the point to represent the current state of affairs, as it's currently understood.
Adapted from @aahlenst's proposal in Forgejo Chat, but tweaked to indicate all generated content (not just code), and I've removed "We will revisit that restriction once the legal situation changes." as that feels irrelevant to communicating the policy clearly to contributors today.
Including @Gusted's comments from Forgejo Chat about this point for context:
I'm proposing this as a PR on the existing agreement as I believe this edit is consistent with the original intent of the community-accepted agreement but clarifying that we don't believe it's possible to satisfy this requirement today. If the community indicates disagreement and believes that this is a rewrite, not a clarification, then this change should be brought into the formal decision-making process rather than be a simple PR. I'm open to receiving that feedback.
@ -26,1 +25,4 @@6. It is not allowed to use AI in an autonomous-looking way to contribute in Forgejo. This also applies when someone engages in 'vibe coding' or uses so-called 'agent mode'.[^1]: Under EU law, it is unclear whether code generated by LLMs is copyrightable. Furthermore, it is almost impossible to ascertain whether code in a PR does not violate somebody else's copyright.The footnote is too narrow. Proposal: "Under EU law, it is unclear whether output of LLMs is copyrightable. Furthermore, it is almost impossible to ascertain whether output of a LLM does not violate somebody else's copyright."
bca060a1b1to6a3fa3cfaeNew commits pushed, approval review dismissed automatically according to repository settings
@ -20,3 +20,3 @@1. If content was made with the help of AI, you must convey that this is the case. This includes content that you authored but was motivated by a suggestion of AI.2. If at any point you used AI's work in your contribution you should make an effort to verify that you can submit this under the license of the repository.2. Forgejo does not accept works of authorship (code, documentation, etc.) generated by LLMs or so-called "AI" due to legal uncertainties. [^1]I don't have any remarks regarding the meaning of the new text (which I think it's clear), but I do have some remarks regarding consistency:
With the new text (including the footnote, where LLM is used twice) to me it fells that a shift is made towards preferring the LLM term over the AI one.
I don't have any preference over one term or another, but I think consistency would be nice. In the same time (especially taking into account that the changes were already approved by 4 mergers and 1 contributor), maybe my remarks should be ignored at this point.
@floss4good wrote in #381/files (comment):
I think in general, US english is more often used in the Forgejo space. So I think we should stick to the double quotes 🤔
@floss4good wrote in #381/files (comment):
Agreed
@floss4good wrote in #381/files (comment):
Constructive voices are always good and welcome - I just haven't realized it when I read it. The input is valuable in my opinion 👍
Some off-topic remarks regarding UK vs. US English (expand if you are interested)
@Beowulf wrote in #381 (comment):
Since this is not the first time when I have remarks or questions regarding American English vs. British English please check the following excerpts (if there is interest in such a topic, a separate discussion for a convention/standard should be probably started):
@earl-warren wrote in forgejo/discussions#337 (comment):
I've updated the revision to just use the term AI, consistent with the rest of the bullet-points. 👍
6a3fa3cfae..968845daf66a3fa3cfaeto968845daf6New commits pushed, approval review dismissed automatically according to repository settings
New commits pushed, approval review dismissed automatically according to repository settings
New commits pushed, approval review dismissed automatically according to repository settings
New commits pushed, approval review dismissed automatically according to repository settings
New commits pushed, approval review dismissed automatically according to repository settings
@ -20,3 +20,3 @@1. If content was made with the help of AI, you must convey that this is the case. This includes content that you authored but was motivated by a suggestion of AI.2. If at any point you used AI's work in your contribution you should make an effort to verify that you can submit this under the license of the repository.2. Forgejo does not accept works of authorship (code, documentation, etc.) generated AI due to legal uncertainties. [^1]generated by AI?
Although, that could be interpreted as "It's fine as long as parts are made by a human." What about: "Forgejo does not accept works of authorship (code, documentation, etc.) that are either partially or completely generated by AI due to legal uncertainties."?
Whoops, yes, lost the word "by" in that edit. Re-added it.
I'm not sure what interpretation would lead one to think "It's fine as long as parts are made by a human", but I don't mind this being explicit. 👍
968845daf6toa6ecb70bb7a6ecb70bb7to57bf0779beSo does it also mean i can't use ai for basic auto complete? I usually suggests the code i would write anyways and i only used it if it matches what i was currently writing anyways. I've fully disabled ai for forgejo repo🤔
@viceice wrote in #381 (comment):
As written, I would interpret it as being prohibited if the auto complete is coming from an LLM, as that's the definition of AI given. I'd say that's where the copyright risk comes in terms of "what was this trained on" and "how much plagiarizing is it doing?"
Things that I might call "basic auto complete" that would fall outside of this definition and therefore be completely fine would be:
os.Open<Tab>->os.OpenFile, coming from a prefix-search or fuzzy-search with a language server or IDE integration.forand the editor pops up a programmed snippet of a for loop in a Go program.Probably a reasonable approximation is: is it happening locally on your machine? (... and you didn't download a multi-GB LLM model from Hugging Face trained on copy-written work? 🤣)