New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Repository licensing options that are appropriate for non-code content #5
Comments
I was talking with @jakevdp at a Data Science Summit, and he mentioned he was also dealing with how to license repositories with notebooks, which contain code, prose, figures, and data. @jakevdp — any advice you have here (now or later) would be appreciated. Also let us know how you end up licensing your notebooks with diverse content types. |
I'm not sure what is best in this case. I asked about this on twitter a while ago, and the responses were pretty informative: thread. |
Thanks @jakevdp, here are some of the tweets grouped by category: Dual license the entire repo
License by type
Tweet by @earthorguk:
Tweet by @benjaminrose:
In between
|
The reason I like to do dual licenses for the whole repo is letting the inheritor decide what's appropriate. Cases where BSD for code, CC-BY for non-code cause unnecessary headaches:
I'm not sure any benefit is achieved by separating which parts are under which license, but I could be wrong about that. |
@minrk your assessment makes a lot of sense to me. I think dual licensing is the way to go.
It seems like you could always derive the separate licensing scheme from the dual licensing scheme. Therefore, I don't think there should be any major disadvantages to dual rather than separately licensing the content. |
Permission from contributors to license this repository as CC0@clairemcleod & @stephenshank (the two other people with contributions to this repository): are you okay with releasing your past work under CC0 1.0 (the public domain dedication) as well as the current BSD 3-Clause license? |
@dhimmel fine by me! |
@dhimmel Sounds good to me! |
Dual license this repository by adding a CC0 1.0 License as discussed in cognoma#5. Permission to release this repository under CC0 was received from Claire McLeod (https://git.io/vXmQn) and Stephen Shank (https://git.io/vXmQl). The only remaining contributor thus far, Daniel Himmelstein, is the author of this commit. Closes cognoma#5.
See cognoma/cancer-data#5 for the motivations to dual-license data science repositories.
Currently, this repository is licensed under a BSD 3-Clause License, which is the default license for Project Cognoma repositories. This license was chosen for its permissive and open nature as well as it's compatibility with other Greene Lab software products. However, the BSD 3-Clause License is intended for code. Rather than referencing all content, the license specifically mentions "source code".
However, as a data repository,
cognoma/cancer-data
will hold much more than source code including data, visualizations, writing, documentation, and notebooks which combine all of the aforementioned content types into a single file.As far as openly releasing content that is not only code, there's currently a short list of recommended licenses. My favorite is CC0 because it waives all copyrights and effectively places work in the public domain. This means anyone can use CC0 content without agonizing over license compatibility.
Despite being one of the most liberating licensing options available, CC0 is not an OSI-approved open source license for complicated reasons. We're still investigating the license of TCGA and Xena Browser data (see #3), but it's possible our upstream data providers may impose restrictions on our use that may require attribution, in which case CC BY is an option. However, Creative Commons recommends against using its licenses (besides CC0) for software.
Hence, switching the entire repository to a Creative Commons license doesn't make sense, since we want to remain OSI conformant. However, we also need a licensing arrangement that is appropriate for content that is not "source code". One option I can think of is multi-licensing where we apply both a CC0/CC BY License and a BSD 3-Clause License to the repository. Users would then be able to choose either license based on their use case. I want to make sure that this is a legally valid approach. What do people think (also asked on Twitter)?
The text was updated successfully, but these errors were encountered: