Open Science

Basic Principles and Best Practices

Dr. Domenico Giusti
Paläoanthropologie, Senckenberg Centre for Human Evolution and Palaeoenvironment

Download

6. Open Licensing and File Formats

Outline

  • Definitions
  • Rationale
  • How to
  • Summary
  • FAQ
  • Food for thought

Definitions

Copyright

A bundle of rights protecting the rights of authors, composers, artists, film makers and other creators by ensuring that they receive recognition, payment and protection for their works. It includes moral and economic rights. EU copyright policies

The EU copyright legislation consists of 11 directives and 2 regulations, harmonising the essential rights of authors, performers, producers and broadcasters. The EU copyright legislation

  • Directive on the legal protection of computer programs (“Software Directive”), 23 April 2009
  • Directive on the enforcement of intellectual property right (“IPRED”), 29 April 2004
  • Directive on the legal protection of databases (“Database Directive”), 11 March 1996

Licensing is the main mechanism for the exercise of copyright and related rights.

Licences are most often granted directly by the rightholder.

Copyleft

The term 'copyleft' originated in the free software community and refers to licenses that allow derivative works but require them to use the same license as the original work.

The Turing Way

The 4 Essential Freedoms

A program is free software if the program's users have the four essential freedoms:

  • The freedom (0) to run the program as you wish, for any purpose
  • The freedom (1) to study how the program works, and change it so it does your computing as you wish. Access to the source code is a precondition for this.
  • The freedom (2) to redistribute copies so you can help others.
  • The freedom (3) to distribute copies of your modified versions to others. By doing this you can give the whole community a chance to benefit from your changes. Access to the source code is a precondition for this.

Free Software Foundation

What is an Open License?

Licensing is the main mechanism for the exercise of copyright and related rights.

"A license is a legal document that grants specific rights to user to reuse and redistribute a material under some conditions. Any right that is not granted by default by the licensor through the license can be asked. Licenses can be applied to any material (e.g., sound, text, image, multimedia, software) where some exploitation or usage rights exist." The Open Science Training Handbook

An OPEN LICENSE is a license that grant permission to access, re-use, and redistribute material with few or no restrictions (it ranges from very open to very restrictive).

Defining Open Licenses (Open Definition)

A license is open if its terms satisfy the following conditions:

  1. The license must irrevocably permit (or allow) the following:

    Use, Redistribution, Modification, Separation, Compilation, Non-discrimination, Propagation, Application to Any Purpose, No Charge

  2. The license must not limit, make uncertain, or otherwise diminish the permissions required in Section 1 except by the following allowable conditions:

    Attribution, Integrity, Share-alike, Notice, Source, Technical restriction prohibition, Non-aggression

Types of Open Licenses

Software licenses (Copyleft & Non-Copyleft/Permessive)

A software license governs the extent of use or redistribution of software

  • Code: MIT

Non-software licenses

--> V. Stodden, "The Legal Framework for Reproducible Scientific Research: Licensing and Copyright," in Computing in Science & Engineering, vol. 11, no. 1, pp. 35-40, Jan.-Feb. 2009, doi: 10.1109/MCSE.2009.19.

MIT

Copyright <YEAR> <COPYRIGHT HOLDER>

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Creative Commons Licenses

"The Creative Commons copyright licenses and tools forge a balance inside the traditional “all rights reserved” setting that copyright law creates. Our tools give everyone from individual creators to large companies and institutions a simple, standardized way to grant copyright permissions to their creative work." CC Licenses

CC0

BY (Attribution Required)

SA (ShareAlike - Distributed on Same Terms)

ND (NoDerivatives - Modifying Not Allowed)

NC (NonCommercial - Commercial Use Not Allowed)

The Creative Commons Public Domain Dedication (CC0)

"Public domain" is a technical term in copyright law that refers to works not under copyright — either because they were never in copyright to begin with [...], or because their copyright term has finally lapsed and they have "fallen into" the public domain. [...] Not all jurisdictions have a public domain, and it doesn't always mean exactly the same thing in the jurisdictions that do have it. Furthermore, even where it is clear what it means, it's still not a license. To be subject to a license, a work must still be in copyright. Open Source Initiative

For example, in Germany and other European countries it is not possible to fully waive copyright, and thus fully dedicating work to the public domain is not legally possible. Instead, the CC0 license can be used as an "effective" public domain license that allows unrestricted use. The Open Science Training Handbook

CC0 “No Rights Reserved”

CC0 enables scientists, educators, artists and other creators and owners of copyright- or database-protected content to waive those interests in their works and thereby place them as completely as possible in the public domain, so that others may freely build upon, enhance and reuse the works for any purposes without restriction under copyright or database law. CC0

  • Originally created as a legal tool to release scientific databases without any restriction.

The Creative Commons Attribution Required (CC BY)

What is an Open Format

An open format is one which places no restrictions, monetary or otherwise, upon its use and can be fully processed with at least one free/libre/open-source software tool. Open Definition

Open formats

  • Text: TXT, RTF, ODT, PDF/A (.doc, .docx)
  • Tabular data: CSV, TSV (.xls, .xlsx)
  • Image: TIFF, PNG, JPG 2000, SVG (.psd, .ai)
  • 3D: PLY, OBJ
  • Spatial data: CSV, GeoTIFF, Simple Features (.shp - Shape files)
  • Audio: WAV, FLAC, OPUS
  • Video: MPEG2, Theora, VP8, VP9, AV1, Motion JPG 2000 (MJ2)

Some file formats cannot be converted to open formats, but are nonetheless archived. They are often device-specific, but have a broad user community.

Within the context of Open Science, and for optimal long-term archiving, files should not be compressed and should avoid proprietary or patent-encumbered formats and in favor of open formats based on documented standards. This ensures the access and re-usability of the content. Only unencrypted files should be published and archived. The Open Science Training Handbook

Rationale

Open Access & Open Licences & Open Formats

By "open access" to this literature, we mean its free availability on the public internet, permitting any users to read, download, copy, distribute, print, search, or link to the full texts of these articles, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose, without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. The only constraint on reproduction and distribution, and the only role for copyright in this domain, should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited. Budapest Open Access Initiative

The Open Science Training Handbook

Your Work. Your Rights.

Copyright arises automatically as soon as a work is created. For academic work (academic output), this usually means as soon as it is written. No application or registration is necessary.

https://sparcopen.org/our-work/author-rights/

Without a license, all rights are with the author [...], and that means nobody else can use, copy, distribute, or modify the work without consent. A license gives this consent. If you do not have a license for your [work], it is effectively unusable by the whole research community. https://the-turing-way.netlify.app/reproducible-research/licensing.html

A common misconception is that simply putting data/code online makes them Open data/code, free to be used, modified, and shared for any purpose. In fact, unless the data/code are accompanied by a license that grants permission for others to use, modify, and/or distribute, then the author(s) retain exclusive copyright. A license needs always to accompany the data/code to make them Open (and prevent research misconduct, such as plagiarism - the appropriation of another person's ideas, data, code, results, or words without giving appropriate credit). The Open Science Training Handbook

Transfer of Copyright Agreement

Copyright implications when signing a transfer document

Copyright can be transferred to a third party (a private individual or a legal person like a publishing company) by the creator in a written contract or an assignment

In traditional academic journals, with a 'Transfer of Copyright Agreement' the copyright is transferred from the author to the journal publishers, with minor variations -> the author transfers (all) their rights of the work to the publisher, including those pertaining to electronic forms and transmissions, and can no longer exercise control over how the publisher uses those rights.

Elsevier copyright policy

Transfer of Copyright Agreement

Rights usually retained by author(s) after signing a publishing agreement

  • to use their research data freely without any restriction
  • to make further copies of all or part of the work for private use and classroom teaching.
  • to reuse all or part of the work in a compilation of a work or text book of which you are the author.
  • to make copies of the published work for internal distribution within the institution that employs you.
  • to publicly share the preprint on any website or repository at any time.
  • to publicly share the accepted manuscript on non-commercial sites (for articles published open access) / using a CC BY-NC-ND license and usually only after an embargo period (for articles published under the subscription model)

Elsevier copyright policy

Elsevier Policy (Journal of Archaeological Science)

Author Addenda

EC Policy

How to

Considerations Before Licensing

There are a number of things you should consider before you apply a Creative Commons license to your work:

Remember the license may not be revoked • Make sure the material is appropriate for CC licensing • Specify precisely what it is you are licensing • Make sure the material is subject to copyright or similar rights • Clear rights needed to use the material • Indicate rights not covered by the license • Think about how you want the material to be used • Consider any obligations that may affect what type of license you apply • Consider offering a warranty • Specify additional permissions, if desired • Specify attribution information if desired • Indicate any non binding requests

Considerations for licensors

Considerations Before Licensing

  • The licenses and CC0 cannot be revoked. This means once you apply a CC license to your material, anyone who receives it may rely on that license for as long as the material is protected by copyright, even if you later stop distributing it.

  • You must own or control copyright in the work. Only the copyright holder or someone with express permission from the copyright holder can apply a CC license or CC0 to a copyrighted work. If you created a work in the scope of your job, you may not be the holder of the copyright.

--> Understanding your rights!

SPARC - Copyright Resources for Authors

Choose a license

License your Work

  • Stating the chosen license clearly and prominently, preferably in a machine readable format.
    • Licensing your work is as simple as marking it under the specific license you choose. This marking can be as simple as a bit of text stating the license in a copyright notice, or as complex as embedding the license information on your website using the HTML code associated with the particular license. We strongly recommend at least including a link to the applicable license.
  • Explain the liberations/limitations of the chosen license, and what barriers or restrictions may apply.
  • Let users know where they can find more information about this license.
  • Explain that the license applies to the data, and not the content that the data represents (an open license on the metadata is not the same as the content itself being open, out of copyright, or able to be used freely).
  • Explain why this license was chosen.

Data Packaging Guide

Checklist (GitHub repo)

✔️ Go to your repository folder (local computer or online repository on GitHub/GitLab/BitBucket)

✔️ Create a new file and name is License.txt or License.md based on your preference of the file format

✔️ Choose a type of license (or multiple license for mixed content) that is suitable for your project (visit choosealicense.com)

✔️ Copy the license content to the newly created file, for example, you can use an Open Source license CC-BY 4.0 for text content and MIT License for software

✔️ Save your file and add details in your README.md file

--> https://github.com/dncgst/openscience_course

The Turing Way

Summary

Summary

  • Definitions of Copyright & Open Licenses
    • MIT (software/code); CC0 (data); CC-BY (text/figures)
  • Open formats to ensure access and re-usability
  • Reasons to licence your work
    • Without a license, all rights are with the author
    • A license needs always to accompany the work to make it Open
  • How to licence your work
    • Understand yoour rights, before applying a license
    • Remember the license may not be revoked
    • Choose an open license
    • Licence your work, as simple as pasting a text (e.g., "This work is licensed under CC BY 4.0")

FAQ*

Why should I use the CC-BY license for my written/creative content?

The CC-BY license is the most permissive license that also retains some rights for the creators — the only requirement is that someone who uses, modifies, or distributes the content attributes the original creator. Other attributes of Creative Commons licenses include No Derivatives (ND), Non Commercial (NC), and Share Alike (SA), which add additional restrictions that may limit the potential use and impact of your work. Preventing derivatives with ND strongly limits the impact and use of your work, since no one else will be able to build on what you have done. Similarly, while many researchers may prefer the NC limitation to prevent companies from commercializing or making money on their work, strictly defining commercial use is challenging. Furthermore, the intent of much publicly funded research is to lead to economic development through (ventual) commercial use, which would be prevented by this license. Using an SA license allows reuse and distribution, but requires downstream works to apply the same license, limiting use and combination with other works.

The Share Alike concept is inappropriate in the scientific context because it can impose limits on the use and reuse of others’ work, which in the scientific context, should be avoided whenever possible. Stodden 2009

* The Open Science Training Handbook

Food for thought

Will Creative Commons licences encourage plagiarism? Plagiarism may affect all forms of publishing rather than specifically CC-licensed works. All CC licences requirethat the original author is attributed, meaning that any plagiaristic use is unacceptable; failure to attribute constitutes copyright infringement. It is important to stress that plagiarism is primarily an issue of academic ethics rather than the law. Guide to Creative Commons for Scholarly Publications and Educational Resources

References & further resources

Reading list

  • Guide to Creative Commons for Scholarly Publications and Educational Resources (Version final). Pascal Braak, Hans de Jonge, Giulia Trentacosti, Irene Verhagen, & Saskia Woutersen-Windhouwer. (2020, October 28). http://doi.org/10.5281/zenodo.4090923

References

Further resources