Identification and citation requirements (ENVRIwiki, page created 2020-04-01)<br />
<div>Maggie Hellstrom and Alex Vermeulen, with help from go-betweens and others<br />
<br />
The questions that were sent to the RIs are available here: [http://mediawiki.envri.eu/images/f/f5/1_-_Identification_and_citation_questions.docx 1 - Identification and citation questions.docx]<br />
<br />
The following RIs have contributed to identifying and describing ENVRIplus identification and citation requirements: [[Identification and citation in ACTRIS|<u>ACTRIS</u>]], [[Identification and Citation for AnaEE|<u>AnaEE</u>]], [[Identification and citation in EISCAT-3D|<u>EISCAT-3D</u>]], [[Identification and citation in EMBRC|<u>EMBRC</u>]], [[Identification and citation in EMSO|<u>EMSO</u>]], [[Identification and citation in EPOS|<u>EPOS</u>]], [[Identification and citation in Euro-ARGO|<u>Euro-ARGO</u>]], [[Identification and Citation in EuroGOOS|<u>EuroGOOS</u>]], [[Identification and citation in IAGOS|IAGOS]], [[Identification and citation in ICOS|ICOS]], [[Identification and citation in IS-ENES2|<u>IS-ENES2</u>]], [[Identification and citation in LTER|<u>LTER</u>]], [[Identification and citation in SEADATANET|<u>SeaDataNet</u>]], and [[Identification and citation in SIOS|<u>SIOS</u>]] (click on individual RI names to see the respective responses).<br />
<br />
== <span style="color: #BBCE00">Introduction</span> ==<br />
<br />
Identification of data (and associated metadata) throughout all stages of processing is central to any RI. This can be ensured by allocating unique and persistent digital identifiers (PIDs) to data objects throughout the data processing life cycle. PIDs allow unambiguous references to be made to data during curation and cataloguing, and they support provenance tracking. They are also a necessary requirement for correct citation (and hence attribution) of the data by end users, as this is only possible when persistent identifiers exist and are applied in the attribution.<br />
<br />
Environmental research infrastructures are often built on a large number of distributed observational or experimental sites, run by hundreds of scientists and technicians, and financially supported and administered by a large number of institutions. When the resulting data are shared under an open access policy, it therefore becomes very important to acknowledge the data sources and their providers. There is also a strong need for common data citation tracking systems that allow data providers to identify downstream usage of their data, so that they can demonstrate its importance and impact to stakeholders and the public.<br />
<br />
== <span style="color: #BBCE00">Overview and summary of identification and citation requirements</span> ==<br />
<br />
'''Identification'''<br />
<br />
The survey found large diversity in practices among the RIs. Most use file-based storage for their data rather than database technologies, which suggests that it should be relatively straightforward to assign PIDs to a majority of the RI data objects. The survey also revealed a profound gap in knowledge about what persistent and unique identifiers are, what they can be used for, and best practices regarding their use. Most identifier systems in use are based on handles (DOIs from DataCite are most common, followed by ePIC PIDs), but some RIs rely on formalized file names. While a majority see a strong need for assigning PIDs to their “finalized” data (individual files and/or databases), few apply this to raw data, and even fewer to intermediate data, indicating that PIDs are not used in workflow administration. Metadata objects are also seldom assigned PIDs. Costs for maintaining PIDs are typically not treated explicitly.<br />
<br />
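To illustrate the distinction between handle-based identifiers and formalized file names, the following minimal Python sketch maps an identifier to a resolver URL. It relies only on two general facts: DOIs are handles whose prefix starts with "10." and resolve through doi.org, while other handles (such as ePIC PIDs) resolve through the global Handle proxy at hdl.handle.net. The function name <code>resolver_url</code> and the identifiers in the usage note are hypothetical placeholders, not taken from any RI's response.<br />
<br />
```python
def resolver_url(pid: str) -> str:
    """Return a resolver URL for a handle-based identifier (illustrative helper).

    DOIs are handles whose prefix starts with "10."; they are resolved
    through doi.org. Any other handle (for example an ePIC PID) is sent
    to the global Handle proxy.
    """
    pid = pid.strip()
    prefix, sep, suffix = pid.partition("/")
    # A handle always has the form prefix/suffix; a bare file name does not.
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a prefix/suffix identifier: {pid!r}")
    if prefix.startswith("10."):
        return f"https://doi.org/{pid}"
    return f"https://hdl.handle.net/{pid}"
```
<br />
For example, <code>resolver_url("10.1234/example")</code> yields a doi.org URL, whereas a formalized file name such as <code>station_2016.nc</code> raises <code>ValueError</code> because it cannot be resolved globally on its own.<br />
<br />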
'''Citation'''<br />
<br />
Currently, users refer to data sets in publications using DOIs if available, and otherwise provide information about producer, year, report number etc., either in the article text or in the References section. A majority of RIs feel it is absolutely necessary to allow unambiguous references to be made to subsets of data sets, preferably in the citation, while few consider the ability to create and later cite collections of individual data sets important. Ensuring that credit for producing (and to a lesser extent curating) scientific data sets is “properly assigned” is a common theme for all RIs, not least because funding agencies and other stakeholders require such performance indicators, but also because individual PIs want and need recognition of their work. Connected to this, most RIs have strategies for collecting usage statistics for their data products, i.e. through bibliometric searches (quasi-automated or manual) of the scientific literature; these strategies therefore often depend on publishers also indexing data object DOIs.<br />
<br />
'''Conclusion'''<br />
<br />
The use of persistent and unique identifiers for both data and metadata objects throughout the entire data life cycle needs to be encouraged, e.g. by providing training and best-use cases. There is strong support for promoting “credit” to data collectors through data citation standards that support adding specific subsetting information to a basic (DOI-based) reference.<br />
<br />
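As a rough illustration of adding subsetting information to a basic DOI-based reference, the sketch below formats a citation string with an optional subset description. The class name <code>DataCitation</code>, its fields and the output layout are assumptions made for this example; they do not represent an agreed ENVRIplus or DataCite citation format, and the DOI in the usage note is a placeholder.<br />
<br />
```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DataCitation:
    """Hypothetical citation record; field names are illustrative only."""
    creators: str
    year: int
    title: str
    publisher: str
    doi: str                      # DOI of the complete data set
    subset: Optional[str] = None  # free-text description of the subset used

    def format(self) -> str:
        # Basic reference built around the DOI of the whole data set.
        base = (f"{self.creators} ({self.year}). {self.title}. "
                f"{self.publisher}. https://doi.org/{self.doi}")
        # Append the subset description only when one was given.
        if self.subset:
            return f"{base} [Subset used: {self.subset}]"
        return base
```
<br />
A call such as <code>DataCitation("Doe, J.", 2016, "Example data set", "Example RI", "10.1234/example", subset="station X, 2015").format()</code> keeps the DOI of the full data set as the core of the reference and appends the subset description, so that both the data set and the part actually used remain identifiable.<br />
<br />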
[[Category:ENVRI RI Requirements]]</div>