Creation Guidelines for Seismic Network Operators

Introduction

Data from seismic networks needs better recognition, and its use by the seismological community needs to be easier to acknowledge. To support this, GFZ’s GEOFON data centre, in conjunction with the Geophysical Instrument Pool Potsdam (GIPP), helps network operators create Digital Object Identifiers (DOI) for their data sets. Many data sets in other research communities are also gaining DOI. This is in line with the FDSN Recommendation [1] on this subject.

GEOFON can create, or “mint” a DOI on your behalf. National libraries, data centres and other organisations may also be able to do this for you. (The actual DOI name is not chosen by you, but by the DOI publisher, or “minter”; if we do this, it will be of the form 10.14470/xxx98765.) If you choose to use another agency to mint your DOI, we recommend you stay close to the suggestions below. These recommendations are based mainly on the adopted FDSN Recommendation [1], but also seek compatibility with European recommendations like INSPIRE [2]. [6] and [8] FDSN’s mapping service [3] helps authors discover DOIs assigned to seismic networks.

This document helps you understand the information we need to mint your DOI. As PI or operator, the DOI and its data is your summary of your data, so you want to prepare it just as carefully as you would an abstract to summarise a scientific article.

The DOI and its metadata

On the surface, a DOI appears to be just a label for an object, identified by a URL. (The actual DOI name is not chosen by you, but by the DOI publisher, or “minter”; if we do this, it will be of the form 10.14470/xxnnnnnn.) But behind this label is associated metadata describing the object, such as its creators, descriptions and other relevant information, and references to related objects. These related objects may include other publications, such as a data report. (This “bibliographic” metadata should not be confused with the “seismic” metadata related to waveforms, such as the information in dataless SEED headers or Station XML, i.e. “inventory”.) It also includes a URL of a “landing page” on the Internet, which helps users learn more about the object. To prepare for DOI minting, you may need to gather some information to better describe your seismic network.

This metadata is collected in an XML record defined by the DataCite Metadata Working Group [4]. The GEOFON data centre uses this with our existing inventory information (seismic metadata) to augment the network page describing your network.

Metadata fields we will need from you before we are able to create a DOI:

AUTHOR – the person or persons (or organisations) which should be seen as primary creators of the data of the seismic network. There may be more than one of these. For temporary networks we recommend the principal investigators and partners, while for permanent networks the organisations involved may be most suitable. If an author has a global resolvable identifier (e.g. ORCID [orcid]) this should also be incldued, as it is one way to unequivocally identify a person. Please also include affiliation(s).

YEAR – The year in which data were first collected.

TITLE – A short name (5-10 words) of the network, experiment, or deployment.

DESCRIPTION – A short “abstract”, no more than 200-300 words long. Include the number, location, and types of sensors, and the type of data collected. Suggested components: description, aim and scope–geodynamic setting–geographical coverage–instrument types and number–data policy (fully open/restricted/embargoed)–any other peculiarities of the network.

CONTACT – Who should be contacted in case of questions, such as data and metadata quality, or to request access if the data is restricted? The contact will not be published but is meant for internal use. A global identifier (e.g. ORCID [orcid]) would also be useful if the contact is a person.

LICENSE – Provide a rights management statement for the resource or reference a service providing such information. Include embargo information if applicable, and in particular a date at which an embargo period ends. Use the complete title of a license and include version information if applicable. For instance, Creative Commons Attribution 4.0 [5]. (We are still working to understand what licensing models are most appropriate for seismic data.) If known, the SPDX [SPDX] identifier for your license is helpful.

KEYWORDS/SUBJECTS – Judiciously chosen key words make it easier to discover the data set. (We will suggest key words and subjects from standard vocabularies such as the legacy GCMD Keyword Management Service and GEMET INSPIRE Spatial Data Themes Please choose from the GEOFON controlled vocabulary

FUNDER – Institution(s) that provided financial support for the development of the data set, and possibly additional information e.g. funder identifier, grant number, etc.

Where possible we collect e-mail addresses and globally resolvable identifers (such as ORCID [orcid]) as these assist greatly in contacting and uniquely identifying contributors. Be aware that these will be collected by other data catalogues and services and may be spread widely. Ensure that you have consent for publishing these details from all people mentioned. You are welcome to discuss any privacy concerns about these with us.

If you have related publications, or a data report, which the network’s DOI metadata should refer to, let us know their unique identifiers (DOI, Handle, ISBN, URL etc.). There is much room in the metadata model for additional information, and we welcome innovative suggestions about ways to use this.

Recognising Contributors

Many people other than the stated “authors” are involved in making data sets available. There is considerable room within the DataCite model for additional contributors, with identified roles [7] which we encourage DOI producers to use so that all parties involved in making data available on line can be adequately acknowledged. For example, GIPP will appear as a Sponsor for pool deployments. [If contributors are persons (not institutions) and have a global ID (e.g. ORCID [orcid]) this should also be provided for unambiguous identification as in the case of authors.]

_images/doi-geofon-fig1.png

Information flow in the DOI world. Given a DOI name, “10.14470/asdfad” the landing page URL, at http://geofon.gfz-potsdam.de/ in this case, can be found. This landing page presents information about the seismic network which is compiled from two sources of metadata, and displays a recommended citation string.

Conclusion

A DOI adds visibility to your seismic network, and helps you track citations of it e.g. at DataCite. Good quality metadata including a meaningful description will help others discover the data you have provided. For instance, FDSN provides a service [3] to get the proper citation text to reference a seismic network based on the metadata related to its DOI to help authors discover DOIs assigned to seismic networks.

Finally, don’t forget to encourage users of your data to complete the citation loop, by citing it using the DOI you have created!

For further information, contact

Deutsches Geoforschungszentrum Potsdam Telegrafenberg, 14473 Potsdam

[1](1, 2) A. Clark, P. L. Evans, A. Strollo, “FDSN recommendations for seismic network DOIs and related FDSN services”, http://www.fdsn.org/wgIII/V1.0-21Jul2014-DOIFDSN.pdf, doi:10.7914/D11596
[2]INSPIRE ref
[3](1, 2) FDSN, “Network Citations” web page [DOI mapping service], http://www.fdsn.org/networks/citation/
[4]Metadata Working Group, “DataCite Metadata Schema for the Publication and Citation of Research Data”, Version 3.1 October 2014, doi:10.5438/0010
[5]Creative Commons license page, CC-BY-4.0 <https://creativecommons.org/>
[orcid](1, 2, 3, 4) ORCID <https://orcid.org/>
[SPDX]The Software Package Data Exchange (SPDX) License List is at https://spdx.org/licenses/
[6]GFZ Statement on Data Principles, <https://media.gfz-potsdam.de/gfz/wv/doc/16/GFZ_Daten_Grundsaetze+Erg_en.pdf>
[7]The roles available in the Datacite schema v4.3 are: DataCollector, DataCurator, DataManager, Distributor, Editor, HostingInstitution, Producer, ProjectLeader, ProjectManager, ProjectMember, RegistrationAgency, RegistrationAuthority, RelatedPerson, ResearchGroup, RightsHolder, Researcher, Sponsor, Supervisor, WorkPackageLeader, ContactPerson, Other.
[8]Allianz der deutschen Wissenschaftsorganisationen, “Grundsätze zum Umgang mit Forschungsdaten” (German) <http://www.allianzinitiative.de/de/handlungsfelder/forschungsdaten/grundsaetze.html>_

Some questions

Q: What is a DOI anyway? A: See http://www.doi.org

Q: What if I already have a DOI? A: Great, we will display it. To best promote your seismic network, you should use a single DOI for all citations of your data. Otherwise citations of your data might be split between two or more different DOI.

Q: What else can go in DataCite metadata? A: Lots of stuff. See the GEOFON network pages for 5E and GE [doi:10.14470/TR560404], and the “DataCite Metadata Schema Documentation for the Publication and Citation of Research Data”, Version 4.1 at https://schema.datacite.org/ . doi:10.5438/0014

Q: Does GEOFON own my data? A: No, the existence of a DOI says nothing in itself about ownership or usage. The DOI metadata should make clear who created the data/network, and what the role of each participant was.

Q: Do I need a license? A: Choosing a license is important to make sure users are clear on how they may download and use your data, to require attribution, to address liability, and more. See the DCC’s “How to License Research Data”, http://www.dcc.ac.uk/resources/how-guides

Q: What about restricted/embargoed data? A: A DOI’s metadata can (and should) contain a statement concerning licensing and usage rights. GFZ’s data policy [6] allows limited embargo periods in certain cases. Embargoed data can have a DOI. Please advise us when the embargo period is scheduled to end.

Q: Where is my GEOFON network page today? A: If your FDSN network code is “ZZ”, its page is http://geofon.gfz-potsdam.de/waveform/archive/network.php?ncode=ZZ If you operate a temporary network with a reused network code, include the first year of operation: http://geofon.gfz-potsdam.de/waveform/archive/network.php?ncode=ZZ&year=2002

Indices and tables