Publication Instructions
Publishing data and software is the act of making data available for re-use by others in citable form. This provides a means to gain visibility and credit for one’s scientific endeavours and offers increased transparency and reproducibility of the scientific process. A data publication represents a stand-alone and ideally self-describing, research product. Due to the combination of data and metadata and supported by the use of persistent identifiers (PIDs) and controlled disciplinary vocabularies, data publications can be regarded as a best practice for sharing data following the FAIR Principles for research data management. Data and software publications with Digital Object Identifiers (DOI) are fully citable in research articles and should be included in the reference lists.
The data repository of GFZ Data Services makes data citable by assigning DOIs to data and scientific software. To foster reuse, emphasis is placed on cross referencing scholarly publications describing the full provenance of a dataset (methods, parameters and how the data were created) with the published datasets. GFZ Data Services increases the interoperability of long-tail data through:
- the provision of comprehensive domain-specific data description via standardised and machine-readable metadata with controlled domain vocabularies via an online Metadata Editor (see the ELMO Guide for instructions)
- complementing the metadata with comprehensive and standardised technical data descriptions or reports, and
- by embedding the research data in wider context by providing cross-references through Persistent Identifiers (DOI, IGSN, ORCID, Fundref) to related research products (text, data, software) and people or institutions involved.
- For guidance on file formats, please refer to the Data File Instructions. The Quick Start Guide summarises the publication instructions
Timing of data publication: For most supplementary data to research articles, the publication of the data will be timely correlated with the publication of the article.
Review Links: Review links show a preview of the future DOI Landing Page with the correct citation (the DOI is already reserved and will not change) and full access to the data and associated documents. The review links were developed for two purposes: (1) proofreading by the author and (2) providing data access for referees during the review process of articles (see below). Please keep in mind: our review links are static and every change is requiring the generation of a new review link.
Restricted data access during the review period of articles: Recently, publishers (e.g. AGU) require access to data for referees during the review process of manuscripts. To enable this restricted access to the still unpublished data, we can create a review links.
Metadata Editor
To be self-describing and re-usable, every data publication should contain rich metadata. Metadata refers to “data about the data” and comprises all information necessary (for humans and machines) to find, read, and interpret the data.
A key tool for metadata generation is the online GFZ Metadata Editor which assists scientists to create metadata compliant with metadata schemas popular in the Earth sciences (DataCite, ISO19115). Users can fill in their metadata and make a copy to their local hard disk to continue work at a later time. Scientists are not requested to provide information that may be generated automatically. To improve usability, form fields names are translated from the metadata schema to the language familiar to scientists. The GFZ Metadata Editor also provides search functionality of structured vocabulary lists. In addition, multiple geospatial references can be entered via an interactive mapping tool, which helps to minimize problems with different conventions to provide latitudes and longitudes.
In November 2025, GFZ Data Services launched ELMO – a completely new version of our traditional metadata editor. The foundations for the ELMO metadata editor were laid in 2024 by students at the University of Applied Sciences Potsdam in collaboration with GFZ Data Services. The IT development team of GFZ Data Services then continued to optimize ELMO.
ELMO is not only a new web interface, but also contains many revised functionalities that increase the quality of metadata and the FAIRness of the data describing it, while at the same time simplifying the entry of information for researchers. For example, authors' names and institutions can be automatically entered into the form by entering the ORCID (provided that the relevant information is set to ‘public’ in the ORCID profiles (read more about how to manage your ORCID profile in Lorenz et al. 2023). In addition, affiliations are selected from a drop-down menu that is linked to the Registry of research institutions (ROR), thus enabling unique identifiers for institutes and research funders in machine-readable form.
- ELMO is available in dark and light mode and - thanks to its modular structure - is highly flexible and expandable.
- ELMO draws on controlled vocabularies from established sources such as GCMD Science Keywords, GCMD Platforms and GCMD Instruments (GCMD = Global Change Master Directory developed by NASA). Regular automatic updates ensure that these vocabularies are always up to date, guaranteeing the long-term quality and consistency of the metadata.
- The application supports multilingualism through a flexible language file system – new languages can be easily added and are automatically recognised by ELMO.
- Practical features such as drag-and-drop sorting of authors, marking contact persons via toggle switch, and an autosave function make working with the editor intuitive and efficient.
- Like its predecessor, ELMO exports metadata in valid XML files in accordance with DataCite and ISO 19115 standards – a decisive advantage for interoperability in international research contexts.
In addition to the generic ELMO, we provide a second instance of that was specifically developed to describe data derived from laboratories. It includes detailed information (controlled vocabularies) that were jointly developed within the network Multi-scale laboratories of the European Plate Observing System (EPOS ERIC).
To access any of these instances or read the Metadata Editor Guidance, please follow the links below:
- ELMO - GFZ Metadata Editor 2.0
- EPOS-MSL - GFZ Metadata Editor 2.0 for the EPOS Multi-scale laboratories
Guidance
Data Description Templates
Even with a very rich metadata schema, published data are often not sufficiently described for being reusable. Additional technical descriptions, like the definition of column headers of tabular data, the description of the experiment setup, documentation of software tools used to manipulate the data or an overview of a complex file structure or specific data format, are necessary to fully understand the data. We have further observed that many researchers are unaware what a data publication represents and especially what to include in a data description. This observation, together with the aim to further standardize MSL data, led to the development of Data Description Templates. The templates are provided in commented and "usable" versions. Data descriptions with the templates are fully harmonized, curated by the repository staff and provided as a PDF data description in the data download folder.
Data Submission Workflow
| 1. Author: | Submits metadata using the GFZ Metadata Editor. Data can be sent via Email or ftp upload link (provided by GFZ Data Services) |
|---|---|
| 2. Curator: | Controls submitted metadata and data description for completeness and consistency |
| 3. Curator: | (If required) converts and allocates data on GFZ Data Services FTP server |
| 4. Curator: | Sends review link for proofreading by author |
| 5. Author: | Proofreads and confirms edited data |
| 6. Curator: | Publishes dataset and registers the DOI |