Editing Model Overview

Jump to: navigation, search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 9: Line 9:
 
[[File:DataLifecycle2.png]]
 
[[File:DataLifecycle2.png]]
 
<div style='margin-left: 320px;'>'''The Research Data Lifecycle'''</div>
 
<div style='margin-left: 320px;'>'''The Research Data Lifecycle'''</div>
 
  
 
==== <span style="color: #BBCE00">Data Acquisition</span> ====
 
==== <span style="color: #BBCE00">Data Acquisition</span> ====
Line 52: Line 51:
 
[[File:DLCStageRelationships.png]]
 
[[File:DLCStageRelationships.png]]
 
<div style='margin-left: 80px;'>'''Illustration of the major integration (reference) points between different phases of the data lifecycle.'''</div>
 
<div style='margin-left: 80px;'>'''Illustration of the major integration (reference) points between different phases of the data lifecycle.'''</div>
 
  
 
The integration points described as follows refer to the components supporting a phase of the data lifecycle. However, the components being integrated can be within the same research infrastructure or in different research infrastructures.  
 
The integration points described as follows refer to the components supporting a phase of the data lifecycle. However, the components being integrated can be within the same research infrastructure or in different research infrastructures.  
Line 67: Line 65:
  
 
Depending on the distribution of resources in an implemented infrastructure, some of these integration points may not be present in the infrastructure. They take particular importance however when considering scenarios where a research infrastructure delegates or outsources functionalities to other infrastructures. For example, EPOS and LifeWatch both delegate data acquisition and some data curation activities to national-level and/or domain-specific infrastructures, but provide data processing services over the data held by those infrastructures. Thus reference points 4 and 5 become of great importance to the construction of those projects.
 
Depending on the distribution of resources in an implemented infrastructure, some of these integration points may not be present in the infrastructure. They take particular importance however when considering scenarios where a research infrastructure delegates or outsources functionalities to other infrastructures. For example, EPOS and LifeWatch both delegate data acquisition and some data curation activities to national-level and/or domain-specific infrastructures, but provide data processing services over the data held by those infrastructures. Thus reference points 4 and 5 become of great importance to the construction of those projects.
 
 
=== <span style="color: #BBCE00">Common Functions within a Common Lifecycle</span> ===
 
 
Analysis of requirements of environmental research infrastructures during the ENVRI and ENVRIplus projects has resulted in the identification of a set of common functionalities. These functionalities can be classified according to the five phases of the data lifecycle. The requirements encompass a range of concerns, from the fundamental (''e.g.'' data collection and storage, data discovery and access and data security) to more specific challenges (''e.g.'', data versioning, instrument monitoring and interactive visualisation).
 
 
In order to better manage the range of requirements, and in order to ensure rapid verification of compliance with the ENVRI-RM, a ''minimal model'' has been identified which describes the fundamental functionality necessary to describe an environmental research infrastructure. The minimal model is a practical tool to produce a partial specification of a research infrastructure which nonetheless reflects the final shape of the complete infrastructure without the need for significant refactoring. Further refinement of the models using the ENVRI-RM allow producing more refined models of designated priority areas, according to the purpose for which the models are created.
 
 
[[File:SpiderDiagram01.png|900px]]
 
<div style='margin-left: 80px;'>'''Radial depiction of ENVRI-RM requirements. The black labels correspond to  the minimal model requirements.'''</div>
 
 
 
The definitions of the minimal set of functions are given as follows (a full list of common functions is provided in [[Appendix A Common Requirements of Environmental Research Infrastructures|Appendix A Common Requirements of Environmental Research Infrastructures]]):
 
 
'''''(A)  Data Acquisition'''''
 
 
'''Process Control''': Functionality that receives input status, applies a set of logic statements or control algorithms, and generates a set of analogue / digital outputs to change the logic states of devices.
 
 
'''Data Collection''': Functionality that obtains digital values from a sensor instrument, associating consistent timestamps and necessary metadata.
 
 
'''Data Transmission''': Functionality that transfers data over a communication channel using specified network protocols.
 
 
'''''(B)  Data Curation'''''
 
 
'''Data Quality Checking''': Functionality that detects and corrects (or removes) corrupt, inconsistent or inaccurate records from datasets.
 
 
'''Data Identification''': Functionality that assigns (global) permanent unique identifiers to data products.
 
 
'''Data Cataloguing''': Functionality that associates a data object with one or more metadata objects which contain data descriptions.
 
 
'''Data Product Generation''': Functionality that processes data against requirement specifications and standardised formats and descriptions.
 
 
'''Data Storage & Preservation''': Functionality that deposits (over the long-term) data and metadata or other supplementary data and methods according to specified policies, and then to make them accessible on request.
 
 
'''''(C)  Data Publishing'''''
 
 
'''Access Control''': Functionality that approves or disapproves of access requests based on specified access policies.
 
 
'''Metadata Harvesting''': Functionality that (regularly) collects metadata in agreed formats from different sources.
 
 
'''Resource Registration''': Functionality that creates an entry in a resource registry and inserts a resource object or a reference to a resource object with specified representation and semantics.
 
 
'''Data Publication''': Functionality that provides clean, well-annotated, anonymity-preserving datasets in a suitable format, and by following specified data-publication and sharing policies to make the datasets publically accessible or to those who agree to certain conditions of use, and to individuals who meet certain professional criteria.
 
 
'''Data Citation''': Functionality that assigns an accurate, consistent and standardised reference to a data object, which can be cited in scientific publications and/or from other data collections.
 
 
'''Semantic Harmonisation''': Functionality that unifies similar data (knowledge) models based on the consensus of collaborative domain experts to achieve better data (knowledge) reuse and semantic interoperability.
 
 
'''Data Discovery and Access''': Functionality that retrieves requested data from a data resource by using suitable search technology.
 
 
'''''(D).  Data Processing'''''
 
 
'''Data Assimilation''': Functionality that combines observational data with output from a numerical model to produce an optimal estimate of the evolving state of the system.
 
 
'''Data Analysis''': Functionality that inspects, cleans, and transforms data, providing data models which highlight useful information, suggest conclusions, and support decision making.
 
 
'''Data Mining''': Functionality that supports the discovery of patterns in large datasets.
 
 
'''Data Extraction''': Functionality that retrieves data out of (unstructured) data sources, including web pages, emails, documents, PDFs, scanned text, mainframe reports, and spool files.
 
 
'''Scientific Modelling and Simulation''': Functionality that supports the generation of abstract, conceptual, graphical or mathematical models, and to run an instances of those models.
 
 
'''Scientific Workflow Enactment''': Functionality provided as a specialisation of Workflow Enactment supporting the composition and execution of computational or data manipulation steps in a scientific application. Important processing results should be recorded for provenance purposes.
 
 
'''Data Processing Control''': Functionality that initiates calculations and manages the outputs to be returned to the client.
 
 
'''''(E)  Data use'''''
 
 
'''Authentication''': Functionality that verifies the credentials of a user.
 
 
'''Authorisation''': Functionality that specifies access rights to resources.
 
  
 
[[Category:ENVRI Reference Model]]
 
[[Category:ENVRI Reference Model]]

Please note that all contributions to may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see Copyrights for details). Do not submit copyrighted work without permission!

Cancel Editing help (opens in new window)