Editing Example 1: Using the Reference Model to Guide Research Activities (EISCAT 3D - EGI)

Jump to: navigation, search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 25: Line 25:
 
=== <span style="color: #BBCE00">Analysis of the EISCAT 3D Data Infrastructure</span> ===
 
=== <span style="color: #BBCE00">Analysis of the EISCAT 3D Data Infrastructure</span> ===
  
The initial challenge for the pilot project is to understand the EISCAT 3D data infrastructure. The existing design documents of EISCAT 3D has been focused on the incoherent scatter radar technologies. As shown in Figure 1, its data infrastructure is embedded within the overall design of the observatory system that is difficult for a computer scientist/technologist having little physics knowledge background to understand.
+
The initial challenge for the pilot project is to understand the EISCAT 3D data infrastructure. The existing design documents of EISCAT 3D has been focused on the incoherent scatter radar technologies. As shown in Figure 1, its data infrastructure is embedded within the overall design of the observatory system that is difficult for a computer scientist/technologist having little physics knowledge background to understand.  
 
 
[[File:EISCAT_3D_infrastructure.jpg]]
 
 
 
'''Figure 1''': The original design of EISCAT 3D data infrastructure is embedded within the overall observatory system design
 
 
 
 
 
We use the [[File:lightbulb_on.png|16px]] [[Model Overview|'''ENVRI_Common_Subsystem''']] framework to decompose the computational elements, clarifying the boundary between the radar network and data infrastructure, which results in Figure 2. This diagram now, instead of Figure 1, is frequently used in presentations and discussions of the EISCAT 3D data infrastructure.
 
 
 
[[File:EISCAT_3D_Infrastructure_interprated_by_Reference_Model.png]]
 
 
 
'''Figure 2''': Using the 5 ENVRI Common Subsystem to interpret the EISCAT 3D data infrastructure makes it easy to communicate with computer scientists/technologists
 
 
 
Figure 2 illustrates that the EISCAT 3D functional components can be placed into 2 ENVRI common subsystems, [[File:lightbulb_on.png|16px]] [[Model Overview|'''subsys_acq''']] and [[File:lightbulb_on.png|16px]] [[Model Overview|'''subsys_cur''']]. Briefly, at the [[File:lightbulb.png|16px]] [[Model Overview|'''subsys_acq''']], the raw signal voltage data will be generated by the antenna Receivers at the speed of 125 TB/hr, and be temporarily stored in a ''Ring buffer''. A second stream of RF signal voltages will be passed to a ''Beam-former'' to generate the beam-formed data (1MHz). Continually, the beam-formed data will be processed by a ''Correlator'' to generate correlation analysis data based on standard methods. Then, the correlation data will be delivered to a ''Fitter'' to produce the fitted data (1GB/year). In order to support different user requirements, EISCAT 3D will allow users to access and process the raw voltage data in the ''Ring buffer'' and to generate the specialised products based on self-defined analysis algorithms. Both raw data and their products will be stored in ''Intermediate storage'' (11PB/year), from where they will be delivered to the central site within the curation subsystem.
 
 
 
In [[File:lightbulb.png|16px]] [[Model Overview|'''the curation subsystem''']], ''Long-Term Storage'' will preserve the raw voltage data and their products. A ''High Performance Computer'' will be used for data searching and processing (e.g., beam forming, lag profiling or other correlation, and parameter fitting). Searching facilities will enable user to search over all data products and to identify significant data signatures. A ''Multi-static fitter'' will be installed to process the stored raw voltage data to generate the 3D plasma parameters that will then be stored back in ''Long-Term Storage''. A complete copy of ''Long-Term Storage'' data will be established at mirror sites; related data processing and searching tools will be provided.
 
 
 
While it is made clear that the design specification covers 2 of 5 common subsystems described in the ENVRI Reference Model, we understand functionalities of the other 3 subsystems are currently missing. The reason of this is likely due to resource limitations. However, the absent 3 subsystems are crucial for a big data system such as EISCAT 3D. Without providing services to support data discovery, access, processing and user community, the value of EISCAT 3D big data cannot be unlocked, and expensively generated and archived scientific data will be useless. 
 
 
 
Using the Reference Model as the analysis tool, we identified the missing pieces of the design specification, which gives the direction for future investigation.
 
 
 
=== <span style="color: #BBCE00">Analysis of EGI Enabling Services and Construction of an Integrated Infrastructure</span> ===
 
  
 
We need to understand the functionalities of EGI services and how to integrate them to support the EISCAT 3D requirements.
 
We need to understand the functionalities of EGI services and how to integrate them to support the EISCAT 3D requirements.

Please note that all contributions to may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see Copyrights for details). Do not submit copyrighted work without permission!

Cancel Editing help (opens in new window)