The Polar Environment Data Science Center (PEDSC) was launched in 2017 as one of the centers of the Joint Support-Center for Data Science Research (DS) of the Research Organization of Information and Systems (ROIS). In the following sections, brief histories of the ROIS, DS, and PEDSC are introduced in the sections 2 and 3, the data treated by the PEDSC and previous data activities and current status are described in the sections 4 and 5, the tasks and activity plan of the PEDSC are described in section 6, and the activities of the PEDSC during 2017-2019 are described in section 7.
The ROIS was established in 2004 as an Inter-university Research Institute Corporation, consisting of four national institutes: the National Institute of Polar Research (NIPR), the National Institute of Informatics (NII), the Institute of Statistical Mathematics (ISM) and the National Institute of Genetics (NIG). The third term of the ROIS started in 2016. The DS was established in 2016 at the start of the third term as a flagship platform of the ROIS. The main purpose of the DS is to support activities for data sharing, data analysis, and data scientist human resource development in universities and other external communities, responding to the needs of the times for the open science and data centric science. The history and current organization structure of the ROIS can be seen in the Web sites (https://www.rois.ac.jp/en/about/rois.html and https://www.rois.ac.jp/en/about/organization.html, respectively), and the structure of the DS can be seen in the Web site (https://ds.rois.ac.jp/en_aboutus/en_outline/).
The PEDSC is a center of the DS designed to support the data activities associated with polar science. The planning phase for the PEDSC started in 2015, and a preparatory office was made in the NIPR in 2016. After detailed planning for the schedule and structure of the PEDSC in the preparatory office, the PEDSC has been formally launched in 2017. The PEDSC has a close relationship with the scientific observation and research activities of the NIPR. The main purpose of the PEDSC is to strengthen and promote the opening and sharing of the polar science data for further broader communities, closely collaborating with the polar science research community led by the NIPR. Internationally, the PEDSC intends to play a key role of the National Data Center (NDC) for polar science in Japan, corresponding to international bodies for the Antarctic and Arctic sciences. The goals of the PEDSC are to play a central role of the data activities in the polar science research in Japan, to create a new data centric polar science, and to contribute to the advances of the global environmental studies. The current staff of the PEDSC consists of five research members, three assistants, and six concurrent advisory members from the NIPR. Details of the PEDSC can be seen in its Web site (http://pedsc.rois.ac.jp/en/).
Data treated by the PEDSC are obtained by scientific research observations and activities in the Antarctic and Arctic regions. Various observations and research activities have been carried out in both polar regions by research scientists collaborating with the NIPR in various research fields, including space and upper atmospheric sciences, meteorology, marine science, glaciology, solid earth science, and bioscience. The data are obtained in various forms, e.g., digital data in various recording media, or samples stored in various containers, etc. Those data are processed and analyzed afterward and converted to the secondary physically meaningful data, and then used for obtaining scientific results. In order to obtain accurate scientific results, reliability of the data should be ensured. For that purpose, it is required that the data should be securely stored, not be lost, not be degraded, not be falsified, and be reusable for everyone to obtain the same results. On the other hand, it is also required for doing multidisciplinary or cross-disciplinary studies, such as the study of global environmental change, to use data in various research fields together to invent new findings. In such a case, information on the location and attribute of those various data (metadata) should be treated in a unified manner. And also, it could be possible that some data in some research field are applied to the other unrelated research field to get unpredictable new findings and new values. For such a case, public property and accessibility of the data are important. The task of the PEDSC is to support such activities for processing, analyzing, archiving, sharing, and opening polar science data to promote further collaboration with external communities, as shown in Figure 1.
The Japanese Antarctic Research Expedition (JARE) was started in 1956, and Japanese Antarctic station Syowa was established in 1957. In the Arctic region, the NIPR Arctic research station was established in 1991 at Ny-Ålesund on Spitsbergen in the Svalbard Archipelago. In the history of the scientific activities in both polar regions over the past 60 years, databases have been created in a variety of research fields. Those data activities were done by data managers in each research field, and depend on the human, hardware, and software resources unique to each team, resulting in various current status of the data processing, database creation, and opening data to the public for each observation dataset. It is also a current problem that an integrated database covering all the research fields of polar science has not yet been constructed, and it is difficult to do a cross-disciplinary search, use, and access. On the other hand, some database systems to treat various data in various research fields have been developed, such as the ‘NIPR Science Database (NIPR-SDB)’ (https://scidbase.nipr.ac.jp/), ‘Arctic Data archive System (ADS)’ (https://ads.nipr.ac.jp/), and ‘Inter-university Upper atmosphere Global Observation NETwork system (IUGONET)’ (http://www.iugonet.org/).
The NIPR-SDB has been constructed and operated since 2007 as a metadata database covering almost all the data in both polar regions. The NIPR-SDB has a close relationship with international data activities, especially with the Standing Committee on Antarctic Data management (SCADM) of the Scientific Committee on Antarctic Research (SCAR) and the Global Change Master Directory (GCMD) of the National Aeronautics and Space Administration (NASA) (Kanao, et al., 2013; 2014; 2018). The NIPR-SDB is some kind of a complete data catalog. The ADS has been constructed and operated since 2012 mainly for the specific Arctic projects in Japan (GRENE (Green Network of Excellence) (https://www.nipr.ac.jp/grene/e/index.html) and ArCS (Arctic Challenge for Sustainability) (https://www.arcs-pro.jp/en/index.html)). The ADS is not only a metadata database system but also has a function for the online visualization and analysis of actual data. The ADS has a close relationship with international Arctic research community such as the International Arctic Science Committee (IASC). The IUGONET has been developed and operated as a Japanese inter-university research project among four Japanese universities and the NIPR (and 6 collaborative organizations), as shown in http://www.iugonet.org/about/organization.jsp?lang=en, since 2009 to build the metadata database for ground-based observations of the Earth’s upper atmosphere. Currently, ROIS-DS-PEDSC plays a central role in the IUGONET project in the sense that one research member of the PEDSC is the leader of the IUGONET system development team and the PEDSC is doing a financial support to employ the chief engineering research staff in the development team. The PEDSC is also responsible for supplying the related polar science data of the NIPR to the IUGONET system. The IUGONET also develops the software to analyze the observation data provided by various universities/institutes, closely collaborating with the Space Physics Environment Data Analysis System (SPEDAS) community in the United States (Hayashi, et al., 2013; Tanaka, et al., 2013; Yatagai, et al., 2014; Angelopoulos, et al., 2019). Data treated by the IUGONET are obtained not only in the polar region but also in the low and middle latitudes. As described above, each purpose and background of each existing database system is different from each other, and an integrated database system to search, visualize, and analyze all the data in polar region in all the research fields has not yet been constructed.
As for the publication of the polar science data, in 1968 the NIPR launched the ‘JARE Data Reports’ (http://polaris.nipr.ac.jp/~library/publication/pub/pube.html#anchor200820) for the data obtained in the JARE activities, and the ‘NIPR Arctic Data Reports’ (https://nipr.repo.nii.ac.jp/?action=repository_opensearch&index_id=784) in 1996 for the data obtained in the Arctic region. In January, 2017, a new data journal ‘Polar Data Journal’ (PDJ) (https://pdr.repo.nii.ac.jp/) was launched by the NIPR, collaborating with the Japanese Institutional Repositories Online Cloud (JAIRO Cloud) system of the NII. The PDJ is a free-access, peer-reviewed online journal, and publishes not only the data paper but also the related data itself with assigning its own DOI (Digital Object Identifier). The NIPR Arctic Data Reports were merged into the PDJ in 2017, and the JARE Data Reports will also be merged into the PDJ in the future.
Based on the above-mentioned current status and scientific needs, the PEDSC defines the following 7 tasks to undertake between 2017 and 2021.
Figure 2 shows the relationship among the NIPR, PEDSC, and external community associated with the above seven tasks. The PEDSC plays a role of a bridge between the NIPR and the external community to support the data activity in polar science. Figure 3 shows the annual schedule of the PEDSC activity during the five years associated with the seven tasks, which was planned at the start of the PEDSC in 2017. It should be noted that the research staff in the PEDSC consists of the persons who are in charge of the NIPR-SDB, ADS, and IUGONET systems, and the editorial board members of the PDJ.
|2017–01||Building “digital archive” system for utilizing historical pictures about Antarctic and Arctic regions|
|2017–02||Data publishing framework for multi-station airglow imaging with IUGONET system|
|2017–03||Development of Antarctic rock samples repository|
|2018–01||Database of optical flow using data from Longyearbyen and Iceland|
|2018–02||Development of Antarctic rock samples repository (cont.)|
|2018–03||Development of database and data analysis system of ground-based atmospheric electric field observation data for global electrical circuit study|
|2018–04||Development of a real time database system for Cosmic Ray data observed at Syowa Station|
|2018–05||Construction of archiving/information disclosure system for infrasound observation in Antarctica|
|2018–06||Development of integrated database using various kinds of optical data obtained in South pole and McMurdo stations|
|2018–07||Redundant data server system for the service of the real-time information of geomagnetic field data and indices|
|2018–08||“Science and Data” -- In search of future directions of Open Science and Data-driven Science (meeting)|
|2019–01||Development of a real time database system for Cosmic Ray data observed at Syowa Station (cont.)|
|2019–02||Sharing and Visualization of 1-100Hz ELF Waveform Data Obtained at Syowa Station|
|2019–03||Method for detection of the snowy cloud and estimation of snow falling amount based on NOAA/AVHRR images|
|2019–04||Development of Antarctic rock samples repository (cont.)|
|2019–05||Construction of archiving/information disclosure system for infrasound observation in Antarctica (cont.)|
|2019–06||Application of machine learning to hypocenter determination for Seismic observation network in Showa base|
|2019–07||Development of a signal processing method for improving quality of wind and turbulence products obtained by atmospheric radars|
|2019–08||Research for participatory consortium for accelerating data-driven science|
|2019–09||Problems on mutual use and integrated analysis of solid earth science data (meeting)|
|International Workshop on Sharing, Citation and Publication of Scientific Data across Disciplines
|5–7 Dec.2017||Tachikawa, Tokyo||73||8||43|
|International Workshop on Data Science– Present & Future of Open Data & Open Science –https://ds.rois.ac.jp/article/dsws_2018/||12–15 Nov.2018||Mishima, Shizuoka||120||10||58|
The authors have no competing interests to declare.
Akira Kadokura, Masaki Kanao, Hironori Yabuki, Yoshimasa Tanaka, and Koji Nishimura are all members of the Polar Environment Data Science Center (PEDSC), and are involved with all of the activities of the PEDSC described in this manuscript. A. K. wrote this manuscript. M. K. is primarily responsible for the description on the NIPR Science Database; H. Y. is primarily responsible for the description on the ADS system; Y. T. is primarily responsible for the description on the IUGONET system and integrated database; and K. N. is primarily responsible for the description of the PANSY radar data.
Angelopoulos, V, Cruce, P, Drozdov, A, et al. 2019. The Space Physics Environment Data Analysis System (SPEDAS). Space Sci Rev, 215: 9. DOI: https://doi.org/10.1007/s11214-018-0576-4
Hayashi, H, Koyama, Y, Hori, T, Tanaka, Y, Abe, S, Shinbori, A, Kagitani, M, Kouno, T, Yoshida, D, UeNo, S, Kaneda, N, Yoneda, M, Umemura, N, Tadokoro, H and Motoba, T. 2013. IUGONET project team, Inter-university Upper Atmosphere Global Observation NETwork (IUGONET). Data Sci. J., 12: WDS179–WDS184. DOI: https://doi.org/10.2481/dsj.WDS-030
Kanao, M, Kadokura, A, Okada, M, Yamnouchi, T, Shiraishi, K, Sato, N and Parsons, MA. 2013. The State of IPY Data Management: The Japanese Contribution and Legacy. CODATA Data Science Journal, 12: WDS124–WDS128. DOI: https://doi.org/10.2481/dsj.WDS-021
Kanao, M, Okada, M, Friddell, J and Kadokura, A. 2018. Science Metadata Management, Interoperability and Data Citations of the National Institute of Polar Research, Japan. CODATA Data Science Journal, 17(1): 1–6. DOI: https://doi.org/10.5334/dsj-2018-001
Kanao, M, Okada, M and Kadokura, A. 2014. Metadata Management at the Polar Data Center of the National Institute of Polar Research, Japan. CODATA Data Science Journal, 13: PDA27–PDA31. DOI: https://doi.org/10.2481/dsj.IFPDA-05
Sato, K, Tsutsumi, M, Sato, T, Nakamura, T, Saito, A, Tomikawa, Y, Nishimura, K, Kohma, M, Yamagishi, H and Yamanouchi, T. 2014. Program of the Antarctic Syowa MST/IS Radar (PANSY). J. Atmos. Solar-Terr. Phys., 118A: 2–15. DOI: https://doi.org/10.1016/j.jastp.2013.08.022
Tanaka, Y, Shinbori, A, Hori, T, Koyama, Y, Abe, S, Umemura, N, Sato, Y, Yagi, M, Ueno, S, Yatagai, A, Ogawa, Y and Miyoshi, Y. 2013. Analysis software for upper atmospheric data developed by the IUGONET project and its application to polar science. Adv. Polar Sci., 24: 231–240. DOI: https://doi.org/10.3724/SP.J.1085.2013.00231
Yatagai, A, Tanaka, Y, Abe, S, Shinbori, A, Yagi, M, Ueno, S, Koyama, Y, Umemura, N, Nosé, M, Hori, T, Sato, Y, Hashiguchi, NO and Kaneda, N. 2014. IUGONET project team, Interuniversity upper atmosphere global observation network (IUGONET) metadata database and analysis software. Data Sci. J., 13. DOI: https://doi.org/10.2481/dsj.IFPDA-07