Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

CosmoHub and SciPIC: Massive cosmological data analysis, distribution and generation using a Big Data platform

J. CarreteroInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) SpainPau TalladaCentro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT) Madrid, SpainJordi CasalsCentro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT) Madrid, SpainMarc CaubetCentro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT) Madrid, SpainF. J. CastanderInstitute of Space Sciences, IEEC-CSIC, Campus UAB, Carrer de Can Magrans, s/n 08193 Barcelona, SpainL. BlotInstitute of Space Sciences, IEEC-CSIC, Campus UAB, Carrer de Can Magrans, s/n 08193 Barcelona, SpainA. AlarconInstitute of Space Sciences, IEEC-CSIC, Campus UAB, Carrer de Can Magrans, s/n 08193 Barcelona, SpainS. SerranoInstitute of Space Sciences, IEEC-CSIC, Campus UAB, Carrer de Can Magrans, s/n 08193 Barcelona, SpainP. FosalbaInstitute of Space Sciences, IEEC-CSIC, Campus UAB, Carrer de Can Magrans, s/n 08193 Barcelona, SpainCarles Acosta‐SilvaInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) SpainN. TonelloInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) SpainF. TorradeflotInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) SpainMartin EriksenInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) SpainChristian NeissnerInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) SpainM. DelfinoInstitut de Física d'Altes Energies (IFAE), The Barcelona Institute of Science and Technology Campus UAB, 08193 Bellaterra (Barcelona) Spain
2017en
ABI

Аннотация

Galaxy surveys require support from massive datasets in order to achieve precise estimations of cosmological parameters. The CosmoHub platform (https://cosmohub.pic.es), a web portal to perform interactive analysis of massive cosmological data, and the SciPIC pipeline have been developed at the Port d'Informaci\'o Científica (PIC) to provide this support, achieving nearly interactive performance in the processing of multi-terabyte datasets. Cosmology projects currently supported include European Space Agency Euclid space mission, the Dark Energy Survey (DES), the Physics of the Accelerating Universe (PAU) survey and the Marenostrum Institut de Ciències de l'Espai Simulations (MICE). Support for additional projects can be added as needed. CosmoHub enables users to interactively explore and distribute data without any SQL knowledge. It is built on top of Apache Hive, part of the Apache Hadoop ecosystem, which facilitates reading, writing, and managing large datasets. More than 50 billion objects, from public and private data, as well as observed and simulated data, are available. Over 500 registered scientists have produced about 2000 custom catalogs occupying 10TiB in compressed format over the last three years. All those datasets can be interactively explored using an integrated visualization tool. The current implementation allows an interactive analysis of 1.1 billion object datasets to complete in 45 seconds. The SciPIC scientific pipeline has been developed to efficiently generate mock galaxy catalogs using as input a dark matter halo population. It runs on top of the Hadoop platform using Apache Spark, which is an open-source cluster-computing framework. The pipeline is currently being calibrated to populate the full sky Flagship dark matter halo catalog produced by the University of Zürich, which contains about 44 billion dark matter haloes in a box size of 3.78 Gpc/h. The resulting mock galaxy catalog is directly stored in the CosmoHub platform.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 2Использованных источников: 0