Astroinformatics

Hyperion proto-supercluster unveiled by measurements and examination of archive data

Astroinformatics is an interdisciplinary field of study involving the combination of astronomy, data science, machine learning, informatics, and information/communications technologies. [2] [3] The field is closely related to astrostatistics.

Background

Astroinformatics is primarily focused on developing the tools, methods, and applications of computational science, data science, machine learning, and statistics for research and education in data-oriented astronomy. [2] Early efforts in this direction included data discovery, metadata standards development, data modeling, astronomical data dictionary development, data access, information retrieval, [4] data integration, and data mining [5] in the astronomical Virtual Observatory initiatives. [6] [7] [8] Further development of the field, along with astronomy community endorsement, was presented to the United States National Research Council in 2009 in the astroinformatics "state of the profession" position paper for the 2010 Astronomy and Astrophysics Decadal Survey. [9] That position paper provided the basis for a subsequent, more detailed exposition of the field in the Earth Science Informatics paper "Astroinformatics: Data-Oriented Astronomy Research and Education". [2]

Astroinformatics as a distinct field of research was inspired by work in the fields of geoinformatics, cheminformatics, and bioinformatics, and by the eScience work [10] of computer scientist Jim Gray at Microsoft Research, whose legacy was remembered and continued through the Jim Gray eScience Awards. [11]

Although the primary focus of astroinformatics is on the large, worldwide, distributed collection of digital astronomical databases, image archives, and research tools, the field also recognizes the importance of legacy data sets, using modern technologies to preserve and analyze historical astronomical observations. Some astroinformatics practitioners help to digitize historical and recent astronomical observations and images into large databases for efficient retrieval through web-based interfaces. [3] [12] Another aim is to help develop new methods and software for astronomers, and to facilitate the processing and analysis of the rapidly growing amount of data in astronomy. [13]

Astroinformatics is described as the "fourth paradigm" of astronomical research. [14] There are many research areas involved with astroinformatics, such as data mining, machine learning, statistics, visualization, scientific data management, and semantic science. [7] Data mining and machine learning play significant roles in astroinformatics as a scientific research discipline due to their focus on "knowledge discovery from data" (KDD) and "learning from data". [15] [16]
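As a rough illustration of "learning from data", the sketch below labels an unclassified object by majority vote among its nearest labeled neighbours (k-nearest-neighbour classification). It is a minimal example rather than a method taken from the cited works: the two features (an "extendedness" measure and a colour index), the training values, and the function name `knn_predict` are all hypothetical and chosen only to make the idea concrete.

```python
import math
from collections import Counter


def knn_predict(training, query, k=3):
    """Label `query` by majority vote among its k nearest training examples."""
    # Sort labeled examples by Euclidean distance to the query point.
    nearest = sorted(training, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Synthetic, made-up training set: (extendedness, colour index) -> class label.
training = [
    ((0.10, 0.30), "star"),
    ((0.15, 0.25), "star"),
    ((0.12, 0.40), "star"),
    ((0.80, 0.90), "galaxy"),
    ((0.75, 1.10), "galaxy"),
    ((0.90, 0.95), "galaxy"),
]

label_compact = knn_predict(training, (0.14, 0.33))
label_extended = knn_predict(training, (0.82, 1.00))
print(label_compact, label_extended)
```

Real survey classifiers use far more features and far larger training sets, but the pattern is the same: labeled examples are used to assign labels to new observations.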

The amount of data collected from astronomical sky surveys has grown from gigabytes to terabytes over the past decade and is predicted to grow in the next decade into hundreds of petabytes with the Large Synoptic Survey Telescope and into the exabytes with the Square Kilometre Array. [17] This plethora of new data both enables and challenges effective astronomical research, so new approaches are required. Partly for this reason, data-driven science is becoming a recognized academic discipline. Astronomy and other scientific disciplines are developing information-intensive and data-intensive sub-disciplines to such an extent that these sub-disciplines are becoming, or have already become, standalone research disciplines and full-fledged academic programs. Although many educational institutions do not yet offer an astroinformatics program, such programs are likely to be developed in the near future.

Informatics has recently been defined as "the use of digital data, information, and related services for research and knowledge generation". However, the more commonly used definition is "informatics is the discipline of organizing, accessing, integrating, and mining data from multiple sources for discovery and decision support." The discipline of astroinformatics therefore includes many naturally related specialties, including data modeling and data organization. It may also include transformation and normalization methods for data integration and information visualization, as well as knowledge extraction, indexing techniques, information retrieval, and data mining methods. Classification schemes (e.g., taxonomies, ontologies, folksonomies, and/or collaborative tagging [18] ) and astrostatistics are also heavily involved. Citizen science projects (such as Galaxy Zoo) contribute highly valued novelty discovery, feature meta-tagging, and object characterization within large astronomy data sets. All of these specialties enable scientific discovery across varied massive data collections, collaborative research, and data re-use, in both research and learning environments.
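One concrete form of "integrating data from multiple sources" is positional cross-matching, in which records from two catalogues are joined when they lie close together on the sky. The sketch below is a minimal, brute-force illustration under stated assumptions: the catalogue contents, the field names (`ra`, `dec`, `g_mag`, `flux_mjy`), the 1-arcsecond match radius, and the function names are all hypothetical, and real pipelines would use spatial indexing rather than comparing every pair.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees between two positions given in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    # Haversine formula, which stays accurate for very small separations.
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))


def cross_match(catalog_a, catalog_b, radius_arcsec=1.0):
    """Join each source in catalog_a to its nearest catalog_b source within the radius."""
    matches = []
    for a in catalog_a:
        best, best_sep = None, radius_arcsec / 3600.0
        for b in catalog_b:
            sep = angular_separation_deg(a["ra"], a["dec"], b["ra"], b["dec"])
            if sep <= best_sep:
                best, best_sep = b, sep
        if best is not None:
            # Merge the measurements from both catalogues into one record.
            extra = {k: v for k, v in best.items() if k not in ("id", "ra", "dec")}
            matches.append({**a, **extra,
                            "match_id": best["id"],
                            "sep_arcsec": best_sep * 3600.0})
    return matches


# Made-up example catalogues (positions in degrees).
optical = [
    {"id": "opt-1", "ra": 150.0000, "dec": 2.0000, "g_mag": 19.2},
    {"id": "opt-2", "ra": 150.0100, "dec": 2.0100, "g_mag": 20.5},
]
radio = [
    {"id": "rad-7", "ra": 150.0001, "dec": 2.0001, "flux_mjy": 3.1},
    {"id": "rad-9", "ra": 151.0000, "dec": 3.0000, "flux_mjy": 0.8},
]

matches = cross_match(optical, radio)
for m in matches:
    print(m["id"], "<->", m["match_id"], round(m["sep_arcsec"], 2), "arcsec")
```

The nested loop costs time proportional to the product of the catalogue sizes, which is why the indexing techniques mentioned above matter at survey scale.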

In 2012, two position papers [19] [20] were presented to the Council of the American Astronomical Society, leading to the establishment of formal working groups in astroinformatics and astrostatistics for the profession of astronomy within the US and elsewhere. [21]

Astroinformatics provides a natural context for the integration of education and research. [22] The experience of research can now be brought into the classroom to establish and grow data literacy through the easy re-use of data. [23] Other uses include repurposing archival data for new projects, linking literature to data, and intelligent retrieval of information. [24]

Conferences

Year  Place
2021  Caltech
2020  Harvard
2019  Caltech
2018  Heidelberg, Germany
2017  Cape Town, South Africa
2016  Sorrento, Italy
2015  Dubrovnik, Dalmatia
2014  University of Chile
2013  Australia Telescope National Facility, CSIRO
2012  Microsoft Research
2011  Sorrento, Italy
2010  Caltech

Additional conferences and conference lists:

Machine Learning in Astronomy: Possibilities and Pitfalls (2022)
The Astrostatistics and Astroinformatics Portal (ASAIP) big list of conferences
Astronomical Data Analysis Software and Systems (ADASS) annual conferences

References

  1. "Largest Galaxy Proto-Supercluster Found - Astronomers using ESO's Very Large Telescope uncover a cosmic titan lurking in the early Universe". www.eso.org. Retrieved 18 October 2018.
  2. Borne, Kirk D. (12 May 2010). "Astroinformatics: data-oriented astronomy research and education". Earth Science Informatics. 3 (1–2): 5–17. doi:10.1007/s12145-010-0055-2. S2CID 207393013.
  3. Kirov, Nikolay. "Astroinformatics and digitization of astronomical heritage". The fifth SEEDI International Conference Digitization of cultural and scientific heritage, May 19–20, 2010, Sarajevo. Archived 2017-12-26 at the Wayback Machine. Retrieved 1 November 2012.
  4. Borne, Kirk (2000). "Science User Scenarios for a Virtual Observatory Design Reference Mission: Science Requirements for Data Mining". arXiv:astro-ph/0008307.
  5. Borne, Kirk (2008). "Scientific Data Mining in Astronomy". In Kargupta, Hillol; et al. (eds.). Next generation of data mining. London: CRC Press. pp. 91–114. ISBN 9781420085860.
  6. Borne, Kirk D (2003). "Distributed data mining in the National Virtual Observatory". In Dasarathy, Belur V (ed.). Data Mining and Knowledge Discovery: Theory, Tools, and Technology V. Vol. 5098. pp. 211–218. doi:10.1117/12.487536. S2CID 28195520.
  7. Borne, Kirk (2013). "Virtual Observatories, Data Mining, and Astroinformatics". Planets, Stars and Stellar Systems. pp. 403–443. doi:10.1007/978-94-007-5618-2_9. ISBN 978-94-007-5617-5.
  8. Laurino, O.; D’Abrusco, R.; Longo, G.; Riccio, G. (21 December 2011). "Astroinformatics of galaxies and quasars: a new general method for photometric redshifts estimation". Monthly Notices of the Royal Astronomical Society. 418 (4): 2165–2195. arXiv:1107.3160. Bibcode:2011MNRAS.418.2165L. doi:10.1111/j.1365-2966.2011.19416.x. S2CID 7115554.
  9. Borne, Kirk (2009). "Astroinformatics: A 21st Century Approach to Astronomy". Astro2010: The Astronomy and Astrophysics Decadal Survey. 2010: P6. arXiv:0909.3892. Bibcode:2009astro2010P...6B.
  10. "Online Science". Talks by Jim Gray. Microsoft Research. Retrieved 11 January 2015.
  11. "Jim Gray eScience Award". Microsoft Research.
  12. Ball, Nicholas M.; Schade, David. "Astroinformatics in Canada". Retrieved 1 November 2012.
  13. "'Astroinformatics' helps Astronomers explore the sky". Phys.org. Heidelberg University. Retrieved 11 January 2015.
  14. Hey, Tony (October 2009). "The Fourth Paradigm: Data-Intensive Scientific Discovery". Microsoft Research.
  15. Ball, N.M.; Brunner, R.J. (2010). "Data Mining and Machine Learning in Astronomy". International Journal of Modern Physics D. 19 (7): 1049–1106. arXiv:0906.2173. Bibcode:2010IJMPD..19.1049B. doi:10.1142/S0218271810017160. S2CID 119277652.
  16. Borne, K; Becla, J; Davidson, I; Szalay, A; Tyson, J. A; Bailer-Jones, Coryn A.L (2008). "The LSST Data Mining Research Agenda". AIP Conference Proceedings. pp. 347–351. arXiv:0811.0167. doi:10.1063/1.3059074. S2CID 118399971.
  17. Ivezić, Ž; Axelrod, T; Becker, A. C; Becla, J; Borne, K; Burke, D. L; Claver, C. F; Cook, K. H; Connolly, A; Gilmore, D. K; Jones, R. L; Jurić, M; Kahn, S. M; Lim, K.-T; Lupton, R. H; Monet, D. G; Pinto, P. A; Sesar, B; Stubbs, C. W; Tyson, J. A; Bailer-Jones, Coryn A.L (2008). "Parametrization and Classification of 20 Billion LSST Objects: Lessons from SDSS". AIP Conference Proceedings. Vol. 1082. pp. 359–365. arXiv:0810.5155. doi:10.1063/1.3059076. S2CID 117914490.
  18. Borne, Kirk. "Collaborative Annotation for Scientific Data Discovery and Reuse". Bulletin of the ASIS&T. American Society for Information Science and Technology. Archived from the original on 5 March 2016. Retrieved 11 January 2016.
  19. Borne, Kirk. "Astroinformatics in a Nutshell". asaip.psu.edu. The Astrostatistics and Astroinformatics Portal, Penn State University. Retrieved 11 January 2016.
  20. Feigelson, Eric. "Astrostatistics in a Nutshell". asaip.psu.edu. The Astrostatistics and Astroinformatics Portal, Penn State University. Retrieved 11 January 2016.
  21. Feigelson, E.; Ivezić, Ž.; Hilbe, J.; Borne, K. (2013). "New Organizations to Support Astroinformatics and Astrostatistics". Astronomical Data Analysis Software and Systems XXII. 475: 15. arXiv:1301.3069. Bibcode:2013ASPC..475...15F.
  22. Borne, Kirk (2009). "The Revolution in Astronomy Education: Data Science for the Masses". Astro2010: The Astronomy and Astrophysics Decadal Survey. 2010: P7. arXiv:0909.3895. Bibcode:2009astro2010P...7B.
  23. "Using Data in the Classroom". Science Education Resource Center at Carleton College. National Science Digital Library. Retrieved 11 January 2016.
  24. Borne, Kirk. Astroinformatics: Data-Oriented Astronomy (PDF). George Mason University, USA. Retrieved January 21, 2015.