Astroinformatics

Last updated
Hyperion proto-supercluster unveiled by measurements and examination of archive data The Hyperion Proto-Supercluster.jpg
Hyperion proto-supercluster unveiled by measurements and examination of archive data

Astroinformatics is an interdisciplinary field of study involving the combination of astronomy, data science, machine learning, informatics, and information/communications technologies. [2] [3] The field is closely related to astrostatistics.

Contents

Data-driven astronomy (DDA) refers to the use of data science in astronomy. Several outputs of telescopic observations and sky surveys are taken into consideration and approaches related to data mining and big data management are used to analyze, filter, and normalize the data set that are further used for making Classifications, Predictions, and Anomaly detections by advanced Statistical approaches, digital image processing and machine learning. The output of these processes is used by astronomers and space scientists to study and identify patterns, anomalies, and movements in outer space and conclude theories and discoveries in the cosmos.

Background

Astroinformatics is primarily focused on developing the tools, methods, and applications of computational science, data science, machine learning, and statistics for research and education in data-oriented astronomy. [2] Early efforts in this direction included data discovery, metadata standards development, data modeling, astronomical data dictionary development, data access, information retrieval, [4] data integration, and data mining [5] in the astronomical Virtual Observatory initiatives. [6] [7] [8] Further development of the field, along with astronomy community endorsement, was presented to the National Research Council (United States) in 2009 in the astroinformatics "state of the profession" position paper for the 2010 Astronomy and Astrophysics Decadal Survey. [9] That position paper provided the basis for the subsequent more detailed exposition of the field in the Informatics Journal paper Astroinformatics: Data-Oriented Astronomy Research and Education. [2]

Astroinformatics as a distinct field of research was inspired by work in the fields of Geoinformatics, Cheminformatics, Bioinformatics, and through the eScience work [10] of Jim Gray (computer scientist) at Microsoft Research, whose legacy was remembered and continued through the Jim Gray eScience Awards. [11]

Although the primary focus of astroinformatics is on the large worldwide distributed collection of digital astronomical databases, image archives, and research tools, the field recognizes the importance of legacy data sets as well—using modern technologies to preserve and analyze historical astronomical observations. Some Astroinformatics practitioners help to digitize historical and recent astronomical observations and images in a large database for efficient retrieval through web-based interfaces. [3] [12] Another aim is to help develop new methods and software for astronomers, as well as to help facilitate the process and analysis of the rapidly growing amount of data in the field of astronomy. [13]

Astroinformatics is described as the "fourth paradigm" of astronomical research. [14] There are many research areas involved with astroinformatics, such as data mining, machine learning, statistics, visualization, scientific data management, and semantic science. [7] Data mining and machine learning play significant roles in astroinformatics as a scientific research discipline due to their focus on "knowledge discovery from data" (KDD) and "learning from data". [15] [16]

The amount of data collected from astronomical sky surveys has grown from gigabytes to terabytes throughout the past decade and is predicted to grow in the next decade into hundreds of petabytes with the Large Synoptic Survey Telescope and into the exabytes with the Square Kilometre Array. [17] This plethora of new data both enables and challenges effective astronomical research. Therefore, new approaches are required. In part due to this, data-driven science is becoming a recognized academic discipline. Consequently, astronomy (and other scientific disciplines) are developing information-intensive and data-intensive sub-disciplines to an extent that these sub-disciplines are now becoming (or have already become) standalone research disciplines and full-fledged academic programs. While many institutes of education do not boast an astroinformatics program, such programs most likely will be developed in the near future.

Informatics has been recently defined as "the use of digital data, information, and related services for research and knowledge generation". However the usual, or commonly used definition is "informatics is the discipline of organizing, accessing, integrating, and mining data from multiple sources for discovery and decision support." Therefore, the discipline of astroinformatics includes many naturally-related specialties including data modeling, data organization, etc. It may also include transformation and normalization methods for data integration and information visualization, as well as knowledge extraction, indexing techniques, information retrieval and data mining methods. Classification schemes (e.g., taxonomies, ontologies, folksonomies, and/or collaborative tagging [18] ) plus Astrostatistics will also be heavily involved. Citizen science projects (such as Galaxy Zoo) also contribute highly valued novelty discovery, feature meta-tagging, and object characterization within large astronomy data sets. All of these specialties enable scientific discovery across varied massive data collections, collaborative research, and data re-use, in both research and learning environments.

In 2007, the Galaxy Zoo project [19] was launched for morphological classification [20] [21] of a large number of galaxies. In this project, 900,000 images were considered for classification that were taken from the Sloan Digital Sky Survey (SDSS) [22] for the past 7 years. The task was to study each picture of a galaxy, classify it as elliptical or spiral, and determine whether it was spinning or not. The team of Astrophysicists led by Kevin Schawinski in Oxford University were in charge of this project and Kevin and his colleague Chris Linlott figured out that it would take a period of 3–5 years for such a team to complete the work. [23] There they came up with the idea of using Machine Learning and Data Science techniques for analyzing the images and classifying them. [24]

In 2012, two position papers [25] [26] were presented to the Council of the American Astronomical Society that led to the establishment of formal working groups in astroinformatics and Astrostatistics for the profession of astronomy within the US and elsewhere. [27]

Astroinformatics provides a natural context for the integration of education and research. [28] The experience of research can now be implemented within the classroom to establish and grow data literacy through the easy re-use of data. [29] It also has many other uses, such as repurposing archival data for new projects, literature-data links, intelligent retrieval of information, and many others. [30]

Methodology

The data retrieved from the sky surveys are first brought for data preprocessing. In this, redundancies are removed and filtrated. Further, feature extraction is performed on this filtered data set, which is further taken for processes. [31] Some of the renowned sky surveys are listed below:

The size of data from the above-mentioned sky surveys ranges from 3  TB to almost 4.6  EB. [31] Further, data mining tasks that are involved in the management and manipulation of the data involve methods like classification, regression, clustering, anomaly detection, and time-series analysis. Several approaches and applications for each of these methods are involved in the task accomplishments.

Classification

Classification [40] is used for specific identifications and categorizations of astronomical data such as Spectral classification, Photometric classification, Morphological classification, and classification of solar activity. The approaches of classification techniques are listed below:

Regression

Regression [41] is used to make predictions based on the retrieved data through statistical trends and statistical modeling. Different uses of this technique are used for fetching Photometric redshifts and measurements of physical parameters of stars. [42] The approaches are listed below:

Clustering

Clustering [43] is classifying objects based on a similarity measure metric. It is used in Astronomy for Classification as well as Special/rare object detection. The approaches are listed below:

Anomaly detection

Anomaly detection [45] is used for detecting irregularities in the dataset. However, this technique is used here to detect rare/special objects. The following approaches are used:

Time-series analysis

Time-Series analysis [46] helps in analyzing trends and predicting outputs over time. It is used for trend prediction and novel detection (detection of unknown data). The approaches used here are:

Conferences

YearPlaceLink
2021 Caltech
2020 Harvard
2019 Caltech
2018 Heidelberg, Germany
2017 Cape Town, South Africa
2016 Sorrento, Italy
2015 Dubrovnik, Dalmatia
2014 University of Chile
2013 Australia Telescope National Facility, CSIRO
2012 Microsoft Research Archived 2018-10-22 at the Wayback Machine
2011 Sorrento, Italy
2010 Caltech Archived 2018-10-22 at the Wayback Machine

Additional conferences and conference lists:

ItemLink
Machine Learning in Astronomy: Possibilities and Pitfalls (2022)
The Astrostatistics and Astroinformatics Portal (ASAIP) big list of conferences
Astronomical Data Analysis Software and Systems (ADASS) annual conferences

See also

Related Research Articles

Infrared astronomy is a sub-discipline of astronomy which specializes in the observation and analysis of astronomical objects using infrared (IR) radiation. The wavelength of infrared light ranges from 0.75 to 300 micrometers, and falls in between visible radiation, which ranges from 380 to 750 nanometers, and submillimeter waves.

<span class="mw-page-title-main">Messier 87</span> Elliptical galaxy in the Virgo Galaxy Cluster

Messier 87 is a supergiant elliptical galaxy in the constellation Virgo that contains several trillion stars. One of the largest and most massive galaxies in the local universe, it has a large population of globular clusters—about 15,000 compared with the 150–200 orbiting the Milky Way—and a jet of energetic plasma that originates at the core and extends at least 1,500 parsecs, traveling at a relativistic speed. It is one of the brightest radio sources in the sky and a popular target for both amateur and professional astronomers.

<span class="mw-page-title-main">Palomar Observatory</span> Astronomical observatory in Southern California

Palomar Observatory is an astronomical research observatory in the Palomar Mountains of San Diego County, California, United States. It is owned and operated by the California Institute of Technology (Caltech). Research time at the observatory is granted to Caltech and its research partners, which include the Jet Propulsion Laboratory (JPL), Yale University, and the National Astronomical Observatories of China.

<span class="mw-page-title-main">2MASS</span> Astronomical survey of the whole sky in the infrared

The Two Micron All-Sky Survey, or 2MASS, was an astronomical survey of the whole sky in infrared light. It took place between 1997 and 2001, in two different locations: at the U.S. Fred Lawrence Whipple Observatory on Mount Hopkins, Arizona, and at the Cerro Tololo Inter-American Observatory in Chile, each using a 1.3-meter telescope for the Northern and Southern Hemisphere, respectively. It was conducted in the short-wavelength infrared at three distinct frequency bands near 2 micrometres, from which the photometric survey with its HgCdTe detectors derives its name.

<span class="mw-page-title-main">NGC 6397</span> Globular cluster of stars in the Milky Way

NGC 6397 is a globular cluster in the constellation Ara that was discovered by French astronomer Nicolas-Louis de Lacaille in 1752. It is located about 7,800 light-years from Earth, making it one of the two nearest globular clusters to Earth. The cluster contains around 400,000 stars, and can be seen with the naked eye under good observing conditions.

<span class="mw-page-title-main">Giant Metrewave Radio Telescope</span> Radio telescope center

The Giant Metrewave Radio Telescope (GMRT), located near Narayangaon, Pune in India, is an array of thirty fully steerable parabolic radio telescopes of 45 metre diameter, observing at metre wavelengths. It is the largest and most sensitive radio telescope array in the world at low frequencies. It is operated by the National Centre for Radio Astrophysics (NCRA), a part of the Tata Institute of Fundamental Research, Mumbai. It was conceived and built under the direction of Govind Swarup during 1984 to 1996. It is an interferometric array with baselines of up to 25 kilometres (16 mi). It was recently upgraded with new receivers, after which it is also known as the upgraded Giant Metrewave Radio Telescope (uGMRT).

<span class="mw-page-title-main">47 Tucanae</span> Globular cluster in the constellation Tucana

47 Tucanae or 47 Tuc is a globular cluster located in the constellation Tucana. It is about 4.45 ± 0.01 kpc (15,000 ± 33 ly) from Earth, and 120 light years in diameter. 47 Tuc can be seen with the naked eye, with an apparent magnitude of 4.1. It appears about 44 arcminutes across including its far outreaches. Due to its far southern location, 18° from the south celestial pole, it was not catalogued by European astronomers until the 1750s, when the cluster was first identified by Nicolas-Louis de Lacaille from South Africa.

<span class="mw-page-title-main">Atacama Cosmology Telescope</span> Telescope in the Atacama Desert, northern Chile

The Atacama Cosmology Telescope (ACT) was a cosmological millimeter-wave telescope located on Cerro Toco in the Atacama Desert in the north of Chile. ACT made high-sensitivity, arcminute resolution, microwave-wavelength surveys of the sky in order to study the cosmic microwave background radiation (CMB), the relic radiation left by the Big Bang process. Located 40 km from San Pedro de Atacama, at an altitude of 5,190 metres (17,030 ft), it was one of the highest ground-based telescopes in the world.

<span class="mw-page-title-main">Vera C. Rubin Observatory</span> Astronomical observatory in Chile

The Vera C. Rubin Observatory, formerly known as the Large Synoptic Survey Telescope (LSST), is an astronomical observatory under construction in Chile. Its main task will be carrying out a synoptic astronomical survey, the Legacy Survey of Space and Time. The word "synoptic" is derived from the Greek words σύν and ὄψις, and describes observations that give a broad view of a subject at a particular time. The observatory is located on the El Peñón peak of Cerro Pachón, a 2,682-meter-high mountain in Coquimbo Region, in northern Chile, alongside the existing Gemini South and Southern Astrophysical Research Telescopes. The LSST Base Facility is located about 100 kilometres (62 mi) away from the observatory by road, in the town of La Serena. The observatory is named for Vera Rubin, an American astronomer who pioneered discoveries about galaxy rotation rates.

<span class="mw-page-title-main">Galaxy Zoo</span> Crowdsourced astronomy project

Galaxy Zoo is a crowdsourced astronomy project which invites people to assist in the morphological classification of large numbers of galaxies. It is an example of citizen science as it enlists the help of members of the public to help in scientific research.

Stanislav George Djorgovski is an American scientist and scholar. He obtained his B.A. in astrophysics in 1979 at the University of Belgrade. After receiving his PhD in astronomy from U.C. Berkeley in 1985, he was a Harvard Junior Fellow until 1987 when he joined the faculty at the California Institute of Technology, where he is currently a professor of astronomy and data science.

<span class="mw-page-title-main">TOPCAT (software)</span> Graphical viewer of tabular data mainly used in astronomical applications

TOPCAT is an interactive graphical viewer and editor for tabular data. Although a general purpose tool capable of handling large and sparse datasets with correlation functionality its specialist application area is astronomy and it was initially designed to support virtual observatories. It is able to handle several digital file formats including FITS which is in common use in astronomy. The Acronym TOPCAT derives from Tool for OPerations on Catalogues And Tables.

<span class="mw-page-title-main">Time-domain astronomy</span> Study of how astronomical objects change with time

Time-domain astronomy is the study of how astronomical objects change with time. Though the study may be said to begin with Galileo's Letters on Sunspots, the term now refers especially to variable objects beyond the Solar System. Changes over time may be due to movements or changes in the object itself. Common targets included are supernovae, pulsating stars, novas, flare stars, blazars and active galactic nuclei. Visible light time domain studies include OGLE, HAT-South, PanSTARRS, SkyMapper, ASAS, WASP, CRTS, GOTO and in a near future the LSST at the Vera C. Rubin Observatory.

<span class="mw-page-title-main">Warrick Couch</span> Australian astronomer

Warrick John Couch is an Australian professional astronomer. He is currently a professor at Swinburne University of Technology in Melbourne. He was previously the Director of Australia's largest optical observatory, the Australian Astronomical Observatory (AAO). He was also the president of the Australian Institute of Physics (2015–2017), and a non-executive director on the Board of the Giant Magellan Telescope Organization. He was a founding non-executive director of Astronomy Australia Limited.

<span class="mw-page-title-main">Amanda Bauer</span> American astronomer and science communicator (born 1979)

Amanda Elaine Bauer is an American professional astronomer and science communicator. She is the Deputy Director and Head of Science and Education at Yerkes Observatory in Williams Bay, Wisconsin. She was previously based in Tucson, Arizona, working as Head of Education and Public Outreach at the Large Synoptic Survey Telescope. From 2013 to 2016 she was a Research Astronomer at the Australian Astronomical Observatory (AAO). Her principal field of research concerns how galaxies form, how they create new stars, and particularly why they suddenly stop creating new stars.

<span class="mw-page-title-main">Laura Ferrarese</span> Italian astrophysicist

Laura Ferrarese is a researcher in space science at the National Research Council of Canada. Her primary work has been performed using data from the Hubble Space Telescope and the Canada-France-Hawaii Telescope.

<span class="mw-page-title-main">Radio Galaxy Zoo</span> Citizen science project

Radio Galaxy Zoo (RGZ) is an internet crowdsourced citizen science project that seeks to locate supermassive black holes in distant galaxies. It is hosted by the web portal Zooniverse. The scientific team want to identify black hole/jet pairs and associate them with the host galaxies. Using a large number of classifications provided by citizen scientists they hope to build a more complete picture of black holes at various stages and their origin. It was initiated in 2010 by Ray Norris in collaboration with the Zooniverse team, and was driven by the need to cross-identify the millions of extragalactic radio sources that will be discovered by the forthcoming Evolutionary Map of the Universe survey. RGZ is now led by scientists Julie Banfield and Ivy Wong. RGZ started operations on 17 December 2013, and ceased collecting new classifications on 1 May 2019.

<span class="mw-page-title-main">Kim Venn</span>

Kim A. Venn is a professor of physics and astronomy at the University of Victoria, Canada, and director of the university's Astronomy Research Centre. She researches the chemo-dynamical analysis of stars in the galaxy and its nearby dwarf satellites.

<span class="mw-page-title-main">Super-pressure Balloon-borne Imaging Telescope</span>

The Super-pressure Balloon-borne Imaging Telescope (SuperBIT) is a highly stabilized, high-resolution telescope that operates in the stratosphere via NASA's superpressure balloon (SPB) system. At 40 km altitude above sea level, the football-stadium-sized balloon carries SuperBIT to a suborbital environment above 99.2% of the Earth's atmosphere in order to obtain space-quality imaging. As a research instrument, SuperBIT's primary science goal is to provide insight into the distribution of dark matter in galaxy clusters and throughout the large-scale structure of the universe. As demonstrated by numerous test flights, the survey data generated by SuperBIT is expected to have similar quality and data collection efficiency as the Hubble Space Telescope while complementing surveys from other up-and-coming observatories such as the James Webb Space Telescope (JWST), the Vera C. Rubin Observatory, and the Nancy Grace Roman Space Telescope.

References

  1. "Largest Galaxy Proto-Supercluster Found - Astronomers using ESO's Very Large Telescope uncover a cosmic titan lurking in the early Universe". www.eso.org. Retrieved 18 October 2018.
  2. 1 2 3 Borne, Kirk D. (12 May 2010). "Astroinformatics: data-oriented astronomy research and education". Earth Science Informatics. 3 (1–2): 5–17. doi:10.1007/s12145-010-0055-2. S2CID   207393013.
  3. 1 2 Astroinformatics and digitization of astronomical heritage Archived 2017-12-26 at the Wayback Machine , Nikolay Kirov. The fifth SEEDI International Conference Digitization of cultural and scientific heritage, May 19–20, 2010, Sarajevo. Retrieved 1 November 2012.
  4. Borne, Kirk (2000). "Science User Scenarios for a Virtual Observatory Design Reference Mission: Science Requirements for Data Mining". arXiv: astro-ph/0008307 .
  5. Borne, Kirk (2008). "Scientific Data Mining in Astronomy". In Kargupta, Hillol; et al. (eds.). Next generation of data mining. London: CRC Press. pp. 91–114. ISBN   9781420085860.
  6. Borne, Kirk D (2003). "Distributed data mining in the National Virtual Observatory". In Dasarathy, Belur V (ed.). Data Mining and Knowledge Discovery: Theory, Tools, and Technology V. Vol. 5098. pp. 211–218. doi:10.1117/12.487536. S2CID   28195520.
  7. 1 2 Borne, Kirk (2013). "Virtual Observatories, Data Mining, and Astroinformatics". Planets, Stars and Stellar Systems. pp. 403–443. doi:10.1007/978-94-007-5618-2_9. ISBN   978-94-007-5617-5.
  8. Laurino, O.; D’Abrusco, R.; Longo, G.; Riccio, G. (21 December 2011). "Astroinformatics of galaxies and quasars: a new general method for photometric redshifts estimation". Monthly Notices of the Royal Astronomical Society. 418 (4): 2165–2195. arXiv: 1107.3160 . Bibcode:2011MNRAS.418.2165L. doi: 10.1111/j.1365-2966.2011.19416.x . S2CID   7115554.
  9. Borne, Kirk (2009). "Astroinformatics: A 21st Century Approach to Astronomy". Astro2010: The Astronomy and Astrophysics Decadal Survey. 2010: P6. arXiv: 0909.3892 . Bibcode:2009astro2010P...6B.
  10. "Online Science". Talks by Jim Gray. Microsoft Research. Retrieved 11 January 2015.
  11. "Jim Gray eScience Award". Microsoft Research.
  12. Astroinformatics in Canada, Nicholas M. Ball, David Schade. Retrieved 1 November 2012.
  13. "'Astroinformatics' helps Astronomers explore the sky". Phys.org. Heidelberg University. Retrieved 11 January 2015.
  14. Hey, Tony (October 2009). "The Fourth Paradigm: Data-Intensive Scientific Discovery". Microsoft Research.
  15. Ball, N.M.; Brunner, R.J. (2010). "Data Mining and Machine Learning in Astronomy". International Journal of Modern Physics D. 19 (7): 1049–1106. arXiv: 0906.2173 . Bibcode:2010IJMPD..19.1049B. doi:10.1142/S0218271810017160. S2CID   119277652.
  16. Borne, K; Becla, J; Davidson, I; Szalay, A; Tyson, J. A; Bailer-Jones, Coryn A.L (2008). "The LSST Data Mining Research Agenda". AIP Conference Proceedings. pp. 347–351. arXiv: 0811.0167 . doi:10.1063/1.3059074. S2CID   118399971.
  17. Ivezić, Ž; Axelrod, T; Becker, A. C; Becla, J; Borne, K; Burke, D. L; Claver, C. F; Cook, K. H; Connolly, A; Gilmore, D. K; Jones, R. L; Jurić, M; Kahn, S. M; Lim, K.-T; Lupton, R. H; Monet, D. G; Pinto, P. A; Sesar, B; Stubbs, C. W; Tyson, J. A; Bailer-Jones, Coryn A.L (2008). "Parametrization and Classification of 20 Billion LSST Objects: Lessons from SDSS". AIP Conference Proceedings. Vol. 1082. pp. 359–365. arXiv: 0810.5155 . doi:10.1063/1.3059076. S2CID   117914490.{{cite book}}: |journal= ignored (help)
  18. Borne, Kirk. "Collaborative Annotation for Scientific Data Discovery and Reuse". Bulletin of the ASIS&T. American Society for Information Science and Technology. Archived from the original on 5 March 2016. Retrieved 11 January 2016.
  19. "Zooniverse". www.zooniverse.org. Retrieved 2024-05-10.
  20. Cavanagh, Mitchell K.; Bekki, Kenji; Groves, Brent A. (2021-07-08). "Morphological classification of galaxies with deep learning: comparing 3-way and 4-way CNNs". Monthly Notices of the Royal Astronomical Society. 506 (1): 659–676. arXiv: 2106.01571 . doi: 10.1093/mnras/stab1552 . ISSN   0035-8711.
  21. Goyal, Lalit Mohan; Arora, Maanak; Pandey, Tushar; Mittal, Mamta (2020-12-01). "Morphological classification of galaxies using Conv-nets". Earth Science Informatics. 13 (4): 1427–1436. doi:10.1007/s12145-020-00526-w. ISSN   1865-0481.
  22. 1 2 "Sloan Digital Sky Survey-V: Pioneering Panoptic Spectroscopy - SDSS-V" . Retrieved 2024-05-10.
  23. Pati, Satavisa (2021-06-18). "How Data Science is Used in Astronomy?". Analytics Insight. Retrieved 2024-05-10.
  24. Baron, Dalya (2019-04-15), Machine Learning in Astronomy: a practical overview, arXiv: 1904.07248
  25. Borne, Kirk. "Astroinformatics in a Nutshell". asaip.psu.edu. The Astrostatistics and Astroinformatics Portal, Penn State University. Retrieved 11 January 2016.
  26. Feigelson, Eric. "Astrostatistics in a Nutshell". asaip.psu.edu. The Astrostatistics and Astroinformatics Portal, Penn State University. Retrieved 11 January 2016.
  27. Feigelson, E.; Ivezić, Ž.; Hilbe, J.; Borne, K. (2013). "New Organizations to Support Astroinformatics and Astrostatistics". Astronomical Data Analysis Software and Systems Xxii. 475: 15. arXiv: 1301.3069 . Bibcode:2013ASPC..475...15F.
  28. Borne, Kirk (2009). "The Revolution in Astronomy Education: Data Science for the Masses". Astro2010: The Astronomy and Astrophysics Decadal Survey. 2010: P7. arXiv: 0909.3895 . Bibcode:2009astro2010P...7B.
  29. "Using Data in the Classroom". Science Education Resource Center at Carleton College. National Science Digital Library. Retrieved 11 January 2016.
  30. Borne, Kirk. Astroinformatics: Data-Oriented Astronomy (PDF). George Mason University, USA. Retrieved January 21, 2015.
  31. 1 2 Zhang, Yanxia; Zhao, Yongheng (2015-05-22). "Astronomy in the Big Data Era". Data Science Journal. 14: 11. Bibcode:2015DatSJ..14...11Z. doi: 10.5334/dsj-2015-011 . ISSN   1683-1470.
  32. "The Palomar Digital Sky Survey (DPOSS)". sites.astro.caltech.edu. Retrieved 2024-05-10.
  33. "IRSA - Two Micron All Sky Survey (2MASS)". irsa.ipac.caltech.edu. Retrieved 2024-05-10.
  34. "GBT". Green Bank Observatory. 2023-06-26. Retrieved 2024-05-10.
  35. "GALEX - Galaxy Evolution Explorer". www.galex.caltech.edu. Retrieved 2024-05-10.
  36. "SkyMapper Southern Sky Survey". skymapper.anu.edu.au. Retrieved 2024-05-10.
  37. "Pan-STARRS1 data archive home page - PS1 Public Archive - STScI Outerspace". outerspace.stsci.edu. Retrieved 2024-05-10.
  38. Telescope, Large Synoptic Survey. "Rubin Observatory". Rubin Observatory. Retrieved 2024-05-10.
  39. "Explore | SKAO". www.skao.int. Retrieved 2024-05-10.
  40. Chowdhury, Shovan; Schoen, Marco P. (2020-10-02). "Research Paper Classification using Supervised Machine Learning Techniques". 2020 Intermountain Engineering, Technology and Computing (IETC). IEEE. pp. 1–6. doi:10.1109/IETC47856.2020.9249211. ISBN   978-1-7281-4291-3.
  41. Sarstedt, Marko; Mooi, Erik (2014), Sarstedt, Marko; Mooi, Erik (eds.), "Regression Analysis", A Concise Guide to Market Research: The Process, Data, and Methods Using IBM SPSS Statistics, Berlin, Heidelberg: Springer, pp. 193–233, doi:10.1007/978-3-642-53965-7_7, ISBN   978-3-642-53965-7 , retrieved 2024-05-10
  42. "Bulletin de la Société Royale des Sciences de Liège | PoPuPS". Bulletin de la Société Royale des Sciences de Liège (in French). ISSN   0037-9565.
  43. Bindra, Kamalpreet; Mishra, Anuranjan (September 2017). "A detailed study of clustering algorithms". 2017 6th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO). IEEE. pp. 371–376. doi:10.1109/ICRITO.2017.8342454. ISBN   978-1-5090-3012-5.
  44. Pizzuti, C.; Talia, D. (May 2003). "P-autoclass: scalable parallel clustering for mining large data sets". IEEE Transactions on Knowledge and Data Engineering. 15 (3): 629–641. doi:10.1109/TKDE.2003.1198395. ISSN   1041-4347.
  45. Thudumu, Srikanth; Branch, Philip; Jin, Jiong; Singh, Jugdutt (Jack) (2020-07-02). "A comprehensive survey of anomaly detection techniques for high dimensional big data". Journal of Big Data. 7 (1): 42. doi: 10.1186/s40537-020-00320-x . hdl: 10536/DRO/DU:30158643 . ISSN   2196-1115.
  46. Weiner, Irving B., ed. (2003-04-15). Handbook of Psychology (1 ed.). Wiley. doi:10.1002/0471264385.wei0223. ISBN   978-0-471-17669-5.