Target (project)

Target
Location: University of Groningen, Netherlands
Established: January 2009
Funding: European Fund for Regional Development & partners
Website: rug.nl/target

Target is a collaborative research project specialising in big data processing and management in the northern Netherlands. [1] It is a public-private cooperation, initiated in 2009 and supported by government subsidies. [2] It is run by a consortium of ten academic and computer industry partners, coordinated by the University of Groningen, and researches data management for science projects in the areas of astronomy, life sciences, artificial intelligence and medical diagnosis.

Cooperating in the Target project are various divisions of the University of Groningen, its medical center, IBM, Oracle, ASTRON and Dutch IT firms Elkoog/Heeii and Nspyre.

Target's computer center is hosted by the Center for Information Technology, the computing center of the University of Groningen, and consists of more than 10 petabytes of storage [3] [4] based on IBM's GPFS storage technology, [5] a high-performance computing cluster and a grid cluster that is part of the European Grid Infrastructure.

The Target data center is hosted by the Donald Smits Center for Information Technology located at the University of Groningen, The Netherlands

History

The project was initiated to transfer astronomers' expertise in massive data processing to other areas of science. Target builds on a distributed computing environment called Astro-WISE. [6] [7] Astro-WISE itself originated as an initiative of the OPTICON Wide Field Imaging Working Group, which was set up to consider a standardised European survey system to facilitate research, data reduction and data mining using data from the new generation of wide field survey cameras. [8] The Target project launched in 2009 after receiving 32 million euros [9] of funding for a period of five years from the European Fund for Regional Development, the Dutch Ministry of Economic Affairs ("Pieken in de Delta" project), and the provinces of Groningen and Drenthe. The project runs under the auspices of the Northern Netherlands Provinces Alliance (SNN) and the Groningen municipality.

Technological findings

At the start of the project, one aim was to develop a single integrated processing system, consisting of a multi-petabyte scale file system [10] and several different types of grid and compute clusters. [11] During the first years it became apparent that the requirements of the various e-Science disciplines differ. In some areas, a massive data streaming effort takes place, as in LOFAR. In astronomy, the number of data objects may run in the billions, with a limited number of data columns. [12] In genomics, the number of rows is small, but the number of columns can be huge, in the hundreds of thousands. Other areas, such as visual text retrieval in the Monk search engine for historical manuscripts, occupy an intermediate position, with hundreds of millions of rows and thousands of dimensions. Furthermore, genomics applications often require stringent access control, [13] whereas other disciplines have no privacy issues. Consequently, the various sub-projects within Target took a pragmatic approach in deciding which aspects of the WISE technology and which components of the Target hardware infrastructure were applicable to their field.
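
The contrast in table shapes described above can be made concrete with a rough sketch. The row and column counts below are illustrative placeholders chosen only to match the orders of magnitude mentioned in the text (the exact figures, the thresholds and the `TableShape` class are assumptions made for this example and are not part of any Target or Astro-WISE software).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TableShape:
    """Hypothetical description of a data set as rows x columns."""
    name: str
    rows: int
    columns: int

    def cells(self) -> int:
        # Number of values if the table were stored densely.
        return self.rows * self.columns

    def aspect(self) -> str:
        # Classify by the rows-to-columns ratio; thresholds are arbitrary
        # cut-offs picked for illustration only.
        ratio = self.rows / self.columns
        if ratio >= 1_000_000:
            return "tall"          # many objects, few attributes
        if ratio <= 1:
            return "wide"          # few samples, many attributes
        return "intermediate"


# Placeholder sizes, not measured values from the project.
EXAMPLES = [
    TableShape("astronomy catalogue", rows=2_000_000_000, columns=50),
    TableShape("genomics matrix", rows=1_000, columns=500_000),
    TableShape("handwriting features", rows=300_000_000, columns=4_000),
]

if __name__ == "__main__":
    for shape in EXAMPLES:
        print(f"{shape.name}: {shape.aspect()}, {shape.cells():,} cells")
```

Under these assumed sizes the three data sets fall into different classes, which illustrates why a single storage and processing layout is unlikely to suit every discipline equally well.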

Projects

Target participates in a number of data-intensive scientific projects in astronomy, Big Data visualization (collaboration with the eScience center in Amsterdam), [14] handwritten text recognition algorithms, medical research on healthy aging, development of diagnostic tools for Parkinson's disease and more.

LOFAR Long-term Archive

Target has developed and maintains the LOFAR Long-term Archive.

Much of the data from the LOFAR telescope is stored in, accessed from and archived on the LOFAR Long-term Archive, designed by ASTRON and Target. [15] [16] [17] The data will be hosted at the Target data center and several other European centers.

Monk

A screenshot of a page from the Archive of the Cabinet of the Dutch Queen (KdK) on which the word Groningen has been found by Monk.

Monk is a system developed by Schomaker and his group at the Artificial Intelligence Institute (ALICE) at the University of Groningen. It uses pattern-recognition and machine-learning algorithms for handwritten text recognition in a variety of existing archives. [18] [19] [20] Currently a number of books from the Dutch National Archives, as well as more than 70 international historical collections ranging from Western medieval to handwritten Chinese manuscripts, have been ingested into Monk. The system applies continuous ('24/7') machine learning over the internet, yielding research results on handwritten word-image retrieval. [21] The Monk system employs the computational and storage resources of Target. It recently became part of a collaboration led by Prof. Popović from the Department of Theology and Religious Studies at the University of Groningen, who will use a combination of carbon dating, paleography and text/image recognition techniques to try to pinpoint the authors of the Dead Sea Scrolls manuscripts. [22]

LifeLines

LifeLines is a long-term medical research project run by the University Medical Center Groningen (UMCG). An array of genotype and phenotype data will be gathered from 165,000 people once every five years for a total period of thirty years. The accumulated data will be used by researchers and medical specialists to gain insights into the processes related to aging and to understand why age-related health degradation varies so widely. [23] Target provides LifeLines with the infrastructure for data storage, access and processing. Data from LifeLines, as well as the SURFsara and Target infrastructure, were used in the Genome of the Netherlands project, run by a consortium of the UMCG, LUMC, Erasmus MC, UMCU and the Free University of Amsterdam. Results from the project, which used whole-genome sequencing to deduce the population structure and demographic history of the Dutch population, were published in June 2014 in the journal Nature Genetics. [24] [25]

GLIMPS

Run by K. Leenders, a professor of neurology at the UMCG, GLIMPS is a research project that aims to find faster and more reliable diagnostic tools for Parkinson's disease. [26] GLIMPS explores the possibilities of using complex image-based algorithms and PET scans for early detection of Parkinson's. To test the effectiveness of such algorithms, GLIMPS is building a large database of PET scans delivered by numerous hospitals in the Netherlands. Target is responsible for building and maintaining the GLIMPS database as well as ensuring the smooth running of the image-based algorithms on its computing facilities.

Others

Additionally, Target is involved in the data management for other astronomical projects such as the KiDS/VIKING astronomical surveys [27] [28] using OmegaCAM, [29] [30] the ESO's MUSE [31] instrument (mounted on the Very Large Telescope) and MICADO (to be mounted on the E-ELT). In addition, the data-centric approach to data management prompted by Target has been adopted by the ESA's Euclid mission. [32] The project's spin-off company Target Holding B.V. also manages a number of commercial projects with private businesses in the north of the Netherlands. [33]

Public outreach and education are also part of the project remit, and Target has organised many public events. [34] The Infoversum 3D theatre [35] [36] is a spin-off of the Target project and provides a facility for the visualisation and explanation of scientific data for large groups.

References

  1. "Nederlands project kan 1,5 petabytes verwerken en opslaan" [Netherlands Project can process and save 1.5 petabytes]. nu.nl (in Dutch). 2 September 2010.
  2. Edelman, Pieter (June 2, 2009). "Miljoenensubsidie voor Noord-Nederlands dataminingprogramma" [Millions in subsidy for North Netherlands datamining program]. Bits & Chips.
  3. Witold, Kepinski (19 November 2010). "Gronings ICT-project klaar voor petabytes data" [Groningen ICT project ready for petabytes of data]. Computable (in Dutch).
  4. Huisman, Zander (27 May 2011). "Aprycus breidt capaciteit Target-project uit" [Aprycus increases the capacity of the Target project]. Computable (in Dutch).
  5. IBM Case Studies, Showcasing IBM client stories. "University of Groningen breaks new ground in scientific research". www-03.ibm.com/software/businesscasestudies/us/en/corp?OpenDocument&Site=corp&cty=en_us. Archived from the original on November 12, 2014.
  6. Begeman, Kor; A.N.Belikov; D.R. Boxhoorn; E.A.Valentijn (January 2013). "The Astro-WISE data centric information system". Experimental Astronomy. 35 (1–2): 1. arXiv: 1208.0447 . Bibcode:2013ExA....35....1B. doi:10.1007/s10686-012-9311-4. S2CID   118620001.
  7. Mwebaze, Johnson (2012). Extreme Data Lineage in Ad-hoc Astronomical Data Processing (PhD). University of Groningen. ISBN   9789036757591.
  8. Gilmore, Gerald (January 2001). "OPTICON: EC Optical Infrared Coordination Network for Astronomy". Organizations and Strategies in Astronomy II. Astrophysics and Space Science Library. Vol. 2. pp. 83–102. arXiv: astro-ph/0105562. Bibcode:2001ASSL..266...83G. doi:10.1007/978-94-010-0666-8_6. ISBN 978-0-7923-7172-4. S2CID 34032027.
  9. Edelman, Peter (20 July 2009). "Miljoenensubsidie voor Noord Nederland Dataminingprogram" [Millions in subsidy for North Netherlands Datamining Programme]. Bits and Chips (in Dutch). p. 45.
  10. Jain, R; Sarkar, P.; Subhraveti, D. (2013), "GPFS-SNC: An enterprise cluster file system for Big Data", IBM Journal of Research and Development, 57 (3/4): 5:1, 5:10, doi:10.1147/JRD.2013.2243531
  11. Begeman, Kor; Begeman, K. G.; Belikov, A. N.; Boxhoorn, D. R.; Dijkstra, F.; Valentijn, E. A.; Vriend, W. J.; Zhao, Z. (2010), "Merging grid technologies", Journal of Grid Computing, 8 (2): 199–221, doi: 10.1007/s10723-010-9147-y
  12. Luri, X.; Palmer, M.; Arenou, F.; Masana, E.; De Bruijne, J.; Antiche, E.; Babusiaux, C.; Borrachero, R.; Sartoretti, P.; Julbe, F.; Isasi, Y.; Martinez, O.; Robin, A. C.; Reylé, C.; Jordi, C.; Carrasco, J. M. (2014). "Overview and stellar statistics of the expected GaiaCatalogue using the GaiaObject Generator". Astronomy & Astrophysics. 566: A119. arXiv: 1404.5861 . Bibcode:2014A&A...566A.119L. doi:10.1051/0004-6361/201423636. S2CID   119233012.
  13. Malin, B.; Loukides, G.; Benitez, K.; Clayton, E. W. (2011). "Identifiability in biobanks: Models, measures, and mitigation strategies". Human Genetics. 130 (3): 383–392. doi:10.1007/s00439-011-1042-5. PMC   3621020 . PMID   21739176.
  14. "eScience Center Projects". esciencecenter.nl.
  15. Belikov, A.N; Begeman, K.; Boxhoorn, D.R.; Dijkstra, F.; Holties, H.; Meyer-Zhao, Z.; Renting, G.A.; Valentijn, E.A.; Vriend, W-J. (2011). "LOFAR information system". Proc. Of ADASS XXI, ASP Conf. Series. 461: 693. arXiv: 1111.6443 . Bibcode:2012ASPC..461..693B.
  16. Begeman, K.; Boxhoorn, D.; Dijkstra, F.; Holties, H.A.; Vriend, W.-J (2011). "Target for LOFAR Long Term Archive: Architecture and Implementation". Future Generation Computer Systems. 27 (3): 319–328. doi:10.1016/j.future.2010.08.010.
  17. Holties, H.A; van Diepen, G.; et al. (2012). "Data Analysis in the LOFAR Long Term Archive". ASP Conference Series. 461, Astronomical Data Analysis Software and Systems XXI: 536. Bibcode:2012ASPC..461..635H.
  18. van der Zant, T; Schomaker, L.R.B.; Zinger, S.; van Schie, H. (2009). "Where are the Search Engines for Handwritten Documents?". Interdisciplinary Science Reviews. 34 (2–3): 224–235. Bibcode:2009ISRv...34..224V. doi:10.1179/174327909X441126. S2CID   57037481.
  19. van der Zant, T; Schomaker, L.R.B.; Valentijn, E.; B.A. Yanikoglu; K. Berkner (January 28, 2008). "Large scale parallel document image processing". In Yanikoglu, Berrin A; Berkner, Kathrin (eds.). Document Recognition and Retrieval XV. Vol. 6815. p. 68150N. Bibcode:2008SPIE.6815E..0SV. doi:10.1117/12.765482. S2CID   40083465.
  20. Schomaker, L.R.B.; B.A. Yanikoglu; K. Berkner (January 28, 2008). "Word mining in a sparsely labeled handwritten collection". In Yanikoglu, Berrin A.; Berkner, Kathrin (eds.). Document Recognition and Retrieval XV. Vol. 6815. pp. 6815–6823. Bibcode:2008SPIE.6815E..0NS. doi:10.1117/12.766329. S2CID 15863576. van der Zant, T; Schomaker, L.R.B.; Haak, K. (2008). "Handwritten-word spotting using biologically inspired features". IEEE Transactions on Pattern Analysis and Machine Intelligence. 30 (11): 1945–1957. doi:10.1109/tpami.2008.144. PMID 18787243. S2CID 18702898.
  21. van Oosten, J.-P.; Schomaker, L.R.B. (2014). "Separability versus Prototypicality in Handwritten Word-Image Retrieval" (PDF). Pattern Recognition. 47 (3): 1031–1038. Bibcode:2014PatRe..47.1031V. doi:10.1016/j.patcog.2013.09.006. hdl:11370/ca3dd93d-fbd9-4ae3-bfac-90bb148ea1dd. S2CID   215715932.
  22. RUG News (11 December 2014). "Prestigious ERC Starting Grant for Mladen Popović". Rijksuniversiteit Groningen. Retrieved 19 January 2015.
  23. Stolk, Ronald P; Rosmalen JG; Postma DS; de Boer RA; Navis G; Slaets JP; Ormel J; Wolffenbuttel BH (January 2008). "Universal risk factors for multifactorial diseases: LifeLines: a three-generation population-based study". European Journal of Epidemiology. 23 (1): 67–74. doi: 10.1007/s10654-007-9204-4 . PMID   18075776.
  24. Francioli, Laurent; Menelaou, Androniki; et al. (29 June 2014). "Whole-genome sequence variation, population structure and demographic history of the Dutch population". Nature Genetics. 46 (8): 818–825. doi:10.1038/ng.3021. PMID   24974849. S2CID   14316238.
  25. van Wijngaarden, Arend (June 30, 2014). "Genoom Nederlandse volk ontrafeld". Dagblad van het Noorden.
  26. Teune, Laura Klaaske (2013). Glucose metabolic patterns in neurodegenerative brain diseases (Thesis fully internal (DIV)). PhD Dissertation.
  27. Jong, J.T.; J.T. de Jong; G.A.V. Kleijn; K.H. Kuijken; E.A. Valentijn (2013). "The kilo-degree survey". Experimental Astronomy. 35 (1–2): 25–44. arXiv: 1206.1254 . Bibcode:2013ExA....35...25D. doi:10.1007/s10686-012-9306-1. S2CID   3082399.
  28. Netherlands Research School for Astronomy (NOVA). "NOVA Annual Report 2011-2011-2012" (PDF). nova-astronomy. NOVA.
  29. Schilling, Govert (8 June 2011). "Direct succes met supercamera" [Immediate success with super camera]. De Volkskrant (in Dutch).
  30. "Superscherpe hemelfotos vanaf Chileense berg" [Super sharp photos of the heavens from Chilean mountain]. NRC Handelsblad (in Dutch). 9 June 2011.
  31. Weilbacher, Peter; Peter M. Weilbacher; Ole Streicher; Tanya Urrutia; Aurélien Jarno; Arlette Pécontal-Rousset; Roland Bacon; Petra Böhm (September 2012). "Design and capabilities of the MUSE data reduction software and pipeline". In Radziwill, Nicole M; Chiozzi, Gianluca (eds.). Software and Cyberinfrastructure for Astronomy II. Vol. 8451. pp. 84510B. Bibcode:2012SPIE.8451E..0BW. doi:10.1117/12.925114. S2CID   120263291.
  32. Pasian, Fabio; Fabio Pasian; John Hoar; Marc Sauvage; Christophe Dabin; Maurice Poncet; Oriana Mansutti (September 2012). "Science ground segment for the ESA Euclid Mission". In Radziwill, Nicole M; Chiozzi, Gianluca (eds.). Software and Cyberinfrastructure for Astronomy II. Vol. 8451. p. 845104. Bibcode:2012SPIE.8451E..04P. doi:10.1117/12.926026. S2CID   123008296.
  33. Huisman, Zander (31 January 2013). "Target-project levert vier nieuwe bedrijven op" [Target project delivers four new companies]. Computable (in Dutch).
  34. Wind, Maike (12 May 2014). "Rapportage Big Data: De man met één been en vijf kinderen" [Big Data Report: The man with one leg and five children]. Dagblad van het Noorden (in Dutch). p. 2.
  35. Wind, Maike (19 June 2014). "Theater Infoversum open voor publiek" [Infoversum Theatre Opens to the public]. Groninger Gezinsbode (in Dutch). p. 19.
  36. "Infoversum". De Telegraaf (in Dutch). 4 July 2014.