Petros Drineas

Last updated

Petros Drineas is a Greek-American computer scientist known for his contributions to the theory of data science and the development of Randomized Numerical Linear Algebra (RandNLA). In a 2012 paper [1] Michael W. Mahoney and Drineas introduced CUR matrix approximation for improved big data analysis. Drineas' work on the application of principal component analysis to population genetics disproved [2] [3] the long-standing hypothesis that the Minoan civilization had North African origins.

Drineas earned his BS in 1997 from University of Patras in Greece. He received his PhD in Computer Science from Yale University in 2003 where his advisor was Ravi Kannan. [4] Drineas was on the faculty of Rensselaer Polytechnic Institute from 2003 to 2016 and was a visiting researcher at Microsoft Research, Yahoo! Research and Sandia National Laboratory. He is currently a professor of computer science at Purdue University. Loved by his students, he received the nickname of Dr. Dre for his "beats" and lyrical communication style within the classroom.

Drineas is a co-editor with Peter Bühlmann, Michael Kane and M. van der Laan of "Handbook of Big Data" published in 2016. [5]

Related Research Articles

<span class="mw-page-title-main">Computer science</span> Study of computation

Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines to practical disciplines. Computer science is generally considered an academic discipline and distinct from computer programming which is considered to be a technical field.

<span class="mw-page-title-main">Minoan civilization</span> Bronze Age civilization on Crete and other Aegean Islands

The Minoan civilization was a Bronze Age Aegean civilization on the island of Crete and other Aegean Islands, whose earliest beginnings date to c. 3500 BC, with the complex urban civilization beginning around 2000 BC, and then declining from c. 1450 BC until it ended around 1100 BC, during the early Greek Dark Ages, part of a wider bronze age collapse around the Mediterranean. It represents the first advanced civilization in Europe, leaving behind a number of massive building complexes, sophisticated art, and writing systems. Its economy benefited from a network of trade around much of the Mediterranean.

<span class="mw-page-title-main">Principal component analysis</span> Method of data analysis

Principal component analysis (PCA) is a popular technique for analyzing large datasets containing a high number of dimensions/features per observation, increasing the interpretability of data while preserving the maximum amount of information, and enabling the visualization of multidimensional data. Formally, PCA is a statistical technique for reducing the dimensionality of a dataset. This is accomplished by linearly transforming the data into a new coordinate system where the variation in the data can be described with fewer dimensions than the initial data. Many studies use the first two principal components in order to plot the data in two dimensions and to visually identify clusters of closely related data points. Principal component analysis has applications in many fields such as population genetics, microbiome studies, and atmospheric science.

<span class="mw-page-title-main">Qualitative research</span> Form of research

Qualitative research is a type of research that aims to gather and analyse non-numerical (descriptive) data in order to gain an understanding of individuals' social reality, including understanding their attitudes, beliefs, and motivation. This type of research typically involves in-depth interviews, focus groups, or observations in order to collect data that is rich in detail and context. Qualitative research is often used to explore complex phenomena or to gain insight into people's experiences and perspectives on a particular topic. It is particularly useful when researchers want to understand the meaning that people attach to their experiences or when they want to uncover the underlying reasons for people's behavior. Qualitative methods include ethnography, grounded theory, discourse analysis, and interpretative phenomenological analysis. Qualitative research methods have been used in sociology, anthropology, political science, psychology, social work, folklore, educational research and software engineering research.

<span class="mw-page-title-main">British School at Athens</span> Research center in Greece

The British School at Athens (BSA) is an archaeological research institute, one of the eight British International Research Institutes supported by the British Academy. Under UK law it is a registered educational charity, which translates to a non-profit organisation in American and Greek law. It also is one of the 19 Foreign Archaeological Institutes defined by Hellenic Law No. 3028/2002, "On the Protection of Antiquities and Cultural Heritage in General," passed by the Greek Parliament in 2002. Under that law the 17 accredited foreign institutes may perform systematic excavation in Greece with the permission of the government.

<span class="mw-page-title-main">C. R. Rao</span> Indian-American mathematician (born 1920)

Calyampudi Radhakrishna Rao,, commonly known as C. R. Rao, is an Indian-American mathematician and statistician. He is currently professor emeritus at Pennsylvania State University and Research Professor at the University at Buffalo. Rao has been honoured by numerous colloquia, honorary degrees, and festschrifts and was awarded the US National Medal of Science in 2002. The American Statistical Association has described him as "a living legend whose work has influenced not just statistics, but has had far reaching implications for fields as varied as economics, genetics, anthropology, geology, national planning, demography, biometry, and medicine." The Times of India listed Rao as one of the top 10 Indian scientists of all time. In 2023, Rao was awarded the International Prize in Statistics, an award often touted as the "statistics’ equivalent of the Nobel Prize". Rao is also a Senior Policy and Statistics advisor for the Indian Heart Association non-profit focused on raising South Asian cardiovascular disease awareness.

<span class="mw-page-title-main">Structural equation modeling</span> Form of causal modeling that fit networks of constructs to data

Structural equation modeling (SEM) is a label for a diverse set of methods used by scientists in both experimental and observational research across the sciences, business, and other fields. It is used most in the social and behavioral sciences.

In statistics, missing data, or missing values, occur when no data value is stored for the variable in an observation. Missing data are a common occurrence and can have a significant effect on the conclusions that can be drawn from the data.

Mark Bender Gerstein is an American scientist working in bioinformatics and Data Science. As of 2009, he is co-director of the Yale Computational Biology and Bioinformatics program.

<span class="mw-page-title-main">Neolithic Greece</span> Neolithic phase of Greece (c. 7000 – 3200 BC)

Neolithic Greece is an archaeological term used to refer to the Neolithic phase of Greek history beginning with the spread of farming to Greece in 7000–6500 BC. During this period, many developments occurred such as the establishment and expansion of a mixed farming and stock-rearing economy, architectural innovations, as well as elaborate art and tool manufacturing. Neolithic Greece is part of the Prehistory of Southeastern Europe.

In statistics and natural language processing, a topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for discovery of hidden semantic structures in a text body. Intuitively, given that a document is about a particular topic, one would expect particular words to appear in the document more or less frequently: "dog" and "bone" will appear more often in documents about dogs, "cat" and "meow" will appear in documents about cats, and "the" and "is" will appear approximately equally in both. A document typically concerns multiple topics in different proportions; thus, in a document that is 10% about cats and 90% about dogs, there would probably be about 9 times more dog words than cat words. The "topics" produced by topic modeling techniques are clusters of similar words. A topic model captures this intuition in a mathematical framework, which allows examining a set of documents and discovering, based on the statistics of the words in each, what the topics might be and what each document's balance of topics is.

<span class="mw-page-title-main">Christopher R. Johnson</span> American computer scientist

Christopher Ray Johnson is an American computer scientist. He is a distinguished professor of computer science at the University of Utah, and founding director of the Scientific Computing and Imaging Institute (SCI). His research interests are in the areas of scientific computing and scientific visualization.

A CUR matrix approximation is a set of three matrices that, when multiplied together, closely approximate a given matrix. A CUR approximation can be used in the same way as the low-rank approximation of the singular value decomposition (SVD). CUR approximations are less accurate than the SVD, but they offer two key advantages, both stemming from the fact that the rows and columns come from the original matrix :

In mathematical optimization, the problem of non-negative least squares (NNLS) is a type of constrained least squares problem where the coefficients are not allowed to become negative. That is, given a matrix A and a (column) vector of response variables y, the goal is to find

Robust Principal Component Analysis (RPCA) is a modification of the widely used statistical procedure of principal component analysis (PCA) which works well with respect to grossly corrupted observations. A number of different approaches exist for Robust PCA, including an idealized version of Robust PCA, which aims to recover a low-rank matrix L0 from highly corrupted measurements M = L0 +S0. This decomposition in low-rank and sparse matrices can be achieved by techniques such as Principal Component Pursuit method (PCP), Stable PCP, Quantized PCP, Block based PCP, and Local PCP. Then, optimization methods are used such as the Augmented Lagrange Multiplier Method (ALM), Alternating Direction Method (ADM), Fast Alternating Minimization (FAM), Iteratively Reweighted Least Squares (IRLS ) or alternating projections (AP).

<span class="mw-page-title-main">Jeffrey Heer</span> American computer scientist

Jeffrey Michael Heer is an American computer scientist best known for his work on information visualization and interactive data analysis. He is a professor of computer science & engineering at the University of Washington, where he directs the UW Interactive Data Lab. He co-founded Trifacta with Joe Hellerstein and Sean Kandel in 2012.

Phaedon Fessas (1922-2015) was a Greek Professor of Medicine at the Medical School of Athens University. He was Director of the 1st Department of Internal Medicine at the Laikon Hospital in Athens (1969-1989), where he established a very strong Hematology Division, his particular subspecialty. Professor Fessas was a clinician, teacher and researcher. His main research interest was thalassemia.

<span class="mw-page-title-main">Peter Bühlmann</span> Swiss mathematician

Peter Lukas Bühlmann is a Swiss mathematician and statistician.

John A. Stamatoyannopoulos a Greek-American physician-scientist in molecular biology and epigenomics. He is a professor of genome sciences and medicine at the University of Washington, where he heads the Stam Lab and led UW Medicine's participation in the ENCODE project. John is the son of Greek geneticist George Stamatoyannopoulos. Stamatoyannopoulos currently serves as scientific director at the Altius Institute for Biomedical Sciences.

References

  1. Michael W. Mahoney; Petros Drineas. "CUR matrix decompositions for improved data analysis" . Retrieved 26 June 2012.
  2. J. R. Hughey; P. Paschou; P. Drineas; D. Mastropaolo; D. M. Lotakis; P. A. Navas; M. Michalodimitrakis; J. A. Stamatoyannopoulos; G. Stamatoyannopoulos (2013). "A European Population in Minoan Bronze Age Crete". Nature Communications. (4)1861: 1861. Bibcode:2013NatCo...4.1861H. doi:10.1038/ncomms2871. PMC   3674256 . PMID   23673646.
  3. Tia Ghose, LiveScience: “Mysterious Minoans Were European, DNA Finds”, 2013,
  4. Petros Drineas at the Mathematics Genealogy Project
  5. Handbook of Big Data. Chapman and Hall/CRC Press. 2016. ISBN   9781482249088.