Hui Zou | |||||||
---|---|---|---|---|---|---|---|
Alma mater | Stanford University University of Science and Technology of China | ||||||
Known for | Elastic net Adaptive Lasso Sparse PCA LLA for Nonconvex Penalization | ||||||
Scientific career | |||||||
Fields | Statistics, Statistical learning | ||||||
Institutions | University of Minnesota | ||||||
Doctoral advisor | Trevor Hastie | ||||||
Chinese name | |||||||
Simplified Chinese | 邹晖 | ||||||
|
Hui Zou is currently a professor of statistics at the University of Minnesota.
Sparse principal component analysis is a technique used in statistical analysis and, in particular, in the analysis of multivariate data sets. It extends the classic method of principal component analysis (PCA) for the reduction of dimensionality of data by introducing sparsity structures to the input variables.
In statistical theory, the field of high-dimensional statistics studies data whose dimension is larger than typically considered in classical multivariate analysis. The area arose owing to the emergence of many modern data sets in which the dimension of the data vectors may be comparable to, or even larger than, the sample size, so that justification for the use of traditional techniques, often based on asymptotic arguments with the dimension held fixed as the sample size increased, was lacking.
In statistics and machine learning, lasso is a regression analysis method that performs both variable selection and regularization in order to enhance the prediction accuracy and interpretability of the resulting statistical model. It was originally introduced in geophysics, and later by Robert Tibshirani, who coined the term.
Cynthia Dwork is an American computer scientist best known for her contributions to cryptography, distributed computing, and algorithmic fairness. She is one of the inventors of differential privacy and proof-of-work.
Statistics education is the practice of teaching and learning of statistics, along with the associated scholarly research.
Peter J. Rousseeuw is a statistician known for his work on robust statistics and cluster analysis. He obtained his PhD in 1981 at the Vrije Universiteit Brussel, following research carried out at the ETH in Zurich, which led to a book on influence functions. Later he was professor at the Delft University of Technology, The Netherlands, at the University of Fribourg, Switzerland, and at the University of Antwerp, Belgium. Next he was a senior researcher at Renaissance Technologies. He then returned to Belgium as professor at KU Leuven, until becoming emeritus in 2022. His former PhD students include Annick Leroy, Hendrik Lopuhaä, Geert Molenberghs, Christophe Croux, Mia Hubert, Stefan Van Aelst, Tim Verdonck and Jakob Raymaekers.
Robert Tibshirani is a professor in the Departments of Statistics and Biomedical Data Science at Stanford University. He was a professor at the University of Toronto from 1985 to 1998. In his work, he develops statistical tools for the analysis of complex datasets, most recently in genomics and proteomics.
In statistics and, in particular, in the fitting of linear or logistic regression models, the elastic net is a regularized regression method that linearly combines the L1 and L2 penalties of the lasso and ridge methods.
Trevor John Hastie is an American statistician and computer scientist. He is currently serving as the John A. Overdeck Professor of Mathematical Sciences and Professor of Statistics at Stanford University. Hastie is known for his contributions to applied statistics, especially in the field of machine learning, data mining, and bioinformatics. He has authored several popular books in statistical learning, including The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Hastie has been listed as an ISI Highly Cited Author in Mathematics by the ISI Web of Knowledge.
Stuart Alan Geman is an American mathematician, known for influential contributions to computer vision, statistics, probability theory, machine learning, and the neurosciences. He and his brother, Donald Geman, are well known for proposing the Gibbs sampler, and for the first proof of convergence of the simulated annealing algorithm.
David Sankoff is a Canadian mathematician, bioinformatician, computer scientist and linguist. He holds the Canada Research Chair in Mathematical Genomics in the Mathematics and Statistics Department at the University of Ottawa, and is cross-appointed to the Biology Department and the School of Information Technology and Engineering. He was founding editor of the scientific journal Language Variation and Change (Cambridge) and serves on the editorial boards of a number of bioinformatics, computational biology and linguistics journals. Sankoff is best known for his pioneering contributions in computational linguistics and computational genomics. He is considered to be one of the founders of bioinformatics. In particular, he had a key role in introducing dynamic programming for sequence alignment and other problems in computational biology. In Pavel Pevzner's words, "[ Michael Waterman ] and David Sankoff are responsible for transforming bioinformatics from a ‘stamp collection' of ill-defined problems into a rigorous discipline with important biological applications."
Francis X. Diebold is an American economist known for his work in predictive econometric modeling, financial econometrics, and macroeconometrics. He earned both his B.S. and Ph.D. degrees at the University of Pennsylvania, where his doctoral committee included Marc Nerlove, Lawrence Klein, and Peter Pauly. He has spent most of his career at Penn, where he has mentored approximately 75 Ph.D. students. Presently he is Paul F. and Warren S. Miller Professor of Social Sciences and Professor of Economics at Penn’s School of Arts and Sciences, and Professor of Finance and Professor of Statistics at Penn’s Wharton School. He is also a Faculty Research Associate at the National Bureau of Economic Research in Cambridge, Massachusetts, and author of the No Hesitations blog.
Alan Enoch Gelfand is an American statistician, and is currently the James B. Duke Professor of Statistics and Decision Sciences at Duke University. Gelfand’s research includes substantial contributions to the fields of Bayesian statistics, spatial statistics and hierarchical modeling.
Jennifer Ann Hoeting is an American statistician known for her work with Adrian Raftery, David Madigan, and others on Bayesian model averaging. She is a professor of statistics at Colorado State University, and executive editor of the open-access journal Advances in Statistical Climatology, Meteorology and Oceanography, published by Copernicus Publications. With Geof H. Givens, a colleague at Colorado State, she is the author of Computational Statistics, a graduate textbook on computational methods in statistics.
Wendy L. Martinez is an American statistician. She directs the Mathematical Statistics Research Center of the Bureau of Labor Statistics, and is the coordinating editor of the journal Statistics Surveys. In 2018, Martinez was elected president of the American Statistical Association for the 2020 term.
Aparna V. Huzurbazar is an American statistician known for her work using graphical models to understand time-to-event data. She is the author of a book on this subject, Flowgraph Models for Multistate Time-to-Event Data.
Carol Anne Gotway Crawford is an American mathematical statistician and from 2018 to 2020 served as Chief Statistician of the U.S. Government Accountability Office (GAO). She joined the GAO in May 2017. From August 2014 to April 2017, she was with the Department of Agriculture's National Agricultural Statistics Service. She was formerly at the National Center for Environmental Health of the Centers for Disease Control and Prevention. She also holds an adjunct faculty position at the Rollins School of Public Health of Emory University, and is an expert in biostatistics, spatial analysis, environmental statistics, and the statistics of public health. She also maintains an interest in geoscience and has held executive roles in the International Association for Mathematical Geosciences.
Daniela M. Witten is an American biostatistician. She is a professor and the Dorothy Gilford Endowed Chair of Mathematical Statistics at the University of Washington. Her research investigates the use of machine learning to understand high-dimensional data.
Janet Turk Wittes is an American statistician known for her work on clinical trials.
Peter Lukas Bühlmann is a Swiss mathematician and statistician.