International Conference on Acoustics, Speech, and Signal Processing | |
---|---|
Abbreviation | ICASSP |
Discipline | signal processing, machine learning, audio signal processing, speech processing |
Publication details | |
Publisher | IEEE |
History | 1976–present |
Frequency | Annual |
Website | 2024 |
ICASSP, the International Conference on Acoustics, Speech, and Signal Processing, is an annual flagship conference organized by IEEE Signal Processing Society. Ei Compendex has indexed all papers included in its proceedings.
The first ICASSP was held in 1976 in Philadelphia, Pennsylvania, based on the success of a conference in Massachusetts four years earlier that had focused specifically on speech signals. [1]
As ranked by Google Scholar's h-index metric in 2016, ICASSP has the highest h-index of any conference in the Signal Processing field. The Brazilian ministry of education gave the conference an 'A1' rating based on its h-index. [2] [3]
Year | Dates | Location | Conference Chairs | Attendance | Accepted Papers |
---|---|---|---|---|---|
2024 | 14-19 April | Seoul, Korea | General Chair: Hanseok Ko and Monson Hayes Technical Program Chair: John Hansen | 4403 | 2712 |
2023 | 4-10 June | Rhodes Island, Greece | General Chair: Kostas Berberidis, Petros Boufounos, and Petros Maragos Technical Program Chair: Constantine Kotropoulos and Shrikanth Narayanan | 4086 | 2726 |
2022 | 22-27 May | Singapore, Singapore | General Chair: Haizhou Li Technical Program Chair: Woon-Seng Gan | 2510 | 1874 |
2021 | 6-11 June | Virtual | General Chair: Dimitrios Androutsos, Kostas Plataniotis, and Xiao-Ping Zhang Technical Program Chair: Tim Davidson and Dong Yu | 2184 | 1732 |
2020 | 4-8 May | Virtual | General Chair: Ana Perez-Neira and Xavier Mestre Technical Program Chair: Markus Rupp, Christian Jutten, and Pascale Fung | 16087 | 1871 |
2019 | 12-17 May | Brighton, UK | General Chair: Lajos Hanzo and Saeid Sanei Technical Program Chair: Nam Ik Cho, Petar M Djuric, and Andrzej Cichocki | 3060 | 1743 |
2018 | 15-20 April | Calgary, AB, Canada | General Chair: Monson Hayes and Hanseok Ko Technical Program Chair: Pascale Fung and Nam Ik Cho | 2269 | 1404 |
2017 | 5-9 March | New Orleans, LA, USA | General Chair: Magdy Bayoumi Technical Program Chair: Tulay Adali and Eli Saber | 2239 | 1331 |
2016 | 20-25 March | Shanghai, China | General Chair: Wenjun Zhang, Zhi Ding, and Zhi-Quan Luo Technical Program Chair: Xiaodong Wang | 2093 | 1333 |
2015 | 19-24 April | Brisbane, Australia | General Chair: Vaughan Clarkson and Jonathan Manton Technical Program Chair: Doug Gray and Doug Cochran | 1731 | 1209 |
2014 | 25-30 May | Florence, Italy | General Chair: Fulvio Gini Technical Program Chair: Abdelhak Zoubir | 2406 | 1692 |
2013 | 26-31 May | Vancouver, BC, Canada | General Chair: Li Deng and Rabab Ward Technical Program Chair: Vikram Krishnamurthy | 2431 | 1793 |
2012 | 25-30 March | Kyoto, Japan | General Chair: Hideaki Sakai and Takao Nishitani Technical Program Chair: Akihiko Sugiyama and Hitoshi Kiya | 2024 | 1357 |
2011 | 22-27 May | Prague, Czech Republic | General Chair: Petr Tichavsky Technical Program Chair: Jonathon Chambers | 2066 | 1505 |
2010 | 14-19 March | Dallas, TX, USA | General Chair: Scott Douglas Technical Program Chair: John Hansen | 1905 | 1386 |
2009 | 19-24 April | Taipei, Taiwan | General Chair: Lin-shan, Lee and Iee-Ray Wei Technical Program Chair: Liang-Gee Chen and James R. Glass | 1689 | 1689 |
2008 | 31 March-4 April | Las Vegas, NV, USA | General Chair: Ali H. Sayed Technical Program Chair: Björn Ottersten | 2467 | 1362 |
2007 | 16-20 April | Honolulu, HI, USA | General Chair: K.J. Ray Liu and Todd Reed Technical Program Chair: Anthony Kuh and Yi-Fang Huang | 1773 | 1390 |
2006 | 14-19 May | Toulouse, France | General Chair: Francis Castanie Technical Program Chair: P. Duhamel and L. Vandendorpe | 2169 | 1572 |
2005 | 19-23 March | Philadelphia, PA, USA | General Chair: Athina P. Petropulu Technical Program Chair: K. Barner and J.-C. Pesquet | 966 | 1479 |
2004 | 17-21 May | Montreal, QC, Canada | General Chair: Douglas O’Shaughnessy Technical Program Chair: Li Deng and Peter Kabal | 2211 | 1399 |
2003 | 6-10 April | Hong Kong (canceled) | General Chair: Wan-Chi Siu, A. G. Constantinides, and Yiu-tong Chan Technical Program Chair: P. C. Ching | Canceled | 1274 |
2002 | 13-17 May | Orlando, FL, USA | General Chair: Fred 1. Taylor Technical Program Chair: Jose Principe | 1621 | 1164 |
2001 | 7-11 May | Salt Lake City, UT, USA | General Chair: V. John Mathews Technical Program Chair: A. Lee Swindlehurst | 2023 | 1017 |
2000 | 5-9 June | Istanbul, Turkey | General Chair: Huseyin Abut and Levent Onural Technical Program Chair: A. Murat Tekalp and Biilent Sankur | 1453 | 989 |
1999 | 15-19 March | Phoenix, AZ, USA | General Chair: Andreas Spanias and Douglas Cochran Technical Program Chair: W. Bastiaan Kleijn and Joseph Picone | 2005 | 911 |
1998 | 12-15 May | Seattle, WA, USA | General Chair: Les Atlas Technical Program Chair: Hynek Hermansky and Jenq-Neng Hwang | 2100 | 965 |
1997 | 21-24 April | Munich, Germany | General Chair: Manfred Lang Technical Program Chair: Josef A. Nossek | 1800 | 1050 |
1996 | 7-10 May | Atlanta, GA, USA | General Chair: Monson Hayes Technical Program Chair: Mark A Clements | 1900 | 905 |
1995 | 9-12 May | Detroit, MI, USA | General Chair: Alfred Hero Technical Program Chair: William J. Williams and Andrew Yagle | 1941 | 917 |
1994 | 18-22 April | Adelaide, Australia | General Chair: Robert E. Bogner Technical Program Chair: Boualem Boashash | 1331 | 844 |
1993 | 27-30 April | Minneapolis, MN, USA | General Chair: Mos Kaveh Technical Program Chair: Jan Allebach and Kevin Buckley | 2000 | 814 |
1992 | 23-26 March | San Francisco, CA, USA | General Chair: Marcia Bush Technical Program Chair: Michael Portnoff and Gary Kopec | 1998 | 806 |
1991 | 14-17 May | Toronto, ON, Canada | General Chair: Y.T. Chan Technical Program Chair. AN. Venetsanopoulos | 1949 | 933 |
1990 | 3-6 April | Albuquerque, NM, USA | General Chair: Delores M. Etter Technical Program Chair: Nasir Ahmed | 1700 | 725 |
1989 | 23-26 May | Glasgow, Scotland | General Chair: Tariq Durrani Technical Program Chair: Peter Grant and Roy Chapman | 1617 | 711 |
1988 | 11-14 April | New York, NY, USA | General Chair: Jont B. Allen Technical Program Chair: John G. Ackenhusen | 1899 | 718 |
1987 | 6-9 April | Dallas, TX, USA | General Chair: Panos E. Papamichalis Technical Program Chair: Masud M. Arjmand | 1400 | 615 |
1986 | 8-11 April | Tokyo, Japan | General Chair: Hiroya Fujisaki Technical Program Chair: Shuzo Saito and Jae S. Lim | 1350 | 785 |
1985 | 26-29 March | Tampa, FL, USA | General Chair: N. Rex Dixon Technical Program Chair: Vijay Jain | 1400 | 472 |
1984 | 19-21 March | San Diego, CA, USA | General Chair: Stanley A. White Technical Program Chair: Y.T. Chan | 1520 | 537 |
1983 | 14-16 April | Boston, MA, USA | General Chair: Peter E. Blankenship Technical Program Chair: John Makhoul | 1300 | 372 |
1982 | 3-5 May | Paris, France | General Chair: Claude Gueguen Technical Program Chair: Maurice Bellanger | 1653 | 522 |
1981 | 30 March-1 April | Atlanta, GA, USA | General Chair: Ronald W. Schafer Technical Program Chair: Russell M. Mersereau | 950 | 295 |
1980 | 9-11 April | Denver, CO, USA | General Chair: J. Robert Ashley Technical Program Chair: Louis L. Scharf | 970 | 257 |
1979 | 2-4 April | Washington, DC, USA | General Chair: Anthony I. Eller Technical Program Chair: G. Robert Redinbo | 900 | 242 |
1978 | 10-12 April | Tulsa, OK, USA | General Chair: Rao Yarlagadda Technical Program Chair: Thomas H. Crystal | 641 | 210 |
1977 | 8-11 May | Hartford, CT, USA | General Chair: Harvey Silverman Technical Program Chair: N. Rex Dixon | ~640 | 220 |
1976 | 12-14 April | Philadelphia, PA, USA | General Chair: Charles F. Teacher Technical Program Chair: Thomas B. Martin | ~600 | 225 |
Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model.
Vector quantization (VQ) is a classical quantization technique from signal processing that allows the modeling of probability density functions by the distribution of prototype vectors. Developed in the early 1980s by Robert M. Gray, it was originally used for data compression. It works by dividing a large set of points (vectors) into groups having approximately the same number of points closest to them. Each group is represented by its centroid point, as in k-means and some other clustering algorithms. In simpler terms, vector quantization chooses a set of points to represent a larger set of points.
PSOLA is a digital signal processing technique used for speech processing and more specifically speech synthesis. It can be used to modify the pitch and duration of a speech signal. It was invented around 1986.
In signal processing, the chirplet transform is an inner product of an input signal with a family of analysis primitives called chirplets.
Keyword spotting is a problem that was historically first defined in the context of speech processing. In speech processing, keyword spotting deals with the identification of keywords in utterances.
Scale-space segmentation or multi-scale segmentation is a general framework for signal and image segmentation, based on the computation of image descriptors at multiple scales of smoothing.
Warped linear predictive coding is a variant of linear predictive coding in which the spectral representation of the system is modified, for example by replacing the unit delays used in an LPC implementation with first-order all-pass filters. This can have advantages in reducing the bitrate required for a given level of perceived audio quality/intelligibility, especially in wideband audio coding.
The IEEE Signal Processing Society is one of the nearly 40 technical societies of the Institute of Electrical and Electronics Engineers (IEEE) and the first one created. Its mission is to "advance and disseminate state-of-the-art scientific information and resources; educate the signal processing community; and provide a venue for people to interact and exchange ideas."
Speaker adaptation is an important technology to fine-tune either features or speech models for mis-match due to inter-speaker variation. In the last decade, eigenvoice (EV) speaker adaptation has been developed. It makes use of the prior knowledge of training speakers to provide a fast adaptation algorithm. Inspired by the kernel eigenface idea in face recognition, kernel eigenvoice (KEV) is proposed. KEV is a non-linear generalization to EV. This incorporates Kernel principal component analysis, a non-linear version of Principal Component Analysis, to capture higher order correlations in order to further explore the speaker space and enhance recognition performance.
Nelson Harold Morgan is an American computer scientist and professor in residence (emeritus) of electrical engineering and computer science at the University of California, Berkeley. Morgan is the co-inventor of the Relative Spectral (RASTA) approach to speech signal processing, first described in a technical report published in 1991.
Financial signal processing is a branch of signal processing technologies which applies to signals within financial markets. They are often used by quantitative analysts to make best estimation of the movement of financial markets, such as stock prices, options prices, or other types of derivatives.
In communications technology, the technique of compressed sensing (CS) may be applied to the processing of speech signals under certain conditions. In particular, CS can be used to reconstruct a sparse vector from a smaller number of measurements, provided the signal can be represented in sparse domain. "Sparse domain" refers to a domain in which only a few measurements have non-zero values.
An audio coding format is a content representation format for storage or transmission of digital audio. Examples of audio coding formats include MP3, AAC, Vorbis, FLAC, and Opus. A specific software or hardware implementation capable of audio compression and decompression to/from a specific audio coding format is called an audio codec; an example of an audio codec is LAME, which is one of several different codecs which implements encoding and decoding audio in the MP3 audio coding format in software.
In Western music, the term chroma feature or chromagram closely relates to twelve different pitch classes. Chroma-based features, which are also referred to as "pitch class profiles", are a powerful tool for analyzing music whose pitches can be meaningfully categorized and whose tuning approximates to the equal-tempered scale. One main property of chroma features is that they capture harmonic and melodic characteristics of music, while being robust to changes in timbre and instrumentation.
V John Mathews is an Indian-American engineer and educator who is currently a Professor of Electrical Engineering and Computer Science (EECS) at the Oregon State University, United States.
The IARPA Babel program developed speech recognition technology for noisy telephone conversations. The main goal of the program was to improve the performance of keyword search on languages with very little transcribed data, i.e. low-resource languages. Data from 26 languages was collected with certain languages being held-out as "surprise" languages to test the ability of the teams to rapidly build a system for a new language.
The IEEE Fourier Award for Signal Processing is a Technical Field Award that is given by the Institute of Electrical and Electronics Engineers. This award is presented for contributions in the field of signal processing.
Matti Antero Karjalainen was a Finnish speech processing researcher and inventor in the fields of speech synthesis, speech analysis, speech technology, audio signal processing and psychoacoustics. He was the head of Acoustics Laboratory at the Helsinki University of Technology from 1980 to 2006.
Namrata Vaswani is an Indian-American electrical engineer known for her research in compressed sensing, robust principal component analysis, signal processing, statistical learning theory, and computer vision. She is a Joseph and Elizabeth Anderlik Professor in Electrical and Computer Engineering at Iowa State University, and a professor of mathematics at Iowa State.
Athina Petropulu is a Greek electrical engineer, researcher and academic. She is Distinguished Professor in the Electrical and Computer Engineering (ECE) Department at Rutgers, The State University of New Jersey. She has made contributions in signal processing, wireless communications and networks, and radar systems. She received many awards for her work in these areas.