Deborah Washington Brown

Deborah Washington Brown
Deborah Washington Brown
Born	Deborah Blanche Washington; June 3, 1952; Washington, D.C.
Died	June 5, 2020 (aged 68); Atlanta, Georgia
Alma mater	Lowell Technological Institute, Harvard University ;
Known for	Speech recognition research; Classical piano;
Spouse	Ruel Brown
Children	2
Awards	first Black computer scientist to earn a Ph.D. in applied mathematics at Harvard
	Scientific career
Fields	Computer science
Thesis	The solution of difference equations describing array manipulation in program loops

Last updated March 21, 2024

Deborah Washington Brown (June 3, 1952 - June 5, 2020) was an American computer scientist and speech recognition researcher who worked at AT&T Bell Labs, and other companies for many years doing speech recognition research. She was the first black woman to earn a doctorate in computer science (then a part of their applied math program) at Harvard University in 1981^[1] from the Harvard John A. Paulson School of Engineering and Applied Sciences. She was one of the first black female computer scientists to graduate from a U. S. doctoral program. Mrs. Brown passed on June 5 after a long battle with cancer, her achievements and legacy remain as an inspiration for those who have followed in her footsteps.

Early life and education

Born Deborah Blanche Washington on June 3, 1952, in Washington D. C., Brown was the youngest of 4 children (with a twin brother Melvin Charles Washington) of Edwin and Lola Washington.^[2] She attended high school at the National Cathedral School 1966–70. She was admitted to the New England Conservatory of Music in 1970 to pursue her dream of becoming a classical pianist, but left in 1971 for Lowell Technological Institute after being dissuaded about her prospects. She received a bachelor's degree with honors in mathematics at Lowell in 1975. She received a Master's (1977) and a PhD (1981) in Applied Math at Harvard University advised first by Harry R. Lewis and then by Tom Cheatham.^[3] Her thesis was on "The solution of difference equations describing array manipulation in program loops".^[4] She was elected Commencement marshal at her Harvard graduation.^[1]

Computer science career

Brown's first job was at Norden Systems, developing software for missile defense technology. In the late 1980s, she joined AT&T Bell Labs as a Member of Technical Staff and later Principal Member of Technical Staff. Her speech technology career continued at other companies until her death in 2020.

Brown worked at the forefront of many applications of speech recognition during her career, and her contributions to the field are seen in part through her 11 United States Patents on which she is a named inventor. These include data collection methods using automatic speech recognition (ASR) instead of human agents, methods for correcting ASR errors in user id recognition (numbers or names) over the phone using confusion matrices, innovations in grammar generation and pruning for ASR, methods for identifying prompt-specific caller responses, multiple methods to identify errors in recognition of user account numbers due to ASR issues using confusion matrices of possible answers, a Natural Language Call Router, and a system to bridge text chat interaction with a voice-enabled interactive voice response system.^[5]^[6]^[7]^[8]^[9]^[10]^[11]^[12]^[13]^[14]^[15]

Personal life

In addition to her technological achievements, Brown was also an accomplished classical pianist. Throughout her career in computer science, Brown continued to study and teach piano, playing at Carnegie Hall and excelling in competitions.^[16]

Brown married Ruel “Rula” Brown on May 26, 1979. They have two daughters.^{[ citation needed ]}

Related Research Articles

Natural language processing (NLP) is an interdisciplinary subfield of computer science and linguistics. It is primarily concerned with giving computers the ability to support and manipulate human language. It involves processing natural language datasets, such as text corpora or speech corpora, using either rule-based or probabilistic machine learning approaches. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them. The technology can then accurately extract information and insights contained in the documents as well as categorize and organize the documents themselves.

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

Interactive voice response (IVR) is a technology that allows telephone users to interact with a computer-operated telephone system through the use of voice and DTMF tones input with a keypad. In telephony, IVR allows customers to interact with a company's host system via a telephone keypad or by speech recognition, after which services can be inquired about through the IVR dialogue. IVR systems can respond with pre-recorded or dynamically generated audio to further direct users on how to proceed. IVR systems deployed in the network are sized to handle large call volumes and also used for outbound calling as IVR systems are more intelligent than many predictive dialer systems.

A voicemail system is a computer-based system that allows people to leave a recorded message when the recipient is unable to answer the phone. The caller is prompted to leave a message and the recipient can retrieve said message at a later time.

A formal system is an abstract structure and formalization of an axiomatic system used for inferring theorems from axioms by a set of inference rules.

Communication access realtime translation (CART), also called open captioning or realtime stenography or simply realtime captioning, is the general name of the system that stenographers and others use to convert speech to text. A trained operator writes the exact words spoken using a special phonetic keyboard, or stenography methods, relaying a reliable and accurate translation that is broadcast to the recipient on a screen, laptop, or other device. CART professionals have qualifications for added expertise (speed and accuracy) as compared to court reporters and other stenographers.

<span class="mw-page-title-main">Margaret Oakley Dayhoff</span> American biochemist

Margaret Belle (Oakley) Dayhoff was an American physical chemist and a pioneer in the field of bioinformatics. Dayhoff was a professor at Georgetown University Medical Center and a noted research biochemist at the National Biomedical Research Foundation, where she pioneered the application of mathematics and computational methods to the field of biochemistry. She dedicated her career to applying the evolving computational technologies to support advances in biology and medicine, most notably the creation of protein and nucleic acid databases and tools to interrogate the databases. She originated one of the first substitution matrices, point accepted mutations (PAM). The one-letter code used for amino acids was developed by her, reflecting an attempt to reduce the size of the data files used to describe amino acid sequences in an era of punch-card computing.

Logical security consists of software safeguards for an organization's systems, including user identification and password access, authenticating, access rights and authority levels. These measures are to ensure that only authorized users are able to perform actions or access information in a network or a workstation. It is a subset of computer security.

Multimodal interaction provides the user with multiple modes of interacting with a system. A multimodal interface provides several distinct tools for input and output of data.

A voice-user interface (VUI) enables spoken human interaction with computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

Speech analytics is the process of analyzing recorded calls to gather customer information to improve communication and future interaction. The process is primarily used by customer contact centers to extract information buried in client interactions with an enterprise. Although speech analytics includes elements of automatic speech recognition, it is known for analyzing the topic being discussed, which is weighed against the emotional character of the speech and the amount and locations of speech versus non-speech during the interaction. Speech analytics in contact centers can be used to mine recorded customer interactions to surface the intelligence essential for building effective cost containment and customer service strategies. The technology can pinpoint cost drivers, trend analysis, identify strengths and weaknesses with processes and products, and help understand how the marketplace perceives offerings.

Frederick Jelinek was a Czech-American researcher in information theory, automatic speech recognition, and natural language processing. He is well known for his oft-quoted statement, "Every time I fire a linguist, the performance of the speech recognizer goes up".

Lloyd Nicholas Trefethen is an American mathematician, professor of numerical analysis and head of the Numerical Analysis Group at the Mathematical Institute, University of Oxford.

Voice search, also called voice-enabled search, allows the user to use a voice command to search the Internet, a website, or an app.

A spoken dialog system (SDS) is a computer system able to converse with a human with voice. It has two essential components that do not exist in a written text dialog system: a speech recognizer and a text-to-speech module. It can be further distinguished from command and control speech systems that can respond to requests but do not attempt to maintain continuity over time.

Natural-language user interface is a type of computer human interface where linguistic phenomena such as verbs, phrases and clauses act as UI controls for creating, selecting and modifying data in software applications.

Deep learning is the subset of machine learning methods based on artificial neural networks (ANNs) with representation learning. The adjective "deep" refers to the use of multiple layers in the network. Methods used can be either supervised, semi-supervised or unsupervised.

The following outline is provided as an overview of and topical guide to natural-language processing:

Wilga Marie Rivers was an Australian linguist and Professor of Romance Languages. While she taught at both the secondary-education and college level throughout her life, she spent the majority of her career on the faculty of Harvard University. There, she served as a Professor of Romance Languages and Coordinator of Language Instruction in Romance Languages, fulfilling these roles until her eventual retirement in 1989.

Madeleine Ashcraft Bates is a researcher in natural language processing who worked at BBN Technologies in Cambridge, Massachusetts from the early 1970s to the late 1990s. She was president of the Association for Computational Linguistics in 1985, and co-editor of the book Challenges in Natural Language Processing (1993).

References

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[harvard-1] 1 2 Zewe, Adam (June 24, 2020), Alumni profile: Deborah Washington Brown, Ph.D. '81, Harvard School of Engineering, retrieved February 15, 2023

[2] "LOLA WASHINGTON Obituary - Washington, District of Columbia | Legacy.com". Legacy.com .

[3] "Thomas Edward Cheatham Jr". April 26, 2007.

[diss-4] Harvard University Dissertation and Historical Note

[5] "Telephone-based speech recognition for data collection".

[6] "System and method of recognizing letters and numbers by either speech or touch tone recognition utilizing constrained confusion matrices".

[7] "Method and apparatus for performing a grammar-pruning operation".

[8] "Method and apparatus for performing a name acquisition based on speech recognition".

[9] "Confusion set-base method and apparatus for pruning a predetermined arrangement of indexed identifiers".

[10] "Statistical database correction of alphanumeric account numbers for speech recognition and touch-tone recognition".

[11] "Distributed recognition system having multiple prompt-specific and response-specific speech recognizers".

[12] "Statistical database correction of alphanumeric identifiers for speech recognition and touch-tone recognition".

[13] "Natural language call router".

[14] "Concise dynamic grammars using N-best selection".

[15] "Bridge for non-voice communications user interface to voice-enabled interactive voice response system".

[16] "Romantic Music Competition 2015". Archived from the original on June 29, 2020. Retrieved June 28, 2020.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]