SpinVox

SpinVox Ltd.
Type Private
Industry Telecommunications
Founded 2003
Headquarters Marlow, United Kingdom and New York, USA
Area served Global
Key people Christina Domecq (co-founder, CEO) and Daniel Doulton (co-founder, Chief Strategy Officer)
Number of employees 250 (2009)
Website SpinVox.com

SpinVox was a start-up company that is now a subsidiary of Nuance Communications, an American multinational computer software corporation headquartered in Burlington, Massachusetts, on the outskirts of Boston, which provides speech and imaging applications. SpinVox initially provided voice-to-text conversion services for carrier markets, including wireless, fixed, VoIP and cable, as well as for unified communications, enterprise and Web 2.0 environments. The service was ostensibly delivered by an automated computer system, with human intervention where needed. However, it was alleged that the system relied almost exclusively on call-centre workers in South Africa and the Philippines. [1]

Company history

The company was founded in 2003 by Christina Domecq and Daniel Doulton, and raised $200 million in funding. [2] Its accounts for 2007 stated that SpinVox made a loss of £36 million against £2 million of revenue. [3] In July 2009, in response to cash-flow problems, it asked staff to take all or part of their salaries in stock to reduce costs. [4] In August 2009, a dossier alleging financial irregularities was circulated to shareholders, [5] leading the company to launch an inquiry into the activities of some senior executives. [6] Unaudited accounts for 2008 showed that the group's pre-tax loss had widened to £49 million, compared with £37 million a year earlier. [7] In September 2009, Invesco Perpetual stated that it had written down the value of its investment in the company by 90% and that SpinVox was for sale. [8] [9] SpinVox was sold to US company Nuance Communications for $103 million (£64 million) in December 2009. [10]

Technology

The Voice Message Conversion System (VMCS) worked by combining speech technologies with live learning capabilities and human intelligence.[specify][citation needed] It was developed by the SpinVox Advanced Speech Group, based in Cambridge, UK, which was led by Cambridge academic entrepreneur Dr. Tony Robinson and included Cambridge University Professor Phil Woodland.[citation needed] The company supported the following languages: English, French, Spanish, German, Italian and Portuguese.[citation needed] Nuance Communications, at the time a rival and later SpinVox's parent company, claimed that "spinvox is offering something that is impossible to deliver now". [11] Patent applications filed by the company in 2004 and 2008 note that "because human operators are used instead of machine transcription, voicemails are converted accurately, intelligently, appropriately and succinctly into text messages". [12]

In 2009, SpinVox also acquired the New Zealand-based company Angel Messaging, in the process gaining its second patent, "Method and System of processing messages", which outlines how human transcribers can be used efficiently in real-time transcription of voice messages. [13]

SpinVox's voice-to-text conversion services included voicemail-to-text, speak-a-text, blog posts, social network updates, and blast and memo messages. SpinVox also operated an open API that allowed any developer to create speech-to-text-based Web or mobile applications.

Data protection issues

A 2009 investigation by the BBC technology correspondent Rory Cellan-Jones alleged that the company transferred voicemail data out of the European Union to call centres in South Africa and the Philippines, in breach of its entry on the UK Register of data controllers, and that most of the transcription was done by humans rather than software. [1] SpinVox admitted that "parts of messages can be sent to a 'conversion expert'", but also claimed that "the part sent is anonymised so that there is no way of tracking back a particular number or person". [1]

SpinVox responded to the allegations by stating that it was in compliance with the Data Protection Act 1998. In a statement, the company said that the act permitted the processing of data outside the European Economic Area (EEA). [14]

References

  1. Cellan-Jones, Rory (23 July 2009). "Voice-to-text service scrutinised". BBC News. Retrieved 23 July 2009.
  2. Andrews, Robert (24 July 2009). "SpinVox Investor: 'It's A Nice Problem To Have'". The Washington Post. Retrieved 28 July 2009.
  3. Cellan-Jones, Rory (24 July 2009). "Voice technology firm hits back". BBC News.
  4. Andrews, Robert (13 July 2009). "SpinVox Paying Staff In Stock To Save On Costs". The Washington Post. Retrieved 23 July 2009.
  5. "SpinVox examines dossier claims". BBC News. 10 August 2009. Retrieved 10 August 2009.
  6. Walsh, Kate; James Ashton (9 August 2009). "Spinvox in probe over financial mismanagement". The Sunday Times. Retrieved 10 August 2009.
  7. Ashton, James (23 August 2009). "Spinvox widens losses by 30%". The Sunday Times. Retrieved 23 August 2009.
  8. "UK firm Spinvox 'put up for sale'". BBC News. 11 September 2009. Retrieved 11 September 2009.
  9. Andrews, Robert (11 September 2009). "SpinVox For Sale, Investor Says, As It Kisses Its Cash Goodbye". moconews.net. Retrieved 11 September 2009.
  10. Judge, Elizabeth (31 December 2009). "SpinVox sold to US rival Nuance Communications for £64m". The Times. London. Retrieved 30 December 2009.
  11. "Spinvox "Faked" Speech Transcription Service And Broke Privacy". eWeek Europe. 23 July 2009. Retrieved 24 July 2009.
  12. Cellan-Jones, Rory (29 July 2009). "Humans central in Spinvox patents". BBC News. Retrieved 29 July 2009.
  13. "Method and system of processing messages". Retrieved 27 June 2015.
  14. "Spinvox responds". Spinvox blog. 23 July 2009. Retrieved 28 July 2009.