Microsoft text-to-speech voices

Last updated

The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform. There are client, server, and mobile versions of Microsoft text-to-speech voices. Client voices are shipped with Windows operating systems; server voices are available for download for use with server applications such as Speech Server, Lync etc. for both Windows client and server platforms, and mobile voices are often shipped with more recent versions.

Contents

Voices

Windows 2000 and Windows XP

A speech sample of Microsoft Sam, using the SAPI 5 version of the voice.
The first part uses a variation of "The quick brown fox jumps over the lazy dog" panagram. The second part demonstrates the "soy/soi" glitch associated with Sam.

Microsoft Sam is the default text-to-speech male voice in Microsoft Windows 2000 and Windows XP. It is used by Narrator, the screen reader program built into the operating system.

Microsoft Mike and Microsoft Mary are optional male and female voices respectively, available for download from the Microsoft website. Michael and Michelle are also optional male and female voices licensed by Microsoft from Lernout & Hauspie, and are available through Microsoft Office XP and Microsoft Office 2003 or Microsoft Reader.

There are both SAPI 4 and SAPI 5 versions of these text-to-speech voices. The speech patterns of the SAPI 4 and SAPI 5 versions of the text-to-speech voices are different from each other. SAPI 4 voices are only available on Windows 2000 and later Windows NT-based operating systems. Redistributable versions of the SAPI 4 voices were available for download on Windows 9x operating systems, however they are no longer offered from the Microsoft website. While the SAPI 5 versions of Microsoft Mike and Microsoft Mary are only downloadable as a Merge Module, [1] the installable versions may be installed on end users' systems by speech applications such as Microsoft Reader.

The SAPI 4 versions of Microsoft Sam, Microsoft Mike and Microsoft Mary can be used on Windows XP, Windows Vista, and later with a third-party program (like Speakonia and TTSReader) installed on the machine that supports these operating systems. In addition, the SAPI 4 versions of the Michael and Michelle soundalikes from Lernout & Hauspie (with different dialects) can also be used on Windows Vista and later by downloading the respective British English pack and then using it with a third-party program like Speakonia (Conversely, said voices are also compatible with XP and prior as well).

The SAPI 5 versions of Microsoft Sam, Microsoft Mike and Microsoft Mary can also be used on Windows Vista and later by installing the SAPI 5.1 SDK, which can also be installed in versions of Windows prior to XP beginning with Windows NT 4.0 SP6a and Windows 98. [1] As well, the SAPI 5 versions of Michael and Michelle from Lernout & Hauspie may also be installed via programs such as Microsoft Office XP or Microsoft Office 2003, however the voices cannot be chosen under normal means.

Windows Vista and Windows 7

Beginning with Windows Vista and Windows 7, Microsoft Anna is the default English voice. It is a SAPI 5-only female voice and is designed to sound more natural than Microsoft Sam. [2] Microsoft Streets & Trips 2006 and later install the Microsoft Anna voice on Windows XP systems for the voice-prompt direction feature. There are no male voices shipping with Windows Vista and Windows 7, and neither Microsoft Mike or Mary will work on Windows 7.

A female voice called Microsoft Lili that replaces the earlier male SAPI 5 voice "Microsoft Simplified Chinese" is available in Chinese versions of Windows Vista and Windows 7. It can also be obtained in non-Chinese versions of Windows 7 or Vista by installing the Chinese language pack.

In 2010, Microsoft released the newer Speech Platform compatible voices for Speech Recognition and Text-to-Speech for use with client and server applications. These voices are available in 26 languages [3] and can be installed on Windows client and server operating systems. Speech Platform voices, unlike SAPI 5 voices, are female-only; no male voices were ever released.

Windows 8 and Windows 8.1

In Windows 8, there are three new client (desktop) voices - Microsoft David (US male), Hazel (UK female) and Zira (US female) which are intended to sound more natural than Microsoft Anna. The server versions of these voices are available via the above-mentioned Speech Platform for operating systems earlier than Windows 8. Other voices are available for specific language versions of either Windows 8 or Windows 8.1. [4]

Unlike Windows 7 or Vista, one cannot use any third-party program for Microsoft Anna because there is no official Anna Voice API for download (especially because Microsoft Anna was only available in SAPI 5 and no SAPI 4 version of the voice exists).

Windows 10

In Windows 10, Microsoft Hazel was removed from the US English Language Pack and the Microsoft voices for Mobile (Phone/tablet) are available (Microsoft Mark and Microsoft Zira). These are the same voices found on Windows Phone 8, Windows Phone 8.1 and Windows 10 Mobile.

Also with these voices language packs are also available for a variety of voices similar to that of Windows 8 and 8.1. None of these voices match the Cortana text-to-speech voice which can be found on Windows Phone 8.1, Windows 10, and Windows 10 Mobile.

In an attempt to unify its software with Windows 10, all of Microsoft's current platforms use the same text-to-speech voices except for Microsoft David and a few others.

Mobile

Every mobile voice package has the combination of male/female, while most of the desktop voice packages have only female voices. All mobile voices have been made universal and any user who downloads the language pack of that choice will have one extra male and female voice per that package.

A hidden text-to-speech voice in Windows 10 called Microsoft Eva Mobile is present within the system. Users can download a pre-packaged registry file from the windowsreport.com website. Microsoft Eva is believed to be the early voice for Cortana until Microsoft replaced her with the voice of Jen Taylor in most areas.

These voices are updated with Windows to sound more natural than in the original version as seen in updated retail builds of Windows 10.

Windows 11

Windows 11 introduced three new "natural voices" borrowed from Microsoft's Azure cloud computing platform starting with version 22H2: Microsoft Aria, Jenny, and Guy. [5] These natural voices are intended to sound more natural than previous text-to-speech voices. It is exclusively available in Narrator and cannot be used in any other applications outside of it, including all first-party and third-party applications as of 2024.

The voices from Windows 10 are now reclassified as "legacy voices", however Microsoft David was still used as the default for the desktop client.

See also

Related Research Articles

Windows is a product line of proprietary graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sectors of the computing industry – Windows (unqualified) for a consumer or corporate workstation, Windows Server for a server and Windows IoT for an embedded system. Windows is sold as either a consumer retail product or licensed to third-party hardware manufacturers who sell products bundled with Windows.

<span class="mw-page-title-main">Microsoft Agent</span> Virtual software agent technology

Microsoft Agent is a technology developed by Microsoft which employs animated characters, text-to-speech engines, and speech recognition software to enhance interaction with computer users. It came pre-installed as part of Windows 2000 and later versions of Microsoft Windows up to Windows Vista. It was not included with Windows 7, and was completely discontinued in Windows 8. Microsoft Agent functionality was exposed as an ActiveX control that can be used by web pages.

<span class="mw-page-title-main">Virtual PC</span> Emulator for PowerPC Macs and for Windows

Virtual PC is a discontinued x86 emulator software for Microsoft Windows hosts and PowerPC-based Mac hosts. It was created by Connectix in 1997 and acquired by Microsoft in 2003, after which the program was renamed Microsoft Virtual PC. In July 2006, Microsoft released the Windows version free of charge. The Mac version was discontinued following the transition to Intel processors that same year.

<span class="mw-page-title-main">Windows XP Professional x64 Edition</span> Edition of Windows XP for x86-64 computers, released in 2005

Windows XP Professional x64 Edition is an edition of Microsoft's Windows XP operating system for x86-64 personal computers. It was released on April 25, 2005, alongside the x86-64 versions of Windows Server 2003. It is designed to use the expanded 64-bit memory address space provided by the x86-64 architecture.

Remote Desktop Protocol (RDP) is a proprietary protocol developed by Microsoft Corporation which provides a user with a graphical interface to connect to another computer over a network connection. The user employs RDP client software for this purpose, while the other computer must run RDP server software.

Narrator is a screen reader in Microsoft Windows. Developed by Professor Paul Blenkhorn in 2000, the utility made the Windows operating system more accessible for blind and visually impaired users.

Windows Services for UNIX (SFU) is a discontinued software package produced by Microsoft which provided a Unix environment on Windows NT and some of its immediate successor operating-systems.

<span class="mw-page-title-main">Windows Fundamentals for Legacy PCs</span> Thin client operating system from Microsoft

Windows Fundamentals for Legacy PCs ("WinFLP") is a thin client release of the Windows NT operating system developed by Microsoft and optimized for older, less powerful hardware. It was released on July 8, 2006, nearly two years after its Windows XP SP2 counterpart was released in August 2004, and is not marketed as a full-fledged general purpose operating system, although it is functionally able to perform most of the tasks generally associated with one. It includes only certain functionality for local workloads such as security, management, document viewing related tasks and the .NET Framework. It is designed to work as a client–server solution with RDP clients or other third party clients such as Citrix ICA. Windows Fundamentals for Legacy PCs reached end of support on April 8, 2014 along with most other Windows XP editions.

<span class="mw-page-title-main">Windows Vista</span> Seventh major release of Windows NT

Windows Vista is a major release of the Windows NT operating system developed by Microsoft. It was the direct successor to Windows XP, released five years earlier, which was then the longest time span between successive releases of Microsoft Windows. It was released to manufacturing on November 8, 2006, and over the following two months, it was released in stages to business customers, original equipment manufacturers (OEMs), and retail channels. On January 30, 2007, it was released internationally and was made available for purchase and download from the Windows Marketplace; it is the first release of Windows to be made available through a digital distribution platform.

The Speech Application Programming Interface or SAPI is an API developed by Microsoft to allow the use of speech recognition and speech synthesis within Windows applications. To date, a number of versions of the API have been released, which have shipped either as part of a Speech SDK or as part of the Windows OS itself. Applications that use SAPI include Microsoft Office, Microsoft Agent and Microsoft Speech Server.

<span class="mw-page-title-main">Microsoft Management Console</span> Component of Microsoft Windows

Microsoft Management Console (MMC) is a component of Microsoft Windows that provides system administrators and advanced users an interface for configuring and monitoring the system. It was first introduced in 1998 with the Option Pack for Windows NT 4.0 and later came pre-bundled with Windows 2000 and its successors.

Background Intelligent Transfer Service (BITS) is a component of Microsoft Windows XP and later iterations of the operating systems, which facilitates asynchronous, prioritized, and throttled transfer of files between machines using idle network bandwidth. It is most commonly used by recent versions of Windows Update, Microsoft Update, Windows Server Update Services, and System Center Configuration Manager to deliver software updates to clients, Microsoft's anti-virus scanner Microsoft Security Essentials to fetch signature updates, and is also used by Microsoft's instant messaging products to transfer files. BITS is exposed through the Component Object Model (COM).

<span class="mw-page-title-main">Nokia PC Suite</span> Software to connect mobile devices to a PC

Nokia PC Suite is a discontinued software package used to establish an interface between Nokia mobile devices and computers that run the Microsoft Windows operating system. Its first release was in 1997, originally called Nokia Data Suite. It was replaced by Nokia Suite and integrated into the Ovi service suite.

Resource Kit is a term used by Microsoft for a set of software resources and documentation released for their software products, but which is not part of that product. Resource kits offer supplementary resources such as technical guidance, compatibility and troubleshooting information, management, support, maintenance and deployment guides and multipurpose useful administrative utilities, which are available separately.

Windows Vista has many significant new features compared with previous Microsoft Windows versions, covering most aspects of the operating system.

<span class="mw-page-title-main">Windows Speech Recognition</span> Speech recognition software

Windows Speech Recognition (WSR) is speech recognition developed by Microsoft for Windows Vista that enables voice commands to control the desktop user interface, dictate text in electronic documents and email, navigate websites, perform keyboard shortcuts, and operate the mouse cursor. It supports custom macros to perform additional or supplementary tasks.

Remote Desktop Services (RDS), known as Terminal Services in Windows Server 2008 and earlier, is one of the components of Microsoft Windows that allow a user to initiate and control an interactive session on a remote computer or virtual machine over a network connection.

Windows XP, which is the next version of Windows NT after Windows 2000 and the successor to the consumer-oriented Windows Me, has been released in several editions since its original release in 2001.

<span class="mw-page-title-main">.NET Framework version history</span>

Microsoft started development on the .NET Framework in the late 1990s originally under the name of Next Generation Windows Services (NGWS). By late 2001 the first beta versions of .NET Framework 1.0 were released. The first version of .NET Framework was released on 13 February 2002, bringing managed code to Windows NT 4.0, 98, 2000, ME and XP.

References

  1. 1 2 Speech SDK 5.1
  2. Chambers, Rob (August 29, 2006). "Microsoft Anna - The new TTS voice in Vista". MSDN Blogs. Microsoft. Retrieved June 26, 2015.
  3. "Microsoft Speech Platform". 20 January 2015.
  4. Free text-to-speech (TTS) or speech synthesizers in Microsoft Windows
  5. "Windows 11's Narrator Is Getting Better Voices". How-To Geek. 27 January 2022.