Conversational user interface

Last updated

A conversational user interface (CUI) is a user interface for computers that emulates a conversation with a real human. [1] Historically, computers have relied on text-based user interfaces and graphical user interfaces (GUIs) (such as the user pressing a "back" button) to translate the user's desired action into commands the computer understands. While an effective mechanism of completing computing actions, there is a learning curve for the user associated with GUI. [2] Instead, CUIs provide opportunity for the user to communicate with the computer in their natural language rather than in a syntax specific commands. [3]

Contents

To do this, conversational interfaces use natural language processing (NLP) to allow computers to understand, analyze, and create meaning from human language. [4] Unlike word processors, NLP considers the structure of human language (i.e., words make phrases; phrases make sentences which convey the idea or intent the user is trying to invoke). The ambiguous nature of human language makes it difficult for a machine to always correctly interpret the user's requests, which is why we have seen a shift toward natural-language understanding (NLU). [5]

NLU allows for sentiment analysis and conversational searches which allows a line of questioning to continue, with the context carried throughout the conversation. NLU allows conversational interfaces to handle unstructured inputs that the human brain is able to understand such as spelling mistakes of follow-up questions. [6] For example, through leveraging NLU, a user could first ask for the population of the United States. If the user then asks "Who is the president?", the search will carry forward the context of the United States and provide the appropriate response.

Conversational interfaces have emerged as a tool for businesses to efficiently provide consumers with relevant information, in a cost-effective manner. [7] CUI provide ease of access to relevant, contextual information to the end user without the complexities and learning curve typically associated with technology.

While there are a variety of interface brands, to date, there are two main categories of conversational interfaces; voice assistants and chatbots. [8]

Voice-based interfaces

A voice user interface allows a user to complete an action by speaking a command. Introduced in October 2011, Apple's Siri was one of the first voice assistants widely adopted. Siri allowed users of iPhone to get information and complete actions on their device simply by asking Siri. In the later years, Siri was integrated with Apple's HomePod devices.

Further development has continued since Siri's introduction to include home based devices such as Google Home or Amazon Echo (powered by Alexa) that allow users to "connect" their homes through a series of smart devices to further the options of tangible actions they can complete. Users can now turn off the lights, set reminders and call their friends all with a verbal queue.

These conversational interfaces that utilize a voice assistant have become a popular way for businesses to interact with their customers as the interface removes some friction in a customer journey. Customers no longer need to remember a long list of usernames and passwords to their various accounts; they simply link each account to Google or Amazon once, and gone are the days where you needed to wait on hold for an hour to ask a simple question.

Text-based interfaces

A chatbot is a web- or mobile-based interface that allows the user to ask questions and retrieve information. This information can be generic in nature such as the Google Assistant chat window that allows for internet searches, or it can be a specific brand or service which allows the user to gain information about the status of their various accounts. Their backend systems work in the same manner as a voice assistant, with the front end utilizing a visual interface to convey information. This visual interface can be beneficial for companies that need to do more complex business transactions with customers, as instructions, deep links and graphics can all be utilized to convey an answer. The complexity to which a chatbot answers questions depends on the development of the back end. Chatbots with hard-coded answers has a smaller base on information and corresponding skills. Chatbots that leverage machine learning will continue to grow and develop larger content bases for more complex responses[ citation needed ]. [9]

More frequently, companies are leveraging chatbots as a way to offload simple questions and transactions from human agents. [10] These chatbots provide the option to assist a user, but then directly transfer the customer to a live agent within the same chat window if the conversation becomes too complex, this feature is called Human Handover, chatbot platforms like BotPenguin and other platform offers such features in their chatbots. [11] Chatbots have evolved and have come a long way since their inception. Modern day chatbots have personas which make them sound more human-like.

See also

Related Research Articles

<span class="mw-page-title-main">User interface</span> Means by which a user interacts with and controls a machine

In the industrial design field of human–computer interaction, a user interface (UI) is the space where interactions between humans and machines occur. The goal of this interaction is to allow effective operation and control of the machine from the human end, while the machine simultaneously feeds back information that aids the operators' decision-making process. Examples of this broad concept of user interfaces include the interactive aspects of computer operating systems, hand tools, heavy machinery operator controls and process controls. The design considerations applicable when creating user interfaces are related to, or involve such disciplines as, ergonomics and psychology.

Natural-language understanding (NLU) or natural-language interpretation (NLI) is a subset of natural-language processing in artificial intelligence that deals with machine reading comprehension. Natural-language understanding is considered an AI-hard problem.

<span class="mw-page-title-main">Chatbot</span> Program that simulates conversation

A chatbot is a software application or web interface that is designed to mimic human conversation through text or voice interactions. Modern chatbots are typically online and use generative artificial intelligence systems that are capable of maintaining a conversation with a user in natural language and simulating the way a human would behave as a conversational partner. Such chatbots often use deep learning and natural language processing, but simpler chatbots have existed for decades.

Natural language generation (NLG) is a software process that produces natural language output. A widely-cited survey of NLG methods describes NLG as "the subfield of artificial intelligence and computational linguistics that is concerned with the construction of computer systems than can produce understandable texts in English or other human languages from some underlying non-linguistic representation of information".

<span class="mw-page-title-main">Customer service</span> Provision of service to customers

Customer service is the assistance and advice provided by a company through phone, online chat, and e-mail to those who buy or use its products or services. Each industry requires different levels of customer service, but towards the end, the idea of a well-performed service is that of increasing revenues. The perception of success of the customer service interactions is dependent on employees "who can adjust themselves to the personality of the customer". Customer service is often practiced in a way that reflects the strategies and values of a firm. Good quality customer service is usually measured through customer retention.

A voice-user interface (VUI) enables spoken human interaction with computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

<span class="mw-page-title-main">Dialogue system</span>

A dialogue system, or conversational agent (CA), is a computer system intended to converse with a human. Dialogue systems employed one or more of text, speech, graphics, haptics, gestures, and other modes for communication on both the input and output channel.

A dialog manager (DM) is a component of a dialog system (DS), responsible for the state and flow of the conversation. Usually:

Artificial intelligence marketing (AIM) is a form of marketing that uses artificial intelligence concepts and models such as machine learning, Natural process Languages, and Bayesian Networks to achieve marketing goals. The main difference between AIM and traditional forms of marketing resides in the reasoning, which is performed by a computer algorithm rather than a human.

Natural-language programming (NLP) is an ontology-assisted way of programming in terms of natural-language sentences, e.g. English. A structured document with Content, sections and subsections for explanations of sentences forms a NLP document, which is actually a computer program. Natural language programming is not to be mixed up with natural language interfacing or voice control where a program is first written and then communicated with through natural language using an interface added on. In NLP the functionality of a program is organised only for the definition of the meaning of sentences. For instance, NLP can be used to represent all the knowledge of an autonomous robot. Having done so, its tasks can be scripted by its users so that the robot can execute them autonomously while keeping to prescribed rules of behaviour as determined by the robot's user. Such robots are called transparent robots as their reasoning is transparent to users and this develops trust in robots. Natural language use and natural-language user interfaces include Inform 7, a natural programming language for making interactive fiction, Shakespeare, an esoteric natural programming language in the style of the plays of William Shakespeare, and Wolfram Alpha, a computational knowledge engine, using natural-language input. Some methods for program synthesis are based on natural-language programming.

Natural-language user interface is a type of computer human interface where linguistic phenomena such as verbs, phrases and clauses act as UI controls for creating, selecting and modifying data in software applications.

<span class="mw-page-title-main">Virtual assistant</span> Software agent

A virtual assistant (VA) is a software agent that can perform a range of tasks or services for a user based on user input such as commands or questions, including verbal ones. Such technologies often incorporate chatbot capabilities to simulate human conversation, such as via online chat, to facilitate interaction with their users. The interaction may be via text, graphical interface, or voice - as some virtual assistants are able to interpret human speech and respond via synthesized voices.

<span class="mw-page-title-main">Siri</span> Software-based personal assistant from Apple Inc.

Siri is the digital assistant that is part of Apple Inc.'s iOS, iPadOS, watchOS, macOS, tvOS, audioOS, and visionOS operating systems. It uses voice queries, gesture based control, focus-tracking and a natural-language user interface to answer questions, make recommendations, and perform actions by delegating requests to a set of Internet services. With continued use, it adapts to users' individual language usages, searches, and preferences, returning individualized results.

One Voice Technologies was an Artificial Intelligence (AI) based Natural Language Processing (NLP) company founded in 1998 and based in San Diego, CA. One Voice was the developer of IVAN, an intelligent personal assistant, which commercially launched in 1999. Some of the customers for One Voice Technologies are Telefonos de Mexico, S.A.B. de C.V. (TELMEX), Intel Corporation, the Government of India, Fry's Electronics, Inland Cellular, and Nex-Tec Wireless.

Amazon Lex is a service for building conversational interfaces into any application using voice and text. It powers the Amazon Alexa virtual assistant. In April 2017, the platform was released to the developer community, and suggested that it could be used for conversational interfaces including Web, mobile apps, robots, toys, drones, and more. Amazon already had launched Alexa Voice Services, which developers can use to integrate Alexa into their own devices, like smart speakers, alarm clocks, etc.; however, Lex will not require that end users interact with the Alexa assistant per se, but rather any type of assistant or interface. As of February 2018, users can now define a response for Amazon Lex chatbots directly from the AWS management console.

<span class="mw-page-title-main">Haptik</span> Indian enterprise conversational AI platform

Haptik is an Indian enterprise conversational AI platform founded in August 2013, and acquired by Reliance Industries Limited in 2019. The company develops technology to enable enterprises to build conversational AI systems that allow users to converse with applications and electronic devices in free-format, natural language, using speech or text. The company has been accorded numerous accolades including the Frost & Sullivan Award, NASSCOM's Al Game Changer Award, and serves Fortune 500 brands globally in industries such as financial, insurance, healthcare, technology and communications.

<span class="mw-page-title-main">Semantic parsing</span>

Semantic parsing is the task of converting a natural language utterance to a logical form: a machine-understandable representation of its meaning. Semantic parsing can thus be understood as extracting the precise meaning of an utterance. Applications of semantic parsing include machine translation, question answering, ontology induction, automated reasoning, and code generation. The phrase was first used in the 1970s by Yorick Wilks as the basis for machine translation programs working with only semantic representations. Semantic parsing is one of the important tasks in computational linguistics and natural language processing.

Virtual assistants are software technology that assist users complete various tasks. Well known virtual assistants include Amazon Alexa, and Siri, produced by Apple. Other companies, such as Google and Microsoft, also have virtual assistants. There are privacy issues concerning what information can go to the third party corporations that operate virtual assistants and how this data can potentially be used.

References

  1. "What Are Conversational Interfaces? The Basics". CX Today. 2020-12-11. Retrieved 2021-07-22.
  2. "Conversational Interfaces: Where Are We Today? Where Are We Heading?". Smashing Magazine. 18 July 2016. Retrieved 2018-05-23.
  3. Brownlee, John (2016-04-04). "Conversational Interfaces, Explained". Co.Design. Retrieved 2018-05-23.
  4. Brownlee, Jason (2017-09-21). "What Is Natural Language Processing?". Machine Learning Mastery. Retrieved 2021-07-22.
  5. Pan, Jiaqi (2017-08-25). "Conversational Interfaces: The Future of Chatbots – Chatbots Magazine". Chatbots Magazine. Retrieved 2018-05-23.
  6. Lola (2016-10-05). "NLP vs. NLU: What's the Difference? – Lola – Medium". Medium. Retrieved 2018-05-23.
  7. "Integrating The Best Chatbots With Apps & Assistants". Voicify. Retrieved 2021-07-22.
  8. "Conversational User Interfaces: In-depth Guide [2021 update]". research.aimultiple.com. 2018-03-01. Retrieved 2021-07-22.
  9. "The History of Chatbots - from ELIZA to Alexa". 12 October 2017.
  10. "What is a Chatbot and How Does it Work?". SearchCustomerExperience. Retrieved 2021-07-22.
  11. "BotPenguin".