Voice-directed warehousing

Last updated

Voice-directed warehousing (VDW) refers to the use of the voice direction and speech recognition software in warehouses and distribution centers. VDW has been in use since the late 1990s, with its use increasing drastically since. [1]

Contents

In a voice directed warehouse, workers wear a headset connected to a small wearable computer, similar in size to a Sony Walkman, which tells the worker where to go and what to do using verbal prompts. Workers confirm their tasks by speaking pre-defined commands and reading confirmation codes printed on locations or products throughout the warehouse. The speech recognition software running on the wearable computer 'understands' the workers' responses. [2]

Voice-directed warehousing is typically used instead of paper- or mobile computer-based systems that require workers to read instructions and scan barcodes or key-enter information to confirm their tasks. By freeing a worker's hands and eyes, voice directed systems typically improve efficiency, accuracy, and safety. [3] Whilst VDW was originally used in picking orders, now all warehouse functions such as goods receiving, put-away, replenishment, shipping, and returns processing can be coordinated by voice systems.

Overview

The first incarnations of voice directed warehousing were implemented in distribution centers in the early 1990s. Since then, voice has changed dramatically. Most notably, the technology was originally limited to picking, [4] whereas now all warehouse functions (picking, receiving/put-away, replenishment, shipping) can be coordinated by voice systems. As these processes move from being paper-centric, to RF-centric (barcode scanning) and now voice-centric. For some, voice has become the starting point for re-engineering warehouse processes and systems, rather than an after-thought.

VDW technology has also undergone an evolution as more competitors have entered the market. [5] The first solutions were based on dedicated and rugged voice appliances, mobile computing devices that ran the speech recognition software and that communicated with a server over a wireless network. These special purpose voice appliances use a speech recognition engine that was specially designed for the warehouse and provided by the appliance manufacturer. Since the early 2000s, more voice suppliers have entered the market, providing voice recognition systems for standard mobile computing devices that had been used previously for barcode scanning applications in the warehouse. These standard mobile computers from companies like Motorola, Intermec and LXE also support non-proprietary recognition software. [6] The uncoupling of the hardware and speech recognition software has resulted in lower priced voice-directed warehousing solutions and an increase in the number of software providers. These two factors contributed to a rapid rise in adoption of VDW that continues today.

Potential benefits

Implementing voice systems in the warehouse has among its benefits: [7]

One of the great productivity benefits of voice-based systems is that they allow operators to do two things at once whereas other media used in warehouses such as paper or radio frequency terminals tend to require that you surrender the use of at least one hand or you have to stop and read something before proceeding. This productivity benefit tends to be inversely related to the fraction of an operator's day spent walking. Reduced picking errors and flexibility of the voice picking technology tend to improve customer service to meet just-in-time order fulfillment. Some robust systems have independent language recognition engines hence they can be used by non-native employees. [8]

Related Research Articles

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

<span class="mw-page-title-main">Optical character recognition</span> Computer recognition of visual text

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo or from subtitle text superimposed on an image.

Interactive voice response (IVR) is a technology that allows telephone users to interact with a computer-operated telephone system through the use of voice and DTMF tones input with a keypad. In telecommunications, IVR allows customers to interact with a company's host system via a telephone keypad or by speech recognition, after which services can be inquired about through the IVR dialogue. IVR systems can respond with pre-recorded or dynamically generated audio to further direct users on how to proceed. IVR systems deployed in the network are sized to handle large call volumes and also used for outbound calling as IVR systems are more intelligent than many predictive dialer systems.

<span class="mw-page-title-main">Logistics automation</span> Application of computer software or automated machinery

Logistics automation is the application of computer software or automated machinery to improve the efficiency of logistics operations. Typically this refers to operations within a warehouse or distribution center, with broader tasks undertaken by supply chain engineering systems and enterprise resource planning systems.

<span class="mw-page-title-main">Warehouse</span> Building for storing goods and giving services

A warehouse is a building for storing goods. Warehouses are used by manufacturers, importers, exporters, wholesalers, transport businesses, customs, etc. They are usually large plain buildings in industrial parks on the outskirts of cities, towns, or villages.

A voice-user interface (VUI) makes spoken human interaction with using computers, using speech recognition to understand spoken commands and answer questions, and typically text to speech to play a reply. A voice command device is a device controlled with a voice user interface.

<span class="mw-page-title-main">Tracking system</span>

A tracking system, also known as a locating system, is used for the observing of persons or objects on the move and supplying a timely ordered sequence of location data for further processing.

<span class="mw-page-title-main">Automated storage and retrieval system</span> Robotic warehouse for physical objects

An automated storage and retrieval system consists of a variety of computer-controlled systems for automatically placing and retrieving loads from defined storage locations. Automated storage and retrieval systems (AS/RS) are typically used in applications where:

<span class="mw-page-title-main">Intermec</span> American electronics manufacturing company

Intermec is a manufacturer and supplier of automated identification and data capture equipment, including barcode scanners, barcode printers, mobile computers, RFID systems, voice recognition systems, and life cycle services.

<span class="mw-page-title-main">Dictation machine</span> Device for recording human speech

A dictation machine is a sound recording device most commonly used to record speech for playback or to be typed into print. It includes digital voice recorders and tape recorder.

As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.

Mobile tagging is the process of providing data read from tags for display on mobile devices, commonly encoded in a two-dimensional barcode, using the camera of a camera phone as the reader device. The contents of the tag code is usually a URL for information addressed and accessible through Internet.

Mobile translation is any electronic device or software application that provides audio translation. The concept includes any handheld electronic device that is specifically designed for audio translation. It also includes any machine translation service or software application for hand-held devices, including mobile telephones, Pocket PCs, and PDAs. Mobile translation provides hand-held device users with the advantage of instantaneous and non-mediated translation from one human language to another, usually against a service fee that is, nevertheless, significantly smaller than a human translator charges.

Asset tracking refers to the method of tracking physical assets, either by scanning barcode labels attached to the assets or by using tags using GPS, BLE, LoRa, or RFID which broadcast their location. These technologies can also be used for indoor tracking of persons wearing a tag.

Distribution Center Management System (DCMS) is a user friendly Warehouse Management System (WMS), designed to track the activities performed in a distribution center (DC)/warehouse. It is created and owned by a private company called Eclipse Systems Pvt Ltd. It automates the entire process flow of receiving, managing and shipping goods to customers from the warehouse. DCMS solutions are designed for both large and small scale businesses. In January 2015, the product went open source.

Inventory management software is a software system for tracking inventory levels, orders, sales and deliveries. It can also be used in the manufacturing industry to create a work order, bill of materials and other production-related documents. Companies use inventory management software to avoid product overstock and outages. It is a tool for organizing inventory data that before was generally stored in hard-copy form or in spreadsheets.

<span class="mw-page-title-main">Golden-i</span>

The Golden-i platform consists of multiple mobile wireless wearable headset computers operated by voice commands and head movements. It was developed at Kopin Corporation by a team led by Jeffrey Jacobsen, chief Golden-i architect and senior ddvisor to the CEO. Utilizing a speech controlled user interface and head-tracking functionality, Golden-i enables the user to carry out common computer functions whilst keeping their hands free.

Warehouse execution systems (WES) are computerized systems used in warehouses and distribution centers to manage and orchestrate the physical flow of products from receiving through shipping. Warehouses are storage facilities for raw materials and parts used in manufacturing operations; distribution centers (DCs) are facilities that store and distribute finished goods to retail locations, consumers, and other end customers.

Scandit AG, commonly referred to as Scandit, is a Swiss technology company that provides smart data capture software. Their technology allows any smart device equipped with a camera to scan barcodes, IDs and text and to perform additional functions using augmented reality and advanced analytics.

Smart data capture (SDC), also known as 'intelligent data capture' or 'automated data capture', describes the branch of technology concerned with using computer vision techniques like optical character recognition (OCR), barcode scanning, object recognition and other similar technologies to extract and process information from semi-structured and unstructured data sources. IDC characterize smart data capture as an integrated hardware, software, and connectivity strategy to help organizations enable the capture of data in an efficient, repeatable, scalable, and future-proof way. Data is captured visually from barcodes, text, IDs and other objects - often from many sources simultaneously - before being converted and prepared for digital use, typically by artificial intelligence-powered software. An important feature of SDC is that it focuses not just on capturing data more efficiently but serving up easy-to-access, actionable insights at the instant of data collection to both frontline and desk-based workers, aiding decision-making and making it a two-way process.

References

  1. "Voice Technology Up Sharply by 2015". April 1, 2011. Archived from the original on July 27, 2011. Retrieved May 20, 2011.
  2. "Just what the doctor ordered". DCV TV. 1 May 2009. Retrieved 13 December 2012.
  3. "Voice in the Warehouse". Manufacturing & Logistics IT. 2010. Retrieved 5 April 2011.
  4. "Voice Technology Solutions". BCP Software. 2010. Retrieved 8 July 2016.
  5. "Order Picking in the 21st Century". Vocollect. 2016. Retrieved 8 July 2016.
  6. "Voice picking module using Android". wms.com.es. Retrieved 2017-08-04.
  7. "Voice Picking Introduction". Speech Interface Design. 2016. Retrieved 8 July 2016.
  8. "Smart Technologies for E-commerce Fulfillment | SIPMM Publications". publication.sipmm.edu.sg. 18 January 2021. Retrieved 2022-07-29.