Natural user interface

In computing, a natural user interface (NUI) or natural interface is a user interface that is effectively invisible, and remains invisible as the user continuously learns increasingly complex interactions. The word "natural" is used because most computer interfaces use artificial control devices whose operation has to be learned. Examples include voice assistants such as Alexa and Siri, touch and multi-touch interactions on today's mobile phones and tablets, and touch interfaces invisibly integrated into textiles and furniture.[1]

An NUI relies on a user being able to quickly transition from novice to expert. While the interface requires learning, that learning is eased through design which gives the user the feeling that they are instantly and continuously successful. Thus, "natural" refers to a goal in the user experience – that the interaction comes naturally, while interacting with the technology, rather than that the interface itself is natural. This is contrasted with the idea of an intuitive interface, referring to one that can be used without previous learning.

Several design strategies have been proposed which have met this goal with varying degrees of success. One strategy is the use of a "reality user interface" ("RUI"),[2] also known as a "reality-based interface" (RBI). One example of an RUI strategy is to use a wearable computer to render real-world objects "clickable", i.e. so that the wearer can click on any everyday object to make it function as a hyperlink, thus merging cyberspace and the real world. Because the term "natural" is evocative of the "natural world", RBIs are often confused with NUIs, when in fact they are merely one means of achieving them.
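
As a purely illustrative sketch of the clickable-object idea, the TypeScript fragment below maps labels reported by a hypothetical wearable object recognizer to hyperlinks. None of the names, the registry, or the example labels come from Mann's work; they are assumptions made for illustration only.

```typescript
// Hypothetical sketch of the "clickable real-world object" idea:
// a wearable's object recognizer emits labels, and a registry maps
// each label to a hyperlink that is opened when the wearer "clicks".
// None of these names come from Mann's work; they are illustrative only.

type ObjectLabel = string;

// Registry linking everyday objects to web resources (example entries).
const objectLinks = new Map<ObjectLabel, string>([
  ["coffee_machine", "https://example.com/manuals/coffee-machine"],
  ["bus_stop", "https://example.com/transit/schedule"],
]);

// Called by the (assumed) recognizer whenever the wearer clicks while
// looking at a recognized object.
function onObjectClicked(label: ObjectLabel): void {
  const url = objectLinks.get(label);
  if (url !== undefined) {
    // In a real wearable this would render in the head-mounted display;
    // here we simply report the navigation target.
    console.log(`Opening ${url} for ${label}`);
  } else {
    console.log(`No link registered for ${label}`);
  }
}

// Example: the recognizer reports that the wearer clicked on a coffee machine.
onObjectClicked("coffee_machine");
```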

One example of a strategy for designing a NUI not based in RBI is the strict limiting of functionality and customization, so that users have very little to learn in the operation of a device. Provided that the default capabilities match the user's goals, the interface is effortless to use. This is an overarching design strategy in Apple's iOS.[ citation needed ] Because this design is coincident with a direct-touch display, non-designers commonly misattribute the effortlessness of interacting with the device to that multi-touch display, and not to the design of the software where it actually resides.

History

[Figure: Evolution of user interfaces, from CLI to GUI to NUI]

In the 1990s, Steve Mann developed a number of user-interface strategies using natural interaction with the real world as an alternative to a command-line interface (CLI) or graphical user interface (GUI). Mann referred to this work as "natural user interfaces", "Direct User Interfaces", and "metaphor-free computing".[3] Mann's EyeTap technology is one example of a natural user interface. Mann's use of the word "natural" refers both to action that comes naturally to human users and to the use of nature itself, i.e. physics (natural philosophy) and the natural environment. A good example of an NUI in both these senses is the hydraulophone, especially when it is used as an input device, in which touching a natural element (water) becomes a way of inputting data. More generally, a class of musical instruments called "physiphones", so named from the Greek words "physika", "physikos" (nature) and "phone" (sound), have also been proposed as "nature-based user interfaces".[4]

In 2006, Christian Moore established an open research community with the goal of expanding discussion and development related to NUI technologies.[5] In a 2008 conference presentation, "Predicting the Past", August de los Reyes, a Principal User Experience Director of Surface Computing at Microsoft, described the NUI as the next evolutionary phase following the shift from the CLI to the GUI.[6] This is, of course, an over-simplification, since NUIs necessarily include visual elements, and thus graphical user interfaces. A more accurate description of the concept would be a transition from WIMP to NUI.

In the CLI, users had to learn an artificial means of input, the keyboard, and a series of codified commands that had a limited range of responses and a strict syntax.

Then, when the mouse enabled the GUI, users could more easily learn the mouse movements and actions, and were able to explore the interface much more. The GUI relied on metaphors for interacting with on-screen content or objects. The 'desktop' and 'drag', for example, are metaphors for a visual interface that is ultimately translated back into the strict codified language of the computer.

An example of the misunderstanding of the term NUI was demonstrated at the Consumer Electronics Show in 2010. "Now a new wave of products is poised to bring natural user interfaces, as these methods of controlling electronics devices are called, to an even broader audience." [7]

In 2010, Microsoft's Bill Buxton reiterated the importance of the NUI within Microsoft Corporation with a video discussing technologies which could be used in creating a NUI, and its future potential. [8]

In 2010, Daniel Wigdor and Dennis Wixon provided an operationalization of building natural user interfaces in their book. [9] In it, they carefully distinguish between natural user interfaces, the technologies used to achieve them, and reality-based UI.

Early examples

Multi-touch

When Bill Buxton was asked about the iPhone's interface, he responded "Multi-touch technologies have a long history. To put it in perspective, the original work undertaken by my team was done in 1984, the same year that the first Macintosh computer was released, and we were not the first." [10]

Multi-touch is a technology that could enable a natural user interface. However, most UI toolkits used to construct interfaces for such technology are traditional GUIs.

Examples of interfaces commonly referred to as NUI

Perceptive Pixel

One example is the work done by Jefferson Han on multi-touch interfaces. In a demonstration at TED in 2006, he showed a variety of means of interacting with on-screen content using both direct manipulations and gestures. For example, to shape an on-screen glutinous mass, Han literally 'pinches' and prods and pokes it with his fingers. In a GUI for a design application, by contrast, a user would use the metaphor of 'tools' to do this, for example selecting a prod tool or selecting two parts of the mass to which they then wanted to apply a 'pinch' action. Han showed that user interaction could be much more intuitive by doing away with the interaction devices that we are used to and replacing them with a screen capable of detecting a much wider range of human actions and gestures. Of course, this allows only for a very limited set of interactions which map neatly onto physical manipulation (RBI). Extending the capabilities of the software beyond physical actions requires significantly more design work.
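
The difference between direct manipulation and a tool metaphor can be sketched in code. The minimal TypeScript example below uses the standard browser TouchEvent API to scale an on-screen element directly from a two-finger pinch; it is an assumption-laden illustration, not Han's Perceptive Pixel implementation, and the "canvas" element id is hypothetical.

```typescript
// Minimal sketch of two-finger "pinch" direct manipulation using the
// standard browser TouchEvent API. Illustrative only; not related to
// Han's Perceptive Pixel system. Assumes an element with id "canvas".

const target = document.getElementById("canvas") as HTMLElement;
let startDistance = 0;

// Distance between the first two touch points.
function touchDistance(touches: TouchList): number {
  const dx = touches[0].clientX - touches[1].clientX;
  const dy = touches[0].clientY - touches[1].clientY;
  return Math.hypot(dx, dy);
}

target.addEventListener("touchstart", (e: TouchEvent) => {
  if (e.touches.length === 2) {
    startDistance = touchDistance(e.touches);
  }
});

target.addEventListener(
  "touchmove",
  (e: TouchEvent) => {
    if (e.touches.length === 2 && startDistance > 0) {
      // Fingers moving apart enlarge the object; moving together shrink it,
      // with no intermediate "zoom tool" for the user to select.
      const scale = touchDistance(e.touches) / startDistance;
      target.style.transform = `scale(${scale})`;
      e.preventDefault();
    }
  },
  { passive: false }
);
```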

Microsoft PixelSense

Microsoft PixelSense takes similar ideas on how users interact with content, but adds the ability for the device to optically recognize objects placed on top of it. In this way, users can trigger actions on the computer through the same gestures and motions as Han's touchscreen allowed, but objects also become part of the control mechanisms. For example, when a user places a wine glass on the table, the computer recognizes it as such and displays content associated with that wine glass. Placing a wine glass on a table maps well onto actions taken with both wine glasses and tables, and thus maps well onto reality-based interfaces. It could therefore be seen as an entrée to an NUI experience.
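
A rough sketch of how object recognition might drive on-screen content is given below. The data structure, tag names, and handler are illustrative assumptions rather than the PixelSense/Surface SDK.

```typescript
// Hypothetical sketch of object-triggered content on a vision-based surface.
// The recognizer, tag values, and handler below are illustrative assumptions,
// not the Microsoft PixelSense/Surface SDK.

interface RecognizedObject {
  tag: string; // identity reported by the surface's vision system
  x: number;   // position on the table, in pixels
  y: number;
}

// Content to display next to each kind of recognized object (example entries).
const contentForTag: Record<string, string> = {
  wine_glass: "Tasting notes and pairing suggestions",
  phone: "Photo transfer menu",
};

function onObjectPlaced(obj: RecognizedObject): void {
  const content = contentForTag[obj.tag];
  if (content) {
    // A real implementation would lay out UI around the physical object;
    // here we just log what would be shown and where.
    console.log(`Showing "${content}" at (${obj.x}, ${obj.y})`);
  }
}

// Example: the vision system reports a wine glass placed on the table.
onObjectPlaced({ tag: "wine_glass", x: 420, y: 310 });
```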

3D Immersive Touch

"3D Immersive Touch" is defined as the direct manipulation of 3D virtual environment objects using single or multi-touch surface hardware in multi-user 3D virtual environments. Coined first in 2007 to describe and define the 3D natural user interface learning principles associated with Edusim. Immersive Touch natural user interface now appears to be taking on a broader focus and meaning with the broader adaption of surface and touch driven hardware such as the iPhone, iPod touch, iPad, and a growing list of other hardware. Apple also seems to be taking a keen interest in “Immersive Touch” 3D natural user interfaces over the past few years. This work builds atop the broad academic base which has studied 3D manipulation in virtual reality environments.

Xbox Kinect

Kinect is a motion-sensing input device by Microsoft for the Xbox 360 video game console and Windows PCs that uses spatial gestures for interaction instead of a game controller. According to Microsoft's page, Kinect is designed for "a revolutionary new way to play: no controller required."[11] Again, because Kinect allows the sensing of the physical world, it shows potential for RBI designs, and thus potentially also for NUI.
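
The sketch below illustrates, under stated assumptions, how a spatial gesture might be detected from skeleton-tracking data of the kind a depth sensor produces. The joint names and frame layout are hypothetical and are not the Kinect SDK.

```typescript
// Hypothetical sketch of a spatial gesture check on skeleton-tracking data
// of the kind a depth sensor such as Kinect produces. The joint names and
// frame structure are assumptions for illustration, not the Kinect SDK.

interface Joint {
  x: number;
  y: number; // vertical position in metres; larger = higher
  z: number;
}

interface SkeletonFrame {
  head: Joint;
  rightHand: Joint;
}

// A simple "hand raised" gesture: the right hand is above the head.
function isHandRaised(frame: SkeletonFrame, margin = 0.05): boolean {
  return frame.rightHand.y > frame.head.y + margin;
}

// Example frame as it might arrive from a sensor pipeline.
const frame: SkeletonFrame = {
  head: { x: 0.0, y: 1.6, z: 2.0 },
  rightHand: { x: 0.3, y: 1.75, z: 1.9 },
};

if (isHandRaised(frame)) {
  console.log("Gesture detected: hand raised (e.g., pause the game)");
}
```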

Notes

  1. Brauner, Philipp; van Heek, Julia; Ziefle, Martina; Hamdan, Nur Al-huda; Borchers, Jan (2017-10-17). "Interactive FUrniTURE". Proceedings of the 2017 ACM International Conference on Interactive Surfaces and Spaces. Brighton, United Kingdom: ACM. pp. 151–160. doi:10.1145/3132272.3134128. ISBN 978-1-4503-4691-7. S2CID 10774834.
  2. Reality User Interface (RUI), in the paper of the Closing Keynote Address, entitled "Reconfigured Self as Basis for Humanistic Intelligence", Steve Mann, USENIX-98, New Orleans June 15–19, 1998, Published in: ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference USENIX Association Berkeley, CA, USA ©1998
  3. Mann, Steve (2001). Intelligent Image Processing. John Wiley and Sons.
  4. Natural Interfaces for Musical Expression, Steve Mann, Nime 2007
  5. Moore, Christian (2006-07-15). "New Community Open". NUI Group Community.
  6. de los Reyes, August (2008-09-25). "Predicting the Past". Web Directions South 2008. Sydney Convention Centre: Web Directions.
  7. Wingfield, Nick (2010-01-05). "Body in Motion: CES to Showcase Touch Gizmos". Wall Street Journal.
  8. Buxton, Bill (2010-01-06). "CES 2010: NUI with Bill Buxton". Microsoft Research.
  9. Wigdor, Daniel; Wixon, Dennis (2011). Brave NUI World: Designing Natural User Interfaces for Touch and Gesture. Morgan Kaufmann.
  10. Buxton, Bill. "Multi-Touch Systems that I Have Known and Loved". Bill Buxton.
  11. "Xbox.com Project Natal". Archived from the original on 2009-07-09. Retrieved 2009-08-02.

Related Research Articles

The graphical user interface, or GUI, is a form of user interface that allows users to interact with electronic devices through graphical icons and audio indicators such as primary notation, instead of text-based UIs, typed command labels or text navigation. GUIs were introduced in reaction to the perceived steep learning curve of command-line interfaces (CLIs), which require commands to be typed on a computer keyboard.

<span class="mw-page-title-main">History of the graphical user interface</span>

The history of the graphical user interface, understood as the use of graphic icons and a pointing device to control a computer, covers a five-decade span of incremental refinements, built on some constant core principles. Several vendors have created their own windowing systems based on independent code, but with basic elements in common that define the WIMP "window, icon, menu and pointing device" paradigm.

<span class="mw-page-title-main">Pointing device</span> Human interface device for computers

A pointing device is a human interface device that allows a user to input spatial data to a computer. CAD systems and graphical user interfaces (GUI) allow the user to control and provide data to the computer using physical gestures by moving a hand-held mouse or similar device across the surface of the physical desktop and activating switches on the mouse. Movements of the pointing device are echoed on the screen by movements of the pointer and other visual changes. Common gestures are point and click and drag and drop.

<span class="mw-page-title-main">User interface</span> Means by which a user interacts with and controls a machine

In the industrial design field of human–computer interaction, a user interface (UI) is the space where interactions between humans and machines occur. The goal of this interaction is to allow effective operation and control of the machine from the human end, while the machine simultaneously feeds back information that aids the operators' decision-making process. Examples of this broad concept of user interfaces include the interactive aspects of computer operating systems, hand tools, heavy machinery operator controls and process controls. The design considerations applicable when creating user interfaces are related to, or involve such disciplines as, ergonomics and psychology.

<span class="mw-page-title-main">WIMP (computing)</span> Style of human-computer interaction

In human–computer interaction, WIMP stands for "windows, icons, menus, pointer", denoting a style of interaction using these elements of the user interface. Other expansions are sometimes used, such as substituting "mouse" and "mice" for menus, or "pull-down menu" and "pointing" for pointer.

<span class="mw-page-title-main">Gesture recognition</span> Topic in computer science and language technology

Gesture recognition is an area of research and development in computer science and language technology concerned with the recognition and interpretation of human gestures. A subdiscipline of computer vision, it employs mathematical algorithms to interpret gestures. Gestures can originate from any bodily motion or state, but commonly originate from the face or hand. One area of the field is emotion recognition derived from facial expressions and hand gestures. Users can make simple gestures to control or interact with devices without physically touching them. Many approaches have been made using cameras and computer vision algorithms to interpret sign language; however, the identification and recognition of posture, gait, proxemics, and human behaviors are also subjects of gesture recognition techniques. Gesture recognition is a path for computers to begin to better understand and interpret human body language, previously not possible through text or unenhanced graphical (GUI) user interfaces.

<span class="mw-page-title-main">Tangible user interface</span>

A tangible user interface (TUI) is a user interface in which a person interacts with digital information through the physical environment. The initial name was Graspable User Interface, which is no longer used. The purpose of TUI development is to empower collaboration, learning, and design by giving physical forms to digital information, thus taking advantage of the human ability to grasp and manipulate physical objects and materials.

<span class="mw-page-title-main">Shell (computing)</span> Computer program that exposes an operating systems services to a human user or other programs

In computing, a shell is a computer program that exposes an operating system's services to a human user or other programs. In general, operating system shells use either a command-line interface (CLI) or graphical user interface (GUI), depending on a computer's role and particular operation. It is named a shell because it is the outermost layer around the operating system.

<span class="mw-page-title-main">Multi-touch</span> Technology

In computing, multi-touch is technology that enables a surface to recognize the presence of more than one point of contact with the surface at the same time. The origins of multitouch began at CERN, MIT, University of Toronto, Carnegie Mellon University and Bell Labs in the 1970s. CERN started using multi-touch screens as early as 1976 for the controls of the Super Proton Synchrotron. A form of gesture recognition, capacitive multi-touch displays were popularized by Apple's iPhone in 2007. Plural-point awareness may be used to implement additional functionality, such as pinch to zoom or to activate certain subroutines attached to predefined gestures.

<span class="mw-page-title-main">Microsoft PixelSense</span> Interactive surface computing platform by Microsoft

Microsoft PixelSense was an interactive surface computing platform that allowed one or more people to use and touch real-world objects, and share digital content at the same time. The PixelSense platform consisted of software and hardware products that combined vision-based multitouch PC hardware, 360-degree multiuser application design, and Windows software to create a natural user interface (NUI).

Surface computing is the use of a specialized computer GUI in which traditional GUI elements are replaced by intuitive, everyday objects. Instead of a keyboard and mouse, the user interacts with a surface. Typically the surface is a touch-sensitive screen, though other surface types like non-flat three-dimensional objects have been implemented as well. It has been said that this more closely replicates the familiar hands-on experience of everyday object manipulation.

<span class="mw-page-title-main">Interaction technique</span>

An interaction technique, user interface technique or input technique is a combination of hardware and software elements that provides a way for computer users to accomplish a single task. For example, one can go back to the previously visited page on a Web browser by either clicking a button, pressing a key, performing a mouse gesture or uttering a speech command. It is a widely used term in human-computer interaction. In particular, the term "new interaction technique" is frequently used to introduce a novel user interface design idea.

<span class="mw-page-title-main">Organic user interface</span> Type of user interface

In human–computer interaction, an organic user interface (OUI) is defined as a user interface with a non-flat display. After Engelbart and Sutherland's graphical user interface (GUI), which was based on the cathode ray tube (CRT), and Kay and Weiser's ubiquitous computing, which is based on the flat panel liquid-crystal display (LCD), OUI represents one possible third wave of display interaction paradigms, pertaining to multi-shaped and flexible displays. In an OUI, the display surface is always the focus of interaction, and may actively or passively change shape upon analog inputs. These inputs are provided through direct physical gestures, rather than through indirect point-and-click control. Note that the term "Organic" in OUI was derived from organic architecture, referring to the adoption of natural form to design a better fit with human ecology. The term also alludes to the use of organic electronics for this purpose.

Hands-on computing is a branch of human-computer interaction research which focuses on computer interfaces that respond to human touch or expression, allowing the machine and the user to interact physically. Hands-on computing can make complicated computer tasks more natural to users by attempting to respond to motions and interactions that are natural to human behavior. Thus hands-on computing is a component of user-centered design, focusing on how users physically respond to virtual environments.

In computing, 3D interaction is a form of human–machine interaction in which users are able to move and perform interactions in 3D space. Both the human and the machine process information in which the physical position of elements in 3D space is relevant.

<span class="mw-page-title-main">Input device</span> Device that provides data and signals to a computer

In computing, an input device is a piece of equipment used to provide data and control signals to an information processing system, such as a computer or information appliance. Examples of input devices include keyboards, mice, scanners, cameras, joysticks, and microphones.

A virtual touch screen (VTS) is a user interface system that augments virtual objects into reality either through a projector or optical display using sensors to track a person's interaction with the object. For instance, using a display and a rear projector system a person could create images that look three-dimensional and appear to float in midair. Some systems utilize an optical head-mounted display to augment the virtual objects onto the transparent display utilizing sensors to determine visual and physical interactions with the virtual objects projected.

<span class="mw-page-title-main">PrimeSense</span> Former Israeli company

PrimeSense was an Israeli 3D sensing company based in Tel Aviv. PrimeSense had offices in Israel, North America, Japan, Singapore, Korea, China and Taiwan. PrimeSense was bought by Apple Inc. for $360 million on November 24, 2013.

OpenNI or Open Natural Interaction is an industry-led non-profit organization and open source software project focused on certifying and improving interoperability of natural user interfaces and organic user interfaces for Natural Interaction (NI) devices, applications that use those devices and middleware that facilitates access and use of such devices.

Project Digits is a Microsoft Research project based at Microsoft's computer science laboratory at the University of Cambridge; researchers from Newcastle University and the University of Crete are also involved. The project is led by David Kim of Microsoft Research, who is also a PhD student in computer science at Newcastle University. Digits is an input device that can be mounted on the wrist and that captures and displays a complete 3D graphical representation of the user's hand on screen, without using any external sensing device or hand covering such as a data glove. The project aims to make gesture-controlled interfaces completely hands-free, with greater mobility and accuracy. It allows the user to interact with hardware while moving from room to room or walking down the street, without any line-of-sight connection to the hardware.
