Visual Cloud

Last updated December 22, 2024

Visual Cloud is the implementation of visual computing applications that rely on cloud computing architectures, cloud scale processing and storage, and ubiquitous broadband connectivity between connected devices, network edge devices and cloud data centers. It is a model for providing visual computing services to consumers and business users, while allowing service providers to realize the general benefits of cloud computing, such as low cost, elastic scalability, and high availability while providing optimized infrastructure for visual computing application requirements.

History and overview

The rise of cloud computing was enabled by a convergence of powerful, low-cost computer hardware, high-capacity networks, and advances in hardware virtualization. To satisfy high consumer demand for visually-based entertainment such as video and gaming, as well as online social interaction, service providers began to deploy visually oriented applications in centralized data centers and use distributed content delivery networks to make that content accessible to their users.

Mobile consumption of video content in particular makes cloud delivery of video attractive, because remote processing and storage can compensate for the limitations of mobile devices. As much as 75% of the world's mobile data traffic is expected to be video by 2020.^[1]

The first generation of visual cloud technologies were mostly oriented around streaming media applications. The mid-2000s saw the introduction of professional and user generated video-on-demand services like Netflix and YouTube, multiplayer online games (MOGs) like Call of Duty and massively multiplayer online games (MMOGs) like World of Warcraft. Another common usage of visual cloud that emerged during this timeframe is desktop virtualization based on remote desktop instances that are hosted using cloud infrastructure.

As visual cloud technology has become more capable, more demanding usages have begun to emerge, such as the use of visual cloud for virtual reality, augmented reality, 3D scene understanding and interactivity, and immersive live experiences.^[2] Visual cloud applications can be roughly divided into four primary domains:

Media content creation and delivery
Cloud graphics
Media analytics
Immersive media

Media content creation and delivery

The overall amount of video being delivered throughout the world is increasing significantly, as new sources develop. Processing and distribution of that content may increasingly be addressed by means of the visual cloud. Sources of that content include applications in cloud, communications, media/entertainment, and enterprise environments. Global mobile data traffic is forecast to increase nearly 7x between 2016 and 2021.^[3] There are three primary models of content distribution.

Broadcasting of linear, live, and on-demand content by traditional communications service providers such as Comcast and DirecTV. This content has typically been consumed on televisions. The visual cloud trends in this model include cloud-based DVRs and virtual set-top boxes that enable broadcast content to be watched on other devices.
Over the Top: Video on Demand (VOD) of professional content from cloud media companies such as Amazon Video and Netflix and user-generated content hosted on platforms like Netflix. Over the top content refers to audio/visual content that is transmitted directly to end users via the Internet, without relying on a communications service provider for control or distribution.
Over the Top: Live streaming of content on video platforms such as Twitch, Facebook live streaming, WatchESPN and SlingTV that is distributed by private cloud or public cloud.

Compute-intensive visual workloads in the media content creation and delivery segment include media processing (e.g., compression and transcoding), enhancement, restoration, and compositing. As this content is stored in data centers and ultimately transmitted to end-users, workloads such as these (and many others) may be applied to the content, with factors such as bit rate and resolution tailored to match the transmission medium and capabilities of the end-consumer device

Cloud Graphics

Interactive 3D (e.g., virtual desktop infrastructure) and batch rendering (e.g., Renderman) operations may be performed at scale in visual cloud usages, where the user is remote from the site of the rendering operations. Example usages in this domain include the following:

Remote desktops allow end-user virtualized computing environments to be centrally hosted, stored and managed in the cloud, for content access with consistent user experiences on multiple types of devices, including tablets and phablets with limited footprints. Examples of remote desktop applications include Citrix, VMware, and Xen.
Remote batch rendering enables resource-intensive graphics processing to be done using either public or private cloud resources, or a hybrid combination of the two. This approach is particularly valuable for on-demand usages that may have large peak workloads, such as in the final stages of production for animated films. Render farms operated by Pixar and LucasFilm are established examples of this usage model.^[4]
Cloud game streaming stores, executes, and renders the game itself in the cloud, transmitting an encoded video stream to user PCs, consoles, or other devices, where the game video is displayed. Controller and keystroke signals are transmitted back to the cloud. Early pioneers in this space included OnLive and Gaikai (both acquired by Sony). Hardware providers such as Sony and NVIDIA and smaller companies like GameFly, GameStream, and PlayGiga are developing cloud gaming products and services today.
Compute-intensive visual workloads in the cloud graphics segment include computer graphics technologies such as ray and raster rendering, 3D design, 3D modeling, and visual simulation. The graphics workload is manipulated in the cloud, with final rendering to the client device. Examples of this kind of workload include Petrel (a software platform used in petroleum exploration and production) and Autodesk 3ds Max (a computer graphics application for making 3D animations).

The varying requirements for the scale of performance and density among these workloads have implications for the cloud resources that optimally support them. For example, a visual cloud infrastructure to support remote desktops would likely be configured with the goal of supporting the greatest practical number of desktop instances per server. Cloud game streaming, on the other hand, requires far greater attention to meeting peak graphics performance, likely requiring lower density per server. While both those interactive usages are also highly latency sensitive, remote batch rendering values time to completion, with latency playing a far less important role.

Media analytics

Computations based on media in the visual cloud can be used to manipulate or provide deeper understanding of the media content itself, as well as to provide insights based on how users interact with it. Media analytics treats visual information as unstructured data to be processed and fed into analytics engines for interpretation of images, audio, or video to implement usages such as web visual search, autonomous transportation, surveillance, smart cities, and robotics. Visual computing technologies in the media analytics segment include three subdomains:

Media processing technologies required to prepare visual content for analysis include transcoding, decoding, enhancement, restoration, edge detection, and segmentation.
Content analysis consists of capabilities such as object detection and recognition, event detection and recognition, and scene understanding.
Media analytics produces metrics based on performance and usage factors related to video or other media; measurements of video quality and audience behavior are common examples.^[5]

Media analytics often makes use of “deep learning” frameworks, which involve training an algorithm using large amounts of source data. The training portion of this approach typically takes place over an extended period of time and involves teaching the algorithm by mapping large amounts of input data to specific output classifications. The resulting trained algorithm can then make rapid or instantaneous interpretations of new input data based on rules developed during the training stage.

Immersive media

Making use of the capabilities in the three usage areas described above (i.e., media content creation and delivery, cloud graphics, and media analytics), visual data can be manipulated based on its contents to support emerging usages such as live panoramic video and augmented or virtual reality (VR). Immersive reality gaming, for example, builds a game experience on top of the physical surroundings of the player, in real time. These experiences can be consumed on purpose-built displays such as VR head-mounted displays or on conventional devices such as PCs, tablets, or phones.

Some of the more visible usages of these technologies include Google Street View and Pokémon Go. Other commercially available examples of immersive media include Voke VR live streaming, FreeD ^[usurped] virtual replay technology, Facebook 360° photos, and Oculus Rift and Microsoft Hololens VR goggles.

Most usages in the immersive media segment require compute-intensive scene analysis, which must often be performed in real time or near-real time. As with all visual cloud applications, workloads will be distributed between end devices and the cloud. For example, head-mounted display rendering might be done locally to the user to minimize latency, but live VR content distribution could be done predominantly from the cloud.

Related Research Articles

Multimedia refers to the integration of multiple forms of content, such as text, audio, images, video, and interactive elements into a single digital platform or application. This integration allows for a more immersive and engaging experience compared to traditional single-medium content. Multimedia is utilized in various fields, including education, entertainment, communication, game design, and digital art, reflecting its broad impact on modern technology and media.

<span class="mw-page-title-main">Thin client</span> Non-powerful computer optimized for remote server access

In computer networking, a thin client, sometimes called slim client or lean client, is a simple (low-performance) computer that has been optimized for establishing a remote connection with a server-based computing environment. They are sometimes known as network computers, or in their simplest form as zero clients. The server does most of the work, which can include launching software programs, performing calculations, and storing data. This contrasts with a rich client or a conventional personal computer; the former is also intended for working in a client–server model but has significant local processing power, while the latter aims to perform its function mostly locally.

Virtual reality (VR) is a simulated experience that employs 3D near-eye displays and pose tracking to give the user an immersive feel of a virtual world. Applications of virtual reality include entertainment, education and business. VR is one of the key technologies in the reality-virtuality continuum. As such, it is different from other digital visualization solutions, such as augmented virtuality and augmented reality.

Application software is any computer program that is intended for end-user use – not operating, administering or programming the computer. An application is any program that can be categorized as application software. Common types of applications include word processor, media player and accounting software.

A virtual environment is a networked application that allows a user to interact with both the computing environment and the work of other users. Email, chat, and web-based document sharing applications are all examples of virtual environments. Simply put, it is a networked common operating space. Once the fidelity of the virtual environment is such that it "creates a psychological state in which the individual perceives himself or herself as existing within the virtual environment" then the virtual environment (VE) has progressed into the realm of immersive virtual environments (IVEs).

Visualization, also known as Graphics Visualization, is any technique for creating images, diagrams, or animations to communicate a message. Visualization through visual imagery has been an effective way to communicate both abstract and concrete ideas since the dawn of humanity. from history include cave paintings, Egyptian hieroglyphs, Greek geometry, and Leonardo da Vinci's revolutionary methods of technical drawing for engineering purposes that actively involve scientific requirements.

A Rich Internet Application is a web application that has many of the characteristics of desktop application software. The concept is closely related to a single-page application, and may allow the user interactive features such as drag and drop, background menu, WYSIWYG editing, etc. The concept was first introduced in 2002 by Macromedia to describe Macromedia Flash MX product. Throughout the 2000s, the term was generalized to describe browser-based applications developed with other competing browser plugin technologies including Java applets, Microsoft Silverlight.

Desktop Window Manager is the compositing window manager in Microsoft Windows since Windows Vista that enables the use of hardware acceleration to render the graphical user interface of Windows.

VirtualGL (VGL) is an open-source software package that redirects the 3D rendering commands from Unix and Linux OpenGL applications to 3D accelerator hardware in a dedicated server and sends the rendered output to a (thin) client located elsewhere on the network. On the server side, VirtualGL consists of a library that handles the redirection and a wrapper program that instructs applications to use this library. Clients can connect to the server either using a remote X11 connection or using an X11 proxy such as a Virtual Network Computing (VNC) server. In case of an X11 connection some client-side VirtualGL software is also needed to receive the rendered graphics output separately from the X11 stream. In case of a VNC connection no specific client-side software is needed other than the VNC client itself.

Desktop virtualization is a software technology that separates the desktop environment and associated application software from the physical client device that is used to access it.

In computing, 3D interaction is a form of human-machine interaction where users are able to move and perform interaction in 3D space. Both human and machine process information where the physical position of elements in the 3D space is relevant.

Microsoft RemoteFX is a Microsoft brand name that covers a set of technologies that enhance visual experience of the Microsoft-developed remote display protocol Remote Desktop Protocol (RDP). RemoteFX was first introduced in Windows Server 2008 R2 SP1 and is based on intellectual property that Microsoft acquired and continued to develop since acquiring Calista Technologies. It is a part of the overall Remote Desktop Services workload.

VideoOverIP is a remote desktop protocol developed by Texas-based, desktop virtualization and cloud computing company VDIworks.

GPU virtualization refers to technologies that allow the use of a GPU to accelerate graphics or GPGPU applications running on a virtual machine. GPU virtualization is used in various applications such as desktop virtualization, cloud gaming and computational science.

Computation offloading is the transfer of resource intensive computational tasks to a separate processor, such as a hardware accelerator, or an external platform, such as a cluster, grid, or a cloud. Offloading to a coprocessor can be used to accelerate applications including: image rendering and mathematical calculations. Offloading computing to an external platform over a network can provide computing power and overcome hardware limitations of a device, such as limited computational power, storage, and energy.

Visual computing is a generic term for all computer science disciplines dealing with the 3D modeling of graphical requirements, for which extenuates to all disciplines of the Computational Sciences. While this is directly relevant to the software visualistics of Microservices, Visual Computing also includes the specializations of the subfields that are called Computer Graphics, Image Processing, Visualization, Computer Vision, Computational Imaging, Augmented Reality, and Video Processing, upon which extenuates into Design Computation. Visual computing also includes aspects of Pattern Recognition, Human-Computer Interaction, Machine Learning, Robotics, Computer Simulation, Steganography, Security Visualization, Spatial Analysis, Computational Visualistics, and Computational Creativity. The core challenges are the acquisition, processing, analysis and rendering of visual information. Application areas include industrial quality control, medical image processing and visualization, surveying, multimedia systems, virtual heritage, special effects in movies and television, and ultimately computer games, which is central towards the visual models of User Experience Design. Conclusively, this includes the extenuations of large language models (LLM) that are in Generative Artificial Intelligence for developing research around the simulations of scientific instruments in the Computational Sciences. This is especially the case with the research simulations that are between Embodied Agents and Generative Artificial Intelligence that is designed for Visual Computation. Therefore, this field also extenuates into the diversity of scientific requirements that are addressed through the visualized technologies of interconnected research in the Computational Sciences.

Remote mobile virtualization, like its counterpart desktop virtualization, is a technology that separates operating systems and applications from the client devices that access them. However, while desktop virtualization allows users to remotely access Windows desktops and applications, remote mobile virtualization offers remote access to mobile operating systems such as Android.

<span class="mw-page-title-main">Kubity</span> Cloud-based 3D communication tool

Kubity is a cloud-based 3D communication tool that works on desktop computers, the web, smartphones, tablets, augmented reality gear, and virtual reality glasses. Kubity is powered by several proprietary 3D processing engines including "Paragone" and "Etna" that prepare the 3D file for transfer over mobile devices.

A virtual reality game or VR game is a video game played on virtual reality (VR) hardware. Most VR games are based on player immersion, typically through a head-mounted display unit or headset with stereoscopic displays and one or more controllers.

Volumetric capture or volumetric video is a technique that captures a three-dimensional space, such as a location or performance. This type of volumography acquires data that can be viewed on flat screens as well as using 3D displays and VR headset. Consumer-facing formats are numerous and the required motion capture techniques lean on computer graphics, photogrammetry, and other computation-based methods. The viewer generally experiences the result in a real-time engine and has direct input in exploring the generated volume.

References

↑ "Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2016–2021 White Paper". Cisco. Retrieved 22 February 2017.
↑ Blakley, Jim. "The Visual Cloud Second Wave". IT Peer Network. Intel. Retrieved 22 February 2017.
↑ "Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2016–2021 White Paper". Cisco.com. Cisco. Retrieved 22 February 2017.
↑ Sciretta, Peter. "Cool Stuff: A Look at Pixar and LucasFilm's Renderfarms". SlashFilm. Retrieved 22 February 2017.
↑ "Media Analytics Product Brief" (PDF). Akamai.com. Akamai Technologies. Retrieved 22 February 2017.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] "Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2016–2021 White Paper". Cisco. Retrieved 22 February 2017.

[2] Blakley, Jim. "The Visual Cloud Second Wave". IT Peer Network. Intel. Retrieved 22 February 2017.

[3] "Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2016–2021 White Paper". Cisco.com. Cisco. Retrieved 22 February 2017.

[4] Sciretta, Peter. "Cool Stuff: A Look at Pixar and LucasFilm's Renderfarms". SlashFilm. Retrieved 22 February 2017.

[5] "Media Analytics Product Brief" (PDF). Akamai.com. Akamai Technologies. Retrieved 22 February 2017.

[1]

[2]

[3]

[4]

[5]