Waifu2x

waifu2x
Original author(s)	nagadomi
Initial release	October 11, 2015; 9 years ago
Stable release	v0.13.2 / November 18, 2018; 6 years ago
Repository	github.com/nagadomi/waifu2x
Written in	Lua
Operating system	Linux with CUDA support
License	MIT License
Website	www.waifu2x.net

Last updated January 30, 2025

waifu2x is an image scaling and noise reduction program for anime-style art and other kinds of images, such as photos.^[1]

Etymology

Waifu (from the Japanese pronunciation of "wife") is anime slang for a female character to whom one is attracted. 2x means two-times magnification.

Example

Left: 512×512px, PNG lossless, original source Wikipe-tan face.svg
Middle: 256×256px, JPG quality 30%, then nearest-neighbor interpolation
Right: 512×512px, waifu2x upscaling & noise reduction
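
The middle panel's baseline, nearest-neighbor interpolation, simply repeats each source pixel, which is why it looks blocky next to the waifu2x result. The sketch below illustrates that baseline only; it is not waifu2x's method. It assumes the image is a plain list of rows of pixel values, and the function name upscale_nearest is hypothetical.

```python
def upscale_nearest(pixels, factor=2):
    """Enlarge a 2D grid of pixel values by repeating each pixel.

    `pixels` is assumed to be a list of equally long rows; this is an
    illustrative baseline, not the algorithm waifu2x uses.
    """
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    out = []
    for row in pixels:
        # Repeat every pixel horizontally `factor` times.
        wide = [value for value in row for _ in range(factor)]
        # Repeat the widened row vertically `factor` times.
        out.extend(list(wide) for _ in range(factor))
    return out


# A 2x2 checkerboard becomes a 4x4 image made of 2x2 blocks.
small = [[0, 255],
         [255, 0]]
big = upscale_nearest(small)
for row in big:
    print(row)
```

With factor=2, a 256×256 input yields a 512×512 output, matching the sizes in the caption. No new detail is created: every output pixel is a copy of an input pixel.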

Related Research Articles

OpenGL is a cross-language, cross-platform application programming interface (API) for rendering 2D and 3D vector graphics. The API is typically used to interact with a graphics processing unit (GPU), to achieve hardware-accelerated rendering.

A graphics processing unit (GPU) is a specialized electronic circuit initially designed for digital image processing and to accelerate computer graphics, being present either as a discrete video card or embedded on motherboards, mobile phones, personal computers, workstations, and game consoles. After their initial design, GPUs were found to be useful for non-graphic calculations involving embarrassingly parallel problems due to their parallel structure. The ability of GPUs to perform vast numbers of calculations rapidly has led to their adoption in diverse fields including artificial intelligence, where they excel at handling data-intensive and computationally demanding tasks. Other non-graphical uses include the training of neural networks and cryptocurrency mining.

hqx is a set of three image upscaling algorithms developed by Maxim Stepin. The algorithms are hq2x, hq3x, and hq4x, which magnify by a factor of 2, 3, and 4 respectively. It was initially created in 2003 for the Super NES emulator ZSNES, and is used in emulators such as Nestopia, FCEUX, and Snes9x.

Mesa, also called Mesa3D and The Mesa 3D Graphics Library, is an open source implementation of OpenGL, Vulkan, and other graphics API specifications. Mesa translates these specifications to vendor-specific graphics hardware drivers.

Pixel art scaling algorithms are graphical filters that attempt to enhance the appearance of hand-drawn 2D pixel art graphics. These algorithms are a form of automatic image enhancement. Pixel art scaling algorithms employ methods significantly different than the common methods of image rescaling, which have the goal of preserving the appearance of images.

In computer graphics and digital imaging, image scaling refers to the resizing of a digital image. In video technology, the magnification of digital material is known as upscaling or resolution enhancement.

In computing, CUDA is a proprietary parallel computing platform and application programming interface (API) that allows software to use certain types of graphics processing units (GPUs) for accelerated general-purpose processing, an approach called general-purpose computing on GPUs. CUDA was created by Nvidia in 2006. When it was first introduced, the name was an acronym for Compute Unified Device Architecture, but Nvidia later dropped the common use of the acronym and now rarely expands it.

AMD Software is a device driver and utility software package for AMD's Radeon graphics cards and APUs. Its graphical user interface is built with Qt and is compatible with 64-bit Windows and Linux distributions.

OpenCL is a framework for writing programs that execute across heterogeneous platforms consisting of central processing units (CPUs), graphics processing units (GPUs), digital signal processors (DSPs), field-programmable gate arrays (FPGAs) and other processors or hardware accelerators. OpenCL specifies a programming language for programming these devices and application programming interfaces (APIs) to control the platform and execute programs on the compute devices. OpenCL provides a standard interface for parallel computing using task- and data-based parallelism.

In computing, half precision is a binary floating-point computer number format that occupies 16 bits in computer memory. It is intended for storage of floating-point values in applications where higher precision is not essential, in particular image processing and neural networks.

Mantle was a low-overhead rendering API targeted at 3D video games. AMD originally developed Mantle in cooperation with DICE, starting in 2013. Mantle was designed as an alternative to Direct3D and OpenGL, primarily for use on personal computers. In 2015, Mantle's public development was suspended and in 2019 completely discontinued, as DirectX 12 and the Mantle-derived Vulkan rose in popularity.

Vulkan is a low-level, low-overhead cross-platform API and open standard for 3D graphics and computing. It was intended to address the shortcomings of OpenGL, and allow developers more control over the GPU. It is designed to support a wide variety of GPUs, CPUs and operating systems, and it is also designed to work with modern multi-core CPUs.

GPUOpen is a middleware software suite originally developed by AMD's Radeon Technologies Group that offers advanced visual effects for computer games. It was released in 2016. GPUOpen serves as an alternative to, and a direct competitor of Nvidia GameWorks. GPUOpen is similar to GameWorks in that it encompasses several different graphics technologies as its main components that were previously independent and separate from one another. However, GPUOpen is partially open source software, unlike GameWorks which is proprietary and closed.

SYCL is a higher-level programming model to improve programming productivity on various hardware accelerators. It is a single-source embedded domain-specific language (eDSL) based on pure C++17. It is a standard developed by Khronos Group, announced in March 2014.

Caffe is a deep learning framework, originally developed at University of California, Berkeley. It is open source, under a BSD license. It is written in C++, with a Python interface.

SqueezeNet is a deep neural network for image classification released in 2016. SqueezeNet was developed by researchers at DeepScale, University of California, Berkeley, and Stanford University. In designing SqueezeNet, the authors' goal was to create a smaller neural network with fewer parameters while achieving competitive accuracy. Their best-performing model achieved the same accuracy as AlexNet on ImageNet classification while being 510 times smaller.

The Style Generative Adversarial Network, or StyleGAN for short, is an extension to the GAN architecture introduced by Nvidia researchers in December 2018; its source code was made available in February 2019.

Deep learning super sampling (DLSS) is a family of real-time deep learning image enhancement and upscaling technologies developed by Nvidia that are available in a number of video games. The goal of these technologies is to allow the majority of the graphics pipeline to run at a lower resolution for increased performance, and then infer a higher resolution image from this that approximates the same level of detail as if the image had been rendered at this higher resolution. This allows for higher graphical settings and/or frame rates for a given output resolution, depending on user preference.

Intel Arc is a brand of graphics processing units designed by Intel. These are discrete GPUs mostly marketed for the high-margin gaming PC market. The brand also covers Intel's consumer graphics software and services.

Deep learning anti-aliasing (DLAA) is a form of spatial anti-aliasing created by Nvidia. DLAA depends on and requires Tensor Cores available in Nvidia RTX cards.

References

[1] "Amplía la resolución de tus imágenes con este portal web". TekCrispy (in Spanish). 2019-01-17. Archived from the original on 2019-01-21. Retrieved 2019-01-21.
[2] "GitHub - nagadomi/Waifu2x: Image Super-Resolution for Anime-Style Art". GitHub. April 2020.
[3] Dong, C.; Loy, C. C.; He, K.; et al. "Image super-resolution using deep convolutional networks". IEEE Transactions on Pattern Analysis and Machine Intelligence. 2016, 38 (2): 295–307.
[4] "Even better image upscaling with Waifu2x". Fedora Magazine. 2018-10-02. Retrieved 2019-01-21.
[5] "GitHub - marcan/Cl-waifu2x: OpenCL implementation of waifu2x image upscaling". GitHub. 25 March 2020.
[6] "Waifu2x converter NCNN version, runs fast on intel / Amd / Nvidia GPU with vulkan: Nihui/Waifu2x-NCNN-vulkan". GitHub. April 2020.

External links

Official website
Free AI image upscaler: https://image-upscaling.net/
waifu2x on GitHub

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
