Integrated Performance Primitives

Intel Integrated Performance Primitives
Developer(s)	Intel
Stable release	2021.11.0 / March 28, 2024
Written in	C/C++
Operating system	Linux, macOS, Microsoft Windows
Type	Library or framework
License	Proprietary, freeware
Website	software.intel.com/intel-ipp

Last updated September 25, 2024

Intel Integrated Performance Primitives (Intel IPP) is an extensive library of ready-to-use, domain-specific functions that are highly optimized for diverse Intel architectures. Its royalty-free APIs help developers take advantage of single instruction, multiple data (SIMD) instructions.^[4]

Features

The library takes advantage of processor features including MMX, SSE, SSE2, SSE3, SSSE3, SSE4, AVX, AVX2, AVX-512, AES-NI and multi-core processors.^[6]

Organization

Intel IPP is divided into four major processing groups: signal processing (with linear array or vector data), image processing (with 2D arrays for typical color spaces), data compression, and cryptography.^[6]

Half the entry points are of the matrix type, a third are of the signal type, and the remainder are of the image and cryptography types. Intel IPP functions are distinguished by the data type they operate on; these types include 8u (8-bit unsigned), 8s (8-bit signed), 16s (16-bit signed), 32f (32-bit floating-point), and 64f (64-bit floating-point), among others. Typically, an application developer works with only one dominant data type for most processing functions, converting between input, processing, and output formats at the end points.^[6]
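The "one dominant data type" pattern described above can be sketched in plain C. This is only an illustration and does not call Intel IPP: the typedefs `sample_8u` and `sample_32f` and the three helper functions are hypothetical names chosen to echo the 8u and 32f suffixes. The sketch assumes 8-bit unsigned input and output with all processing done in 32-bit floating point.

```c
#include <stddef.h>
#include <stdint.h>

/* Local stand-ins that mirror the 8u and 32f suffixes.
   These are hypothetical names, not types from the IPP headers. */
typedef uint8_t sample_8u;
typedef float   sample_32f;

/* Input end point: widen 8-bit unsigned samples to 32-bit float. */
static void convert_8u_to_32f(const sample_8u *src, sample_32f *dst, size_t len)
{
    for (size_t i = 0; i < len; ++i)
        dst[i] = (sample_32f)src[i];
}

/* Processing stage: every step works on the dominant 32f type. */
static void scale_32f_inplace(sample_32f *buf, sample_32f gain, size_t len)
{
    for (size_t i = 0; i < len; ++i)
        buf[i] *= gain;
}

/* Output end point: add 0.5, clamp to [0, 255], then truncate,
   which rounds half up for in-range values and saturates the rest. */
static void convert_32f_to_8u(const sample_32f *src, sample_8u *dst, size_t len)
{
    for (size_t i = 0; i < len; ++i) {
        sample_32f v = src[i] + 0.5f;
        if (v < 0.0f)   v = 0.0f;
        if (v > 255.0f) v = 255.0f;
        dst[i] = (sample_8u)v;
    }
}
```

A caller would convert once on input, run any number of 32f processing steps, and convert once on output. Keeping the intermediate buffer in floating point avoids repeated rounding and saturation between steps.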

History

Version 2.0 files are dated April 22, 2002.
Version 3.0
Version 4.0 files are dated November 11, 2003. 4.0 runtime fully supports applications coded for 3.0 and 2.0.
Version 5.1 files are dated March 9, 2006. 5.1 runtime does not support applications coded for 4.0 or before.
Version 5.2 files are dated April 11, 2007. 5.2 runtime does not support applications coded for 5.1 or before. The version was introduced on June 5, 2007, adding code samples for data compression, new video codec support, support for 64-bit applications on Mac OS X, support for Windows Vista, and new functions for ray tracing and rendering.
Version 6.1 was released with the Intel C++ Compiler on June 28, 2009. Update 1 for version 6.1 was released on July 28, 2009. Update 2 files are dated October 19, 2009.^[7]
Version 7.1^[8]
Version 8.0^[9]
Version 8.1^[10]
Version 8.2^[11]
Version 9.0 Initial Release, August 25, 2015^[12]
Version 9.0 Update 1, December 1, 2015^[13]
Version 9.0 Update 2
Version 9.0 Update 3
Version 9.0 Update 4
Version 2017 Initial Release
Version 2017 Update 1
Version 2017 Update 2
Version 2017 Update 3, February 28, 2016^[1]
Version 2018 Initial Release
Version 2018 Update 1
Version 2018 Update 2
Version 2018 Update 2.1
Version 2018 Update 3
Version 2018 Update 3.1
Version 2018 Update 4, September 20, 2018^[1]
Version 2019 Initial Release
Version 2019 Update 1
Version 2019 Update 2
Version 2019 Update 3, February 14, 2019^[1]
Version 2019 Update 4
Version 2019 Update 5
Version 2020 Initial Release, December 12, 2019^[1]^[2]
Version 2020 Update 1, March 30, 2020^[1]^[2]
Version 2020 Update 2, July 16, 2020^[1]^[2]
Version 2020 Update 3
Version 2021 Initial Release
Version 2021.1
Version 2021.2
Version 2021.3
Version 2021.4
Version 2021.5
Version 2021.6
Version 2021.7, December 2022^[14]
Version 2021.8, April 2023^[14]
Version 2021.9.0, July 2023^[14]
Version 2021.9.1, October 2023^[14]
Version 2021.10.0, November 2023^[14]
Version 2021.10.1, December 2023^[14]
Version 2021.11.0, March 2024^[14]
Version 2021.12.0, June 2024^[15]

Counterparts

Sun: mediaLib for Solaris
Apple: vDSP, vImage, Accelerate etc. for macOS
AMD: Framewave (formerly the AMD Performance Library or APL)
Khronos Group: OpenMAX DL
NVIDIA: NVIDIA Performance Primitives (NPP)^[16]

References

[1] "Intel® Integrated Performance Primitives Library Release Notes and New Features". software.intel.com.
[2] "Intel® IPP 2020 Bug Fixes". software.intel.com.
[3] "No Cost Options for Intel Parallel Studio XE, Support yourself, Royalty-Free".
[4] "Intel® Integrated Performance Primitives". Intel. Retrieved 2024-04-03.
[5] "Intel® oneAPI Toolkit and Component Versioning Schema". Intel. Retrieved 2024-04-03.
[6] "Intel Integrated Performance Primitives (Intel IPP) Library".
[7] "Intel Integrated Performance Primitives (Intel IPP) Library 6.1 Release Notes".
[8] "Intel Integrated Performance Primitives (Intel IPP) Library 7.1 Release Notes".
[9] "Intel Integrated Performance Primitives (Intel IPP) Library 8.0 Release Notes".
[10] "Intel Integrated Performance Primitives (Intel IPP) Library 8.1 Release Notes".
[11] "Intel Integrated Performance Primitives (Intel IPP) Library 8.2 Release Notes".
[12] "Intel Integrated Performance Primitives (Intel IPP) Library 9.0 Release Notes".
[13] "Intel Integrated Performance Primitives (Intel IPP) Library 9.0 Github".
[14] Harrison, Pamela. "Intel® Integrated Performance Primitives Release Notes for Intel®..." Intel. Retrieved 2024-04-03.
[15] Harrison, Pamela. "Intel® Integrated Performance Primitives Release Notes for Intel®..." Intel. Retrieved 2024-07-23.
[16] "NVIDIA Performance Primitives (NPP)". NVIDIA Developer. Retrieved 2024-04-03.

External links

Official website
Intel oneAPI Base Toolkit Home Page
Stewart Taylor, "Intel Integrated Performance Primitives - How to Optimize Software Applications Using Intel IPP", Intel Press.
Jpeg Delphi implementation using official JPEG Group C library or Intel Jpeg Library 1.5 (ijl.dll included)
How To Install OpenCV using IPP (in French). Archived 2020-08-08 at the Wayback Machine

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
