| |
Release date | February 19, 2013 |
---|---|
Codename |
|
Architecture | |
Models | GeForce series
|
Transistors |
|
Fabrication process | |
Cards | |
Entry-level |
|
Mid-range |
|
High-end |
|
Enthusiast |
|
API support | |
DirectX | Direct3D 12.0 (feature level 11_0) [1] [2] Shader Model 6.7 (Maxwell), Shader Model 6.5 (Kepler) or Shader Model 5.1 (Fermi) |
OpenCL | OpenCL 3.0 [a] |
OpenGL | OpenGL 4.6 |
Vulkan | Vulkan 1.2 [3] (Kepler) Vulkan 1.3 (Maxwell) SPIR-V |
History | |
Predecessor | GeForce 600 series |
Variant | GeForce 800M series |
Successor | GeForce 900 series |
Support status | |
Fermi and Kepler unsupported. Maxwell fully supported. |
The GeForce 700 series (stylized as GEFORCE GTX 700 SERIES) is a series of graphics processing units developed by Nvidia. While mainly a refresh of the Kepler microarchitecture (GK-codenamed chips), some cards use Fermi (GF) and later cards use Maxwell (GM). GeForce 700 series cards were first released in 2013, starting with the release of the GeForce GTX Titan on February 19, 2013, followed by the GeForce GTX 780 on May 23, 2013. The first mobile GeForce 700 series chips were released in April 2013.
GK110 was designed and marketed with computational performance in mind. It contains 7.1 billion transistors. This model also attempts to maximise energy efficiency through the execution of as many tasks as possible in parallel according to the capabilities of its streaming processors.
With GK110, increases in memory space and bandwidth for both the register file and the L2 cache over previous models, are seen. At the SMX level, GK110's register file space has increased to 256KB composed of 64K 32bit registers, as compared to Fermi's 32K 32bit registers totaling 128 KB. As for the L2 cache, GK110 L2 cache space increased by up to 1.5MB, 2x as big as GF110. Both the L2 cache and register file bandwidth have also doubled. Performance in register-starved scenarios is also improved as there are more registers available to each thread. This goes in hand with an increase of total number of registers each thread can address, moving from 63 registers per thread to 255 registers per thread with GK110.
With GK110, Nvidia also reworked the GPU texture cache to be used for compute. With 48KB in size, in compute the texture cache becomes a read-only cache, specializing in unaligned memory access workloads. Furthermore, error detection capabilities have been added to make it safer for use with workloads that rely on ECC. [4]
Dynamic Super Resolution (DSR) was added to Kepler GPUs with the latest Nvidia drivers. [5]
The GeForce 700 series contains features from both GK104 and GK110. Kepler based members of the 700 series add the following standard features to the GeForce family.
Derived from GK104:
New Features from GK110:
With GK110, Nvidia opted to increase computational performance. The single biggest change from GK104 is that rather than 8 dedicated FP64 CUDA cores, GK110 has up to 64, giving it 8x the FP64 throughput of a GK104 SMX. The SMX also sees an increase in space for register file. Register file space has increased to 256KB compared to Fermi. The texture cache are also improved. With a 48KB space, the texture cache can become a read-only cache for compute workloads. [4]
At a low level, GK110 sees additional instructions and operations to further improve performance. New shuffle instructions allow for threads within a warp to share data without going back to memory, making the process much quicker than the previous load/share/store method. Atomic operations are also overhauled, speeding up the execution speed of atomic operations and adding some FP64 operations that were previously only available for FP32 data. [4]
Hyper-Q expands GK110 hardware work queues from 1 to 32. The significance of this being that having a single work queue meant that Fermi could be under occupied at times as there wasn't enough work in that queue to fill every SM. By having 32 work queues, GK110 can in many scenarios, achieve higher utilization by being able to put different task streams on what would otherwise be an idle SMX. The simple nature of Hyper-Q is further reinforced by the fact that it's easily map to MPI, a common message passing interface frequently used in HPC. As legacy MPI-based algorithms that were originally designed for multi-CPU systems that became bottlenecked by false dependencies now have a solution. By increasing the number of MPI jobs, it's possible to utilize Hyper-Q on these algorithms to improve the efficiency all without changing the code itself. [4]
Nvidia Kepler GPUs of the GeForce 700 series fully support DirectX 11.0. All GeForce 700 series card also support DirectX 12.0 with feature level 11_0.
Dynamic parallelism ability is for kernels to be able to dispatch other kernels. With Fermi, only the CPU could dispatch a kernel, which incurs a certain amount of overhead by having to communicate back to the CPU. By giving kernels the ability to dispatch their own child kernels, GK110 can both save time by not having to go back to the CPU, and in the process free up the CPU to work on other tasks. [4]
The GeForce 700 series was designed for desktop architecture. Cheaper and lower performing products were expected to be released over time. Kepler supports 11.1 features with 11_0 feature level through the DirectX 11.1 API, however Nvidia did not enable four non-gaming features in Hardware in Kepler (for 11_1). [6] [7]
Some implementations may use different specifications.
Model | Launch | Code name | Fab (nm) | Bus interface | Core config1 | Clock speed | Fillrate | Memory | API support (version) | Processing power2 (GFLOPS) | TDP (watts) | Notes | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Core (MHz) | Shader (MHz) | Memory (MT/s) | Pixel (GP/s) | Texture (GT/s) | Size (MB) | Bandwidth (GB/s) | Type | Bus width (bit) | DirectX | OpenGL | OpenCL | Vulkan | |||||||||
GeForce 705M [11] | June 1, 2013 | GF119 [b] | 40 | PCIe 2.0 x16 | 48:8:4 | 775 | 1550 | 1800 | 1.48 | 5.9 | up to 2048 | ? | DDR3 | 64 | 12.0 (11_0) | 4.6 | 1.1 | 1.3 | 141.7 | 12 | Rebadged 520M |
GeForce 710M [12] | April 1, 2013 | GF117 [b] | 28 | 96:16:4 | 775 | 1550 | 1800 | 3.1 | 12.4 | up to 2048 | 14.4 | 64 | 297.6 | 15 | |||||||
GeForce GT 720M [13] | April 1, 2013 | 96:16:4 | 800 | 1600 | 1600 | 2.5 | 10 | up to 2048 | 12.8 | 64 | 240 | 33 | |||||||||
December 25, 2013 | GK208 | 192:16:8 | 800 | 2.9 | 11.5 | 1.2 | 276 | 33 | |||||||||||||
GeForce GT 730M [14] | April 1, 2013 | GK107 | PCIe 3.0 x16 | 384:32:16 | 725 | 725 | 1800 | 5.8 | 23 | up to 2048 | 14.4 – 64.0 | DDR3 GDDR5 | 128 | 1.1 | 552.2 | 33 | |||||
March 6, 2014 | GK208 | PCIe 2.0 x8 | 384:16:8 | 5.8 | 11.5 | 64 | |||||||||||||||
GeForce GT 735M [15] | April 1, 2013 | PCIe 2.0 x8 | 384:32:8 | 889 | 889 | 2000 | 4.6 | 9.2 | up to 2048 | 16.0 | DDR3 | 64 | 1.2 | 441.6 | 33 | ||||||
GeForce GT 740M [16] | April 1, 2013 | GK107 | PCIe 3.0 x16 | 384:32:16 | 810-1033 | 810-1033 | 1800/3600 | 6.48 | 25.9 | up to 2048 | 14.4 – 57.6 | DDR3 GDDR5 | 128 | 1.1 | 622.1 | 45 | |||||
June 20, 2013 | GK208 | PCIe 3.0 x8 | 384:32:8 | 980-1033 | 980-1033 | 8.3 | 33.1 | 64 | 1.2 | 752.6 | 33 | ||||||||||
GeForce GT 745M [17] | April 1, 2013 | GK107 | PCIe 3.0 x16 | 384:32:16 | 837 | 837 | 2000 – 5000 | 4.39 | 17.6 | up to 2048 | 32.0 – 80.0 | 128 | 642.8 | 45 | |||||||
GeForce GT 750M [18] | April 1, 2013 | 384:32:16 | 967 | 967 | 2000 – 5000 | 7.53 | 30.1 | up to 4096 | 32 – 80 | 128 | 1.1 | 722.7 | 50 | ||||||||
GeForce GT 755M [19] | Unknown | 384:32:16 | 1020 | 1020 | 5400 | 15.7 | 31.4 | up to 2048 | 86.4 | GDDR5 | 128 | 752.6 | 50 | ||||||||
GeForce GTX 760M [20] | May 30, 2013 | GK106 | 768:64:16 | 657 | 657 | 4008 | 10 | 40.2 | 2048 | 64.1 | 128 | 964.6 | 55 | ||||||||
GeForce GTX 765M [21] | May 30, 2013 | 768:64:16 | 850 | 850 | 4008 | 12.8 | 51 | 2048 | 64.1 | 128 | 1.2 | 1224 | 75 | ||||||||
GeForce GTX 770M [22] | May 30, 2013 | 960:80:24 | 811 | 811 | 4008 | 14.1 | 56.5 | 3072 | 96.2 | 192 | 1356 | 75 | |||||||||
GeForce GTX 780M [23] | May 30, 2013 | GK104 | 1536:128:32 | 823 | 823 | 5000 | 24.7 | 98.7 | 4096 | 160.0 | 256 | 2369 | 100 | ||||||||
Model | Launch | Code name | Fab (nm) | Transistors (million) | Die size (mm2) | Bus interface | SMX count | Core config [a] | Clock rate | Fillrate | Memory configuration | Supported API version | Processing power (GFLOPS) [b] | TDP (Watts) | Release price (USD) | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Base (MHz) | Average Boost (MHz) | Max Boost [c] (MHz) | Memory (MHz) | Pixel (GP/s) | Texture (GT/s) | Size (MB) | Bandwidth (GB/s) | DRAM type | Bus width (bit) | Vulkan [d] | Direct3D [e] | OpenGL | OpenCL | Single precision | Double precision | |||||||||||
GeForce GT 705 [27] [f] | March 27, 2014 | GF119-300-A1 | TSMC 40 nm | 292 | 79 | PCIe 2.0 x16 | 1 | 48:8:4 | 810 | — | — | 898 (1796) | 3.24 | 6.5 | 512 1024 | 14.4 | DDR3 | 64 | n/a | 12 | 4.6 | 1.1 | 155.5 | 19.4 | 29 | OEM |
GeForce GT 710 [28] | GK208-301-A1 | TSMC 28 nm | 1020 | 79 | PCIe 2.0 x8 | 192:16:8 | 823 | — | — | 900 (1800) | 6.6 | 13.2 | 512 | 64 | 1.2 | 1.2 | 316.0 | 13.2 | ||||||||
January 26, 2016 | GK208-203-B1 | PCIe 2.0 x8, PCIe x1 | 192:16:8 | 954 | — | — | 900 (1800) 1253 (5010) | 7.6 | 15.3 | 1024 2048 | 14.4 40.0 | DDR3 GDDR5 | 366 | 15.3 | 19 | 35–45 | ||||||||||
GeForce GT 720 [29] | March 27, 2014 | GK208-201-B1 | PCIe 2.0 x8 | 192:16:8 | 797 | — | — | 900 (1800) 1253 (5010) | 6.4 | 12.8 | 1024 2048 | 14.4 40.0 | 306 | 12.8 | 49–59 | |||||||||||
GeForce GT 730 [30] [g] [h] | June 18, 2014 | GK208-301-A1 | 2 | 384:16:8 | 902 | — | — | 900 (1800) | 7.22 | 14.44 | 1024 [31] 2048 4096 | 14.4 | DDR3 | 692.7 | 28.9 | 23 | 69–79 | |||||||||
GK208-400-A1 | 384:16:8 | 902 | — | — | 1250 (5000) | 7.22 | 14.44 | 1024 2048 [32] | 40.0 | GDDR5 | 25 | |||||||||||||||
GF108 | TSMC 40 nm | 585 | 116 | PCIe 2.0 x16 | 96:16:4 | 700 | — | — | 900 (1800) | 2.8 | 11.0 | 1024 2048 4096 | 28.8 | DDR3 | 128 | n/a | 1.1 | 268.8 | 33.6 | 49 | ||||||
GeForce GT 740 [i] | May 29, 2014 | GK107-425-A2 | TSMC 28HP | 1270 | 118 | PCIe 3.0 x16 | 384:32:16 | 993 | — | — | 891 (1782) | 15.9 | 31.8 | 28.5 | 128 | 1.2 | 1.2 | 762.6 | 31.8 | 64 | 89–99 | |||||
384:32:16 | 993 | — | — | 1252 (5008) | 15.9 | 31.8 | 80.1 | GDDR5 | ||||||||||||||||||
GeForce GTX 745 | February 18, 2014 | GM107-220-A2 | 1870 | 148 | 3 | 384:24:16 | 1033 | Unknown | Unknown | 900 (1800) | 16.5 | 24.8 | 1024 4096 | 28.8 | DDR3 | 1.3 | 793.3 | 24.8 | 55 | OEM | ||||||
GeForce GTX 750 | GM107-300-A2 | 4 | 512:32:16 | 1020 | 1085 | 1163 | 1250 (5000) | 16.3 | 32.6 | 1024 2048 4096 [33] | 80 | GDDR5 | 1044.5 | 32.6 | 119 | |||||||||||
GeForce GTX 750 Ti | GM107-400-A2 | 5 | 640:40:16 | 1020 | 1085 | 1200 | 1350 (5400) | 16.3 | 40.8 | 1024 2048 4096 | 86.4 | 1305.6 | 40.8 | 60 | 149 | |||||||||||
GeForce GTX 760 192-bit | October 17, 2013 | GK104-200-KD-A2 | 3540 | 294 | 6 | 1152:96:24 | 824 | 888 | 889 | 1450 (5800) | 19.8 | 79.1 | 1536 3072 | 139.2 | 192 | 1.2 | 1896.2 | 79.0 | 130 | OEM | ||||||
GeForce GTX 760 | June 25, 2013 | GK104-225-A2 | 1152:96:32 | 980 | 1033 | 1124 | 1502 (6008) | 31.4 [j] | 94 | 2048 4096 | 192.3 | 256 | 2257.9 | 94.1 | 170 | 249 (219) | ||||||||||
GeForce GTX 760 Ti [k] | September 27, 2013 [34] | GK104 | 7 | 1344:112:32 | 915 | 980 | 1084 | 1502 (6008) | 29.3 | 102.5 | 2048 | 192.3 | 2459.5 | 102.5 | OEM | |||||||||||
GeForce GTX 770 | May 30, 2013 | GK104-425-A2 | 8 | 1536:128:32 | 1046 | 1085 | 1130 | 1752.5 (7010) | 33.5 | 134 | 2048 4096 | 224 | 3213.3 | 133.9 | 230 | 399 (329) | ||||||||||
GeForce GTX 780 | May 23, 2013 | GK110-300-A1 | 7080 | 561 | 12 | 2304:192:48 | 863 | 900 | 1002 | 1502 (6008) | 41.4 [j] | 160.5 | 3072 6144 [35] | 288.4 | 384 | 3976.7 | 165.7 | 649 (499) | ||||||||
GeForce GTX 780 Ti [36] [37] [38] | November 7, 2013 | GK110-425-B1 | 15 | 2880:240:48 | 876 | 928 | 1019 | 1752.5 (7010) | 42.0 [j] | 210.2 | 3072 | 336.5 | 5045.7 | 210.2 | 699 | |||||||||||
GeForce GTX TITAN [39] [40] [41] | February 21, 2013 | GK110-400-A1 | 14 | 2688:224:48 | 837 | 876 | 993 | 1502 (6008) | 40.2 | 187.5 | 6144 | 288.4 | 4499.7 | 1300 [42] -1499.9 | 999 | |||||||||||
GeForce GTX TITAN Black | February 18, 2014 | GK110-430-B1 | 15 | 2880:240:48 | 889 | 980 | 1058 | 1752.5 (7010) | 42.7 | 213.4 | 336.5 | 5120.6 | 1706.9 | |||||||||||||
GeForce GTX TITAN Z | May 28, 2014 | 2x GK110-350-B1 [43] | 2x 7080 | 2x 561 | 2x 15 | 2x 2880:240:48 | 705 | 876 | Unknown | 1752.5 (7010) | 2x 33.8 | 2x 169 | 2x 6144 | 2x 336.5 | 2x 384 | 4.5 | 5046x2 | 1682x2 [44] | 375 | 2999 | ||||||
Model | Launch | Code name | Fab (nm) | Transistors (million) | Die size (mm2) | Bus interface | SMX count | Core config [a] | Clock rate | Fillrate | Memory configuration | Supported API version | Processing power (GFLOPS) [k] | TDP (Watts) | Release price (USD) | |||||||||||
Base (MHz) | Average Boost (MHz) | Max Boost [c] (MHz) | Memory (MHz) | Pixel (GP/s) | Texture (GT/s) | Size (MB) | Bandwidth (GB/s) | DRAM type | Bus width (bit) | Vulkan | Direct3D [e] | OpenGL | OpenCL | Single precision | Double precision |
Nvidia stopped releasing 32-bit drivers for 32-bit operating systems after the last Release 390.x driver, 391.35, was released in March 2018. [45]
Kepler notebook GPUs moved to legacy support in April 2019 and stopped receiving security updates in April 2020. [46] All notebook GPUs from the 7xxM family were affected by this change.
Nvidia announced that after Release 470 drivers, it would transition driver support for the Windows 7 and Windows 8.1 operating systems to legacy status and continue to provide critical security updates for these operating systems through September 2024. [47]
Nvidia announced that all remaining Kepler desktop GPUs would transition to legacy support from September 2021 onwards and be supported for critical security updates through September 2024. [48] The Nvidia GeForce GTX 745, 750 and 750 Ti from the 7xx desktop GPU family would not be affected by this change.
In Windows ,the last driver to fully support CUDA with 64-Bit Compute Capability 3.5 for Kepler in Windows 7 and Windows 8.1 64-bit is 388.71, tested with latest CUDA-Z and GPU-Z, after that driver, the 64-Bit CUDA support becomes broken for GeForce 700 series GK110 with Kepler architecture.
The last driver where monitor type detection is working properly on Windows XP is 352.86. [49]
GeForce is a brand of graphics processing units (GPUs) designed by Nvidia and marketed for the performance market. As of the GeForce 40 series, there have been eighteen iterations of the design. The first GeForce products were discrete GPUs designed for add-on graphics boards, intended for the high-margin PC gaming market, and later diversification of the product line covered all tiers of the PC graphics market, ranging from cost-sensitive GPUs integrated on motherboards to mainstream add-in retail boards. Most recently, GeForce technology has been introduced into Nvidia's line of embedded application processors, designed for electronic handhelds and mobile handsets.
The GeForce 6 series is the sixth generation of Nvidia's GeForce line of graphics processing units. Launched on April 14, 2004, the GeForce 6 family introduced PureVideo post-processing for video, SLI technology, and Shader Model 3.0 support.
Scalable Link Interface (SLI) is the brand name for a now discontinued multi-GPU technology developed by Nvidia for linking two or more video cards together to produce a single output. SLI is a parallel processing algorithm for computer graphics, meant to increase the available processing power.
The GeForce 7 series is the seventh generation of Nvidia's GeForce line of graphics processing units. This was the last series available on AGP cards.
Quadro was Nvidia's brand for graphics cards intended for use in workstations running professional computer-aided design (CAD), computer-generated imagery (CGI), digital content creation (DCC) applications, scientific calculations and machine learning from 2000 to 2020.
The GeForce 8 series is the eighth generation of Nvidia's GeForce line of graphics processing units. The third major GPU architecture developed by Nvidia, Tesla represents the company's first unified shader architecture.
The GeForce 9 series is the ninth generation of Nvidia's GeForce line of graphics processing units, the first of which was released on February 21, 2008. The products are based on an updated Tesla microarchitecture, adding PCI Express 2.0 support, improved color and z-compression, and built on a 65 nm process, later using 55 nm process to reduce power consumption and die size.
PureVideo is Nvidia's hardware SIP core that performs video decoding. PureVideo is integrated into some of the Nvidia GPUs, and it supports hardware decoding of multiple video codec standards: MPEG-2, VC-1, H.264, HEVC, and AV1. PureVideo occupies a considerable amount of a GPU's die area and should not be confused with Nvidia NVENC. In addition to video decoding on chip, PureVideo offers features such as edge enhancement, noise reduction, deinterlacing, dynamic contrast enhancement and color enhancement.
The GeForce 200 series is a series of Tesla-based GeForce graphics processing units developed by Nvidia.
The GeForce 400 series is a series of graphics processing units developed by Nvidia, serving as the introduction of the Fermi microarchitecture. Its release was originally slated in November 2009, however, after delays, it was released on March 26, 2010, with availability following in April 2010.
The GeForce 500 series is a series of graphics processing units developed by Nvidia, as a refresh of the Fermi based GeForce 400 series. It was first released on November 9, 2010 with the GeForce GTX 580.
The GeForce 600 series is a series of graphics processing units developed by Nvidia, first released in 2012. It served as the introduction of the Kepler architecture. It is succeeded by the GeForce 700 series.
Fermi is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia, first released to retail in April 2010, as the successor to the Tesla microarchitecture. It was the primary microarchitecture used in the GeForce 400 series and 500 series. All desktop Fermi GPUs were manufactured in 40nm, mobile Fermi GPUs in 40nm and 28nm. Fermi is the oldest microarchitecture from Nvidia that receives support for Microsoft's rendering API Direct3D 12 feature_level 11.
The GeForce 800M series is a family of graphics processing units by Nvidia for laptop PCs. It consists of rebrands of mobile versions of the GeForce 700 series and some newer chips that are lower end compared to the rebrands.
The GeForce 900 series is a family of graphics processing units developed by Nvidia, succeeding the GeForce 700 series and serving as the high-end introduction to the Maxwell microarchitecture, named after James Clerk Maxwell. They were produced with TSMC's 28 nm process.
The GeForce 10 series is a series of graphics processing units developed by Nvidia, initially based on the Pascal microarchitecture announced in March 2014. This design series succeeded the GeForce 900 series, and is succeeded by the GeForce 16 series and GeForce 20 series using the Turing microarchitecture.
Kepler is the codename for a GPU microarchitecture developed by Nvidia, first introduced at retail in April 2012, as the successor to the Fermi microarchitecture. Kepler was Nvidia's first microarchitecture to focus on energy efficiency. Most GeForce 600 series, most GeForce 700 series, and some GeForce 800M series GPUs were based on Kepler, all manufactured in 28 nm. Kepler found use in the GK20A, the GPU component of the Tegra K1 SoC, and in the Quadro Kxxx series, the Quadro NVS 510, and Tesla computing modules.
Maxwell is the codename for a GPU microarchitecture developed by Nvidia as the successor to the Kepler microarchitecture. The Maxwell architecture was introduced in later models of the GeForce 700 series and is also used in the GeForce 800M series, GeForce 900 series, and Quadro Mxxx series, as well as some Jetson products.
Pascal is the codename for a GPU microarchitecture developed by Nvidia, as the successor to the Maxwell architecture. The architecture was first introduced in April 2016 with the release of the Tesla P100 (GP100) on April 5, 2016, and is primarily used in the GeForce 10 series, starting with the GeForce GTX 1080 and GTX 1070, which were released on May 27, 2016, and June 10, 2016, respectively. Pascal was manufactured using TSMC's 16 nm FinFET process, and later Samsung's 14 nm FinFET process.
the calculated fp64 peak of Titan is 1.5 TFlops. However, under heavy load in fp64 mode, the card may underclock below the listed 837MHz to remain within the power and thermal specifications