RSX Reality Synthesizer

Last updated
The RSX 'Reality Synthesizer' on a PlayStation 3 motherboard RSX 'Reality Synthesizer'.jpg
The RSX 'Reality Synthesizer' on a PlayStation 3 motherboard

The RSX 'Reality Synthesizer' is a proprietary graphics processing unit (GPU) codeveloped by Nvidia and Sony for the PlayStation 3 game console. It is based on the Nvidia 7800GTX graphics processor and, according to Nvidia, is a G70/G71 (previously known as NV47) hybrid architecture with some modifications. The RSX has separate vertex and pixel shader pipelines. The GPU makes use of 256 MB GDDR3 RAM clocked at 650 MHz with an effective transmission rate of 1.3 GHz and up to 224 MB of the 3.2 GHz XDR main memory via the CPU (480 MB max). Although it carries the majority of the graphics processing, the Cell Broadband Engine, the console's CPU, is also used complementarily for some graphics-related computational loads of the console.

Contents


Specifications

Length of chip at bottom: 4.28 cm RSX Reality Synthesizer on CD.jpg
Length of chip at bottom: 4.28 cm

Unless otherwise noted, the following specifications are based on a press release by Sony at the E3 2005 conference, [1] slides from the same conference, [2] and slides from a Sony presentation at the 2006 Game Developer's Conference.[ citation needed ]

The RSX has a floating-point performance of 192 GFLOPS. [3]

Other features: Support for Bilinear, trilinear, anisotropic, quincunx texture filtering, quincunx antialiasing, up to 4xMSAA, SSAA, Alpha to Coverage and Alphakill.

Model numbers

90nm:

65nm:

40nm:

Local GDDR3 physical memory structure

RSX memory map

Although the RSX has 256 MB of GDDR3 RAM, not all of it is usable. The last 4 MB is reserved for keeping track of the RSX internal state and issued commands. The 4 MB of GPU Data contains RAMIN, RAMHT, RAMFC, DMA Objects, Graphic Objects, and the Graphic Context. The following is a breakdown of the address within 256 MB of the RSX.

Address RangeSizeComment
0000000-FBFFFFF252 MBFramebuffer
FC00000-FFFFFFF4 MBGPU Data
FF80000-FFFFFFF512KBRAMIN: Instance Memory
FF90000-FF93FFF16KBRAMHT: Hash Table
FFA0000-FFA0FFF4KBRAMFC: FIFO Context
FFC0000-FFCFFFF64KBDMA Objects
FFD0000-FFDFFFF64KBGraphic Objects
FFE0000-FFFFFFF128KBGRAPH: Graphic Context

Besides local GDDR3 memory, main XDR memory can be accessed by RSX too, which is limited to either:

-or-

Speed, bandwidth and latency

System bandwidth (theoretical maximum):

Because of the aforementioned layout of the communication path between the different chips, and the latency and bandwidth differences between the various components, there are different access speeds depending on the direction of the access in relation to the source and destination. The following is a chart showing the speed of reads and writes to the GDDR3 and XDR memory from the viewpoint of the Cell and RSX. Note that these are measured speeds (rather than calculated speeds) and they should be worse if RSX and GDDR3 access are involved because these figures were measured when the RSX was clocked at 550Mhz and the GDDR3 memory was clocked at 700Mhz. The shipped PS3 has the RSX clocked in at 500Mhz (front and back end, although the pixel shaders run separately inside at 550Mhz). In addition, the GDDR3 memory was also clocked lower at 650Mhz.

Speed table

Processor256 MB XDR256 MB GDDR3
Cell Read16.8GB/s16MB/s (15.6MB/s @ 650 MHz)
Cell Write24.9GB/s4GB/s
RSX Read15.5GB/s22.4GB/s (20.8GB/s @ 650 MHz)
RSX Write10.6GB/s22.4GB/s (20.8GB/s @ 650 MHz)

Because of the very slow Cell Read speed from the 256 MB GDDR3 memory, it is more efficient for the Cell to work in XDR and then have the RSX pull data from XDR and write to GDDR3 for output to the HDMI display. This is why extra texture lookup instructions were included in the RSX to allow loading data from XDR memory (as opposed to the local GDDR3 memory).

RSX libraries

The RSX is dedicated to 3D graphics, and developers are able to use different API libraries to access its features. The easiest way is to use high level PSGL, which is basically OpenGL|ES with programmable pipeline added in, however this is unpopular due to the performance overhead on a relatively weak console CPU. At a lower level developers can use LibGCM, which is an API that builds RSX command buffers at a lower level. (PSGL is actually implemented on top of LibGCM). This is done by setting up commands (via FIFO Context) and DMA Objects and issuing them to the RSX via DMA calls.

Differences with the G70 architecture

The RSX 'Reality Synthesizer' is based on the G70 architecture, but features a few changes to the core. [7] The biggest difference between the two chips is the way the memory bandwidth works. The G70 only supports rendering to local memory, while the RSX is able to render to both system and local memory. Since rendering from system memory has a much higher latency compared to rendering from local memory, the chip's architecture had to be modified to avoid a performance penalty. This was achieved by enlarging the chip size to accommodate larger buffers and caches in order to keep the graphics pipeline full. The result was that the RSX only has 60% of the local memory bandwidth of the G70, making it necessary for developers to use the system memory in order to achieve performance targets. [7]

DifferenceRSXnVidia 7800GTX
GDDR3 Memory bus128bit256bit
ROPs816
Post Transform and Lighting Cache63 max vertices45 max vertices
Total Texture Cache Per Quad of Pixel Pipes (L1 and L2)96kB48kB
CPU interfaceFlexIOPCI-Express 16x
Technology28 nm/40 nm/65 nm/90 nm110 nm

Other RSX features/differences include:

Press releases

Sony staff were quoted in PlayStation Magazine saying that the "RSX shares a lot of inner workings with NVIDIA 7800 which is based on G70 architecture."[ citation needed ] Since the G70 is capable of carrying out 136 shader operations per clock cycle, the RSX was expected to feature the same number of parallel pixel and vertex shader pipelines as the G70, which contains 24 pixel and 8 vertex pipelines.

Nvidia CEO Jen-Hsun Huang stated during Sony's pre-show press conference at E3 2005 that the RSX is twice as powerful as the GeForce 6800 Ultra. [2]

Bumpgate

In the case of the PlayStation 3, the RSX was originally manufactured with the 90nm process. before transitioning to the 65nm, 40nm and finally the 28nm process. The 90nm version of the RSX was packaged (in the context of thermal strain) with incompatible die packaging elements. These factors lead to the Ball Grid Array (BGA) between the chip's interposer and its die failing at an abnormally fast rate.

Some of the factors of failure include

See also

Related Research Articles

<span class="mw-page-title-main">GeForce 3 series</span> Series of GPUs by Nvidia

The GeForce 3 series (NV20) is the third generation of Nvidia's GeForce line of graphics processing units (GPUs). Introduced in February 2001, it advanced the GeForce architecture by adding programmable pixel and vertex shaders, multisample anti-aliasing and improved the overall efficiency of the rendering process.

<span class="mw-page-title-main">GeForce 6 series</span> Series of GPUs by Nvidia

The GeForce 6 series is the sixth generation of Nvidia's GeForce line of graphics processing units. Launched on April 14, 2004, the GeForce 6 family introduced PureVideo post-processing for video, SLI technology, and Shader Model 3.0 support.

The R420 GPU, developed by ATI Technologies, was the company's basis for its 3rd-generation DirectX 9.0/OpenGL 2.0-capable graphics cards. Used first on the Radeon X800, the R420 was produced on a 0.13 micrometer low-K photolithography process and used GDDR-3 memory. The chip was designed for AGP graphics cards.

<span class="mw-page-title-main">Voodoo 5</span> Graphics card line

The Voodoo 5 was the last and most powerful graphics card line that was released by 3dfx Interactive. All members of the family were based upon the VSA-100 graphics processor. Only the single-chip Voodoo 4 4500 and dual-chip Voodoo 5 5500 made it to market.

<span class="mw-page-title-main">Radeon R200 series</span> Series of video cards

The R200 is the second generation of GPUs used in Radeon graphics cards and developed by ATI Technologies. This GPU features 3D acceleration based upon Microsoft Direct3D 8.1 and OpenGL 1.3, a major improvement in features and performance compared to the preceding Radeon R100 design. The GPU also includes 2D GUI acceleration, video acceleration, and multiple display outputs. "R200" refers to the development codename of the initially released GPU of the generation. It is the basis for a variety of other succeeding products.

<span class="mw-page-title-main">GeForce 7 series</span> Series of GPUs by Nvidia

The GeForce 7 series is the seventh generation of Nvidia's GeForce line of graphics processing units. This was the last series available on AGP cards.

<span class="mw-page-title-main">Xenos (graphics chip)</span> GPU used in the Xbox 360

The Xenos is a custom graphics processing unit (GPU) designed by ATI, used in the Xbox 360 video game console developed and produced for Microsoft. Developed under the codename "C1", it is in many ways related to the R520 architecture and therefore very similar to an ATI Radeon X1800 XT series of PC graphics cards as far as features and performance are concerned. However, the Xenos introduced new design ideas that were later adopted in the TeraScale microarchitecture, such as the unified shader architecture. The package contains two separate dies, the GPU and an eDRAM, featuring a total of 337 million transistors.

The R520 is a graphics processing unit (GPU) developed by ATI Technologies and produced by TSMC. It was the first GPU produced using a 90 nm photolithography process.

<span class="mw-page-title-main">Radeon R300 series</span> Series of video cards

The R300 GPU, introduced in August 2002 and developed by ATI Technologies, is its third generation of GPU used in Radeon graphics cards. This GPU features 3D acceleration based upon Direct3D 9.0 and OpenGL 2.0, a major improvement in features and performance compared to the preceding R200 design. R300 was the first fully Direct3D 9-capable consumer graphics chip. The processors also include 2D GUI acceleration, video acceleration, and multiple display outputs.

<span class="mw-page-title-main">Radeon R100 series</span> Series of video cards

The Radeon R100 is the first generation of Radeon graphics chips from ATI Technologies. The line features 3D acceleration based upon Direct3D 7.0 and OpenGL 1.3, and all but the entry-level versions offloading host geometry calculations to a hardware transform and lighting (T&L) engine, a major improvement in features and performance compared to the preceding Rage design. The processors also include 2D GUI acceleration, video acceleration, and multiple display outputs. "R100" refers to the development codename of the initially released GPU of the generation. It is the basis for a variety of other succeeding products.

<span class="mw-page-title-main">Matrox Parhelia</span> GPU by Matrox

The Matrox Parhelia-512 is a graphics processing unit (GPU) released by Matrox in 2002. It has full support for DirectX 8.1 and incorporates several DirectX 9.0 features. At the time of its release, it was best known for its ability to drive three monitors and its Coral Reef tech demo.

The GeForce 8 series is the eighth generation of Nvidia's GeForce line of graphics processing units. The third major GPU architecture developed by Nvidia, Tesla represents the company's first unified shader architecture.

<span class="mw-page-title-main">GDDR5 SDRAM</span> Type of high performance DRAM graphics card memory

Graphics Double Data Rate 5 Synchronous Dynamic Random-Access Memory is a type of synchronous graphics random-access memory (SGRAM) with a high bandwidth interface designed for use in graphics cards, game consoles, and high-performance computing. It is a type of GDDR SDRAM.

<span class="mw-page-title-main">Hollywood (graphics chip)</span>

The Hollywood graphics chip is the graphics processing unit (GPU) used in Nintendo's Wii video game console. It was designed by ATI, and was manufactured using the same 90 nm or 65 nm CMOS process as Broadway, the Wii's central processing unit. Very few official details about Hollywood were released to the public by Nintendo, ATI, or any other company involved in the Wii's development. The Hollywood GPU is reportedly based on the GameCube's Flipper GPU and is clocked 50% higher at 243 MHz, though these clock rates have never been officially confirmed.

The GeForce 9 series is the ninth generation of Nvidia's GeForce line of graphics processing units, the first of which was released on February 21, 2008. The products are based on an updated Tesla microarchitecture, adding PCI Express 2.0 support, improved color and z-compression, and built on a 65 nm process, later using 55 nm process to reduce power consumption and die size.

<span class="mw-page-title-main">PlayStation 3 technical specifications</span> Overview of the PlayStation 3 technical specifications

The PlayStation 3 technical specifications describe the various components of the PlayStation 3 (PS3) video game console.

<span class="mw-page-title-main">Radeon 9000 series</span> Series of video cards

The R300 GPU, introduced in August 2002 and developed by ATI Technologies, is its third generation of GPU used in Radeon graphics cards. This GPU features 3D acceleration based upon Direct3D 9.0 and OpenGL 2.0, a major improvement in features and performance compared to the preceding R200 design. R300 was the first fully Direct3D 9-capable consumer graphics chip. The processors also include 2D GUI acceleration, video acceleration, and multiple display outputs.

<span class="mw-page-title-main">PlayStation 2 technical specifications</span> Overview of the PlayStation 2 technical specifications

The PlayStation 2 technical specifications describe the various components of the PlayStation 2 (PS2) video game console.

<span class="mw-page-title-main">Xbox technical specifications</span>

The Xbox technical specifications describe the various components of the Xbox video game console.

References

  1. "SONY COMPUTER ENTERTAINMENT INC. TO LAUNCH ITS NEXT GENERATION COMPUTER ENTERTAINMENT SYSTEM, PLAYSTATION3 IN SPRING 2006" (Press release). Sony Computer Entertainment Inc. 2005-05-16.
  2. 1 2 "Sony Introduces PlayStation 3, to launch in 2006". AnandTech. 2005-05-16.
  3. Klug, Anand Lal Shimpi, Brian. "NVIDIA Tegra K1 Preview & Architecture Analysis". www.anandtech.com. Retrieved 2024-08-13.{{cite web}}: CS1 maint: multiple names: authors list (link)
  4. "PS3 Graphics Chip Goes 65nm in Fall". Edge Online. 2008-06-26.
  5. "Sony PS3 upgraded with cooler 40-nm RSX graphics chip, profits await (updated)". Engadget. 2010-04-26.
  6. Gantayat, Anoop (2006-01-30). "New PS3 tools". IGN.com. Retrieved 2006-08-28.
  7. 1 2 "Microsoft's Xbox 360, Sony's PS3 - A Hardware Discussion" . Retrieved 2014-03-08.
  8. Young Yang, Se; Kim, Ilho; Lee, Soon-Bok (2008). "A Study on the Thermal Fatigue Behavior of Solder Joints Under Power Cycling Conditions". IEEE Transactions on Components and Packaging Technologies. 31: 3–12. doi:10.1109/TCAPT.2007.906294.
  9. Demerjian, Charlie (2008-09-01). "Why Nvidia's chips are defective". The Inquirer. Archived from the original on 2009-05-25. Retrieved 2023-11-12.
  10. Hau-Riege, Christine; Yau, YouWen (2018). Electromigration Reliability of Solder Balls. doi:10.1109/IPFA.2018.8452576.
  11. Hillman, C; Blattau, N; Sharon, G. "Low Tg Underfill: The Good, The Bad, and The Ugly" (PDF). Retrieved March 19, 2024.
  12. Vissa, U; Butel, N; Rowatt, J; Thielen, C. (2006). A systematic approach to qualification of 90 nm low-K flip-chip packaging. doi:10.1109/ECTC.2006.1645618.