
Originally Posted by
Subhadip Sen
GTX 580 has 512, but it's not as straightforward. Kepler is a vastly different marchitecture to Fermi. Previously Nvidia's design philosophy was few high efficiency big shaders at high clocks (2 x core clocks). So, GTX 580's 512 shaders were 'hotclocked' at 1550 MHz. With Kepler Nvidia has adopted AMD's model - many small, dense not-as-efficient shaders at low clocks (AMD 7970 has 2048 shaders at 925 MHz). GTX 680 has 1536 shaders clocked at 1006 MHz. Kepler shaders are smaller and less complex, most notable FP64 has dropped to 1/24th rate versus 1/8th for Fermi. So, the numbers are not really comparable. However, what is up is FP32 and thus single precision FLOPS. Way up, at 90+%. FP16 texture fillrate is also up massively, though this has little relevance to CUDA. Memory bandwidth is identical, 192 GB/s, GTX 580 has a wider 384-bit memory bus. In more complex compute (CUDA/OpenCL/DirectCompute) applications GTX 580 is still faster. Perhaps Premiere Pro is not as complex, relying mostly on single precision brute force. Let's wait for benchmarks to see how well GTX 680 performs in Premiere Pro.