General / Re: New Low Hash Rate (LHR) GeForce Cards
« on: September 03, 2021, 09:29:07 AM »
@Corensia
In multicore performance the 5600X is unfortunately slower than the 9900K, with both at stock frequencies (Cinebench Rxx is a good comparison). The 5600X is better only in single-core performance.
As for the GPUs, there were changes in the CUDA cores and especially in how they are grouped within the GPU core. In the RTX 30xx the CUDA core count was doubled, but the cores were also "regrouped in two blocks" within one SM block. There is also the question of the dispatcher that assigns tasks to the CUDA cores, and how well it handles that assignment. You can compare the numbers (core configuration and the other fields in the table) on this page: https://en.wikipedia.org/wiki/List_of_Nvidia_graphics_processing_units#GeForce_20_series
Differences, 3060 Ti vs 2080 Super: SM blocks (38 vs 48), memory bandwidth (448 vs 496 GB/s), base and boost frequencies (1410 vs 1650 MHz and 1665 vs 1815 MHz), memory frequency (14000 vs 15500 MT/s).
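To put those numbers side by side, here is a rough Python sketch that turns SM count and boost clock into a theoretical peak FP32 figure and relates it to memory bandwidth. The FP32 lanes per SM (64 for the RTX 20xx, 128 for the RTX 30xx) are my assumption taken from the public spec tables, and the function names are made up for this example. These are paper peaks only, not what a real program will reach.

```python
# Theoretical peak only: SMs x FP32 lanes per SM x 2 FLOPs (one FMA) x clock.
# Lanes per SM are an assumption from public spec tables, not measured values.
FP32_LANES_PER_SM = {"turing": 64, "ampere": 128}


def cuda_cores(sm_count, arch):
    """Total CUDA (FP32) cores for a given SM count and architecture."""
    return sm_count * FP32_LANES_PER_SM[arch]


def peak_fp32_tflops(sm_count, arch, clock_mhz):
    """Peak FP32 throughput in TFLOPS at the given clock."""
    return cuda_cores(sm_count, arch) * 2 * clock_mhz * 1e6 / 1e12


def flops_per_byte(tflops, bandwidth_gb_s):
    """Peak FLOPs available per byte of memory traffic."""
    return tflops * 1e12 / (bandwidth_gb_s * 1e9)


# (SM count, architecture, boost clock in MHz, memory bandwidth in GB/s)
cards = {
    "RTX 3060 Ti": (38, "ampere", 1665, 448),
    "RTX 2080 Super": (48, "turing", 1815, 496),
}

for name, (sms, arch, boost, bw) in cards.items():
    tflops = peak_fp32_tflops(sms, arch, boost)
    print(f"{name}: {cuda_cores(sms, arch)} cores, "
          f"{tflops:.1f} TFLOPS peak, "
          f"{flops_per_byte(tflops, bw):.1f} FLOPs per byte")
```

On paper the 3060 Ti comes out well ahead in FP32, but it has less memory bandwidth and fewer SMs, so a task that is limited by memory or by scheduling does not have to follow the paper figure.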
Can you compare the depth map generation times alone, and check roughly what the frequencies were during that process? Better still, lock the frequencies with the C:\Program Files\NVIDIA Corporation\NVSMI\nvidia-smi.exe utility to get a more accurate result.
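A minimal Python sketch of how the locking and checking could be scripted. It only builds nvidia-smi command lines using the --lock-gpu-clocks, --reset-gpu-clocks and --query-gpu options. The helper names are made up for this example, the path is the one mentioned above (newer drivers may put nvidia-smi elsewhere), and I am assuming the driver allows clock locking on your card and that you run it from an administrator prompt.

```python
import subprocess

# Path as mentioned in the post; adjust it if your driver installs nvidia-smi elsewhere.
NVIDIA_SMI = r"C:\Program Files\NVIDIA Corporation\NVSMI\nvidia-smi.exe"


def lock_clocks_cmd(mhz, gpu=0):
    """Command that pins the GPU core clock to one value (min = max)."""
    return [NVIDIA_SMI, "-i", str(gpu), f"--lock-gpu-clocks={mhz},{mhz}"]


def reset_clocks_cmd(gpu=0):
    """Command that restores the default clock behaviour."""
    return [NVIDIA_SMI, "-i", str(gpu), "--reset-gpu-clocks"]


def query_clocks_cmd(gpu=0):
    """Command that prints the current SM and memory clocks as bare CSV."""
    return [NVIDIA_SMI, "-i", str(gpu),
            "--query-gpu=clocks.sm,clocks.mem",
            "--format=csv,noheader,nounits"]


def parse_clocks(line):
    """Parse one CSV line such as '1665, 7000' into MHz values."""
    sm, mem = (int(field) for field in line.split(","))
    return {"sm_mhz": sm, "mem_mhz": mem}


def run(cmd):
    """Run a command and return its trimmed stdout; raises if nvidia-smi fails."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout.strip()
```

Usage would be something like run(lock_clocks_cmd(1665)) before starting the depth map step, parse_clocks(run(query_clocks_cmd())) a few times while it runs, and run(reset_clocks_cmd()) afterwards. The 1665 here is just the 3060 Ti boost clock from the comparison above; pick a value both cards can hold.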
There is also a chance that the LHR limit affects some kinds of processing tasks; we will see in the future. The doubled CUDA core count in the RTX 30xx generation does not automatically mean doubled performance in programs; the kernels executed on the GPU probably need some optimization. I'm not an expert in this field, but I'm somewhat interested in it.