Page 2 of 4
Re: v2.3 Vs v2.4 performance
Posted: Thu Apr 30, 2020 2:34 pm
by B.Y.O.B.
Dade wrote: ↑Thu Apr 30, 2020 2:30 pm
I assume I can regain some performance (but only in CUDA): for the textures alone, there is a "switch(textureType)" with 60+ cases, in CUDA it can be done with a jump table with function pointers.
Did you do any profiling yet to find the hotspots in the interpreter?
Re: v2.3 Vs v2.4 performance
Posted: Thu Apr 30, 2020 3:00 pm
by Dade
B.Y.O.B. wrote: ↑Thu Apr 30, 2020 2:34 pm
Dade wrote: ↑Thu Apr 30, 2020 2:30 pm
I assume I can regain some performance (but only in CUDA): for the textures alone, there is a "switch(textureType)" with 60+ cases, in CUDA it can be done with a jump table with function pointers.
Did you do any profiling yet to find the hotspots in the interpreter?
I have (micro-)kernel execution times, each kernel does a quite specific task. The main bottle neck is mostly the BSDF sampling.
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 9:06 am
by epilectrolytics
Comparison with v2.2 is even scarier:
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 9:30 am
by Sharlybg
epilectrolytics wrote: ↑Fri May 01, 2020 9:06 am
Comparison with v2.2 is even scarier:
Screenshot.jpg
this is a lot even between 2.2 and 2.3 there is half performance lost.
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 9:46 am
by epilectrolytics
Sharlybg wrote: ↑Fri May 01, 2020 9:30 am
this is a lot even between 2.2 and 2.3 there is half performance lost.
Yep, I'm totally stunned.
Maybe we need Optix RTX support to regain the render speed we already had last year??
And AMD users are kind of screwed right now
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 1:06 pm
by acasta69
Here are results with GTX970 (2.3 vs. 2.4 build 20200430.12):
On this card CUDA underperforms OpenCL quite a bit...
Anyone else with this or similar GPU series can confirm?
P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius:
Code: Select all
[LuxRays][1.828] [MBVHKernel] Compiler options: -D LUXRAYS_OPENCL_KERNEL -D PARAM_RAY_EPSILON_MIN=0.0001f -D PARAM_RAY_EPSILON_MAX=0.1f -D LUXRAYS_OPENCL_DEVICE -cl-fast-relaxed-math -cl-mad-enable
[LuxRays][1.828] [MBVHKernel] Compiling kernels
[LuxRays][1.828] [MBVHKernel] Program cached
[LuxCore][1.859] [PathOCLRenderEngine] OpenCL task count: 262144
[LuxCore][1.859] [PathOCLBaseRenderEngine] OpenCL max. page memory size: 1048576Kbytes
[LuxCore][1.859] PhotonGI evaluating best radius
C:\Users\Alessandro\Desktop\luxcorerender-latest-win64-cuda\scenes\LuxCore2.1Benchmark\LuxCoreScene>
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 1:12 pm
by Dade
acasta69 wrote: ↑Fri May 01, 2020 1:06 pm
Anyone else with this or similar GPU series can confirm?
A 970 GTX can not be 7 times slower than a 2070 RTX
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 1:17 pm
by acasta69
Dade wrote: ↑Fri May 01, 2020 1:12 pm
A 970 GTX can not be 7 times slower than a 2070 RTX
Indeed! I dont' understand if it can be something broken on my system or not...
I tested both with Azure-built and self-built binaries, results are the same.
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 3:22 pm
by Dade
acasta69 wrote: ↑Fri May 01, 2020 1:06 pm
P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius:
I should have fixed this problem.
Re: v2.3 Vs v2.4 performance
Posted: Fri May 01, 2020 3:36 pm
by Dade
BTW, I'm doing the tests with PGI disabled (or the start up time may affect the stats).