Page 2 of 4

Re: v2.3 Vs v2.4 performance

Posted: Thu Apr 30, 2020 2:34 pm
by B.Y.O.B.
Dade wrote: Thu Apr 30, 2020 2:30 pm I assume I can regain some performance (but only in CUDA): for the textures alone, there is a "switch(textureType)" with 60+ cases, in CUDA it can be done with a jump table with function pointers.
Did you do any profiling yet to find the hotspots in the interpreter?

Re: v2.3 Vs v2.4 performance

Posted: Thu Apr 30, 2020 3:00 pm
by Dade
B.Y.O.B. wrote: Thu Apr 30, 2020 2:34 pm
Dade wrote: Thu Apr 30, 2020 2:30 pm I assume I can regain some performance (but only in CUDA): for the textures alone, there is a "switch(textureType)" with 60+ cases, in CUDA it can be done with a jump table with function pointers.
Did you do any profiling yet to find the hotspots in the interpreter?
I have (micro-)kernel execution times, each kernel does a quite specific task. The main bottle neck is mostly the BSDF sampling.

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 9:06 am
by epilectrolytics
Comparison with v2.2 is even scarier:
Screenshot.jpg

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 9:30 am
by Sharlybg
epilectrolytics wrote: Fri May 01, 2020 9:06 am Comparison with v2.2 is even scarier:
Screenshot.jpg
this is a lot even between 2.2 and 2.3 there is half performance lost. :?

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 9:46 am
by epilectrolytics
Sharlybg wrote: Fri May 01, 2020 9:30 am this is a lot even between 2.2 and 2.3 there is half performance lost. :?
Yep, I'm totally stunned.
Maybe we need Optix RTX support to regain the render speed we already had last year??

And AMD users are kind of screwed right now :(

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 1:06 pm
by acasta69
Here are results with GTX970 (2.3 vs. 2.4 build 20200430.12):

LuxCore2.3.jpg
LuxCore2.4alpha0_OpenCL.jpg
LuxCore2.4alpha0_CUDA.jpg
On this card CUDA underperforms OpenCL quite a bit... :?
Anyone else with this or similar GPU series can confirm?

P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius:

Code: Select all

[LuxRays][1.828] [MBVHKernel] Compiler options: -D LUXRAYS_OPENCL_KERNEL -D PARAM_RAY_EPSILON_MIN=0.0001f -D PARAM_RAY_EPSILON_MAX=0.1f -D LUXRAYS_OPENCL_DEVICE -cl-fast-relaxed-math -cl-mad-enable
[LuxRays][1.828] [MBVHKernel] Compiling kernels
[LuxRays][1.828] [MBVHKernel] Program cached
[LuxCore][1.859] [PathOCLRenderEngine] OpenCL task count: 262144
[LuxCore][1.859] [PathOCLBaseRenderEngine] OpenCL max. page memory size: 1048576Kbytes
[LuxCore][1.859] PhotonGI evaluating best radius

C:\Users\Alessandro\Desktop\luxcorerender-latest-win64-cuda\scenes\LuxCore2.1Benchmark\LuxCoreScene>

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 1:12 pm
by Dade
acasta69 wrote: Fri May 01, 2020 1:06 pm Anyone else with this or similar GPU series can confirm?
A 970 GTX can not be 7 times slower than a 2070 RTX :!:

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 1:17 pm
by acasta69
Dade wrote: Fri May 01, 2020 1:12 pm A 970 GTX can not be 7 times slower than a 2070 RTX :!:
Indeed! I dont' understand if it can be something broken on my system or not...
I tested both with Azure-built and self-built binaries, results are the same.

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 3:22 pm
by Dade
acasta69 wrote: Fri May 01, 2020 1:06 pm P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius:
I should have fixed this problem.

Re: v2.3 Vs v2.4 performance

Posted: Fri May 01, 2020 3:36 pm
by Dade
BTW, I'm doing the tests with PGI disabled (or the start up time may affect the stats).