Did you do any profiling yet to find the hotspots in the interpreter?Dade wrote: Thu Apr 30, 2020 2:30 pm I assume I can regain some performance (but only in CUDA): for the textures alone, there is a "switch(textureType)" with 60+ cases, in CUDA it can be done with a jump table with function pointers.
v2.3 Vs v2.4 performance
Re: v2.3 Vs v2.4 performance
Re: v2.3 Vs v2.4 performance
I have (micro-)kernel execution times, each kernel does a quite specific task. The main bottle neck is mostly the BSDF sampling.B.Y.O.B. wrote: Thu Apr 30, 2020 2:34 pmDid you do any profiling yet to find the hotspots in the interpreter?Dade wrote: Thu Apr 30, 2020 2:30 pm I assume I can regain some performance (but only in CUDA): for the textures alone, there is a "switch(textureType)" with 60+ cases, in CUDA it can be done with a jump table with function pointers.
-
- Donor
- Posts: 814
- Joined: Thu Oct 04, 2018 6:06 am
Re: v2.3 Vs v2.4 performance
Comparison with v2.2 is even scarier:
Re: v2.3 Vs v2.4 performance
this is a lot even between 2.2 and 2.3 there is half performance lost.epilectrolytics wrote: Fri May 01, 2020 9:06 am Comparison with v2.2 is even scarier:
Screenshot.jpg

-
- Donor
- Posts: 814
- Joined: Thu Oct 04, 2018 6:06 am
Re: v2.3 Vs v2.4 performance
Yep, I'm totally stunned.Sharlybg wrote: Fri May 01, 2020 9:30 am this is a lot even between 2.2 and 2.3 there is half performance lost.![]()
Maybe we need Optix RTX support to regain the render speed we already had last year??
And AMD users are kind of screwed right now

Re: v2.3 Vs v2.4 performance
Here are results with GTX970 (2.3 vs. 2.4 build 20200430.12):
On this card CUDA underperforms OpenCL quite a bit...
Anyone else with this or similar GPU series can confirm?
P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius:
On this card CUDA underperforms OpenCL quite a bit...

Anyone else with this or similar GPU series can confirm?
P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius:
Code: Select all
[LuxRays][1.828] [MBVHKernel] Compiler options: -D LUXRAYS_OPENCL_KERNEL -D PARAM_RAY_EPSILON_MIN=0.0001f -D PARAM_RAY_EPSILON_MAX=0.1f -D LUXRAYS_OPENCL_DEVICE -cl-fast-relaxed-math -cl-mad-enable
[LuxRays][1.828] [MBVHKernel] Compiling kernels
[LuxRays][1.828] [MBVHKernel] Program cached
[LuxCore][1.859] [PathOCLRenderEngine] OpenCL task count: 262144
[LuxCore][1.859] [PathOCLBaseRenderEngine] OpenCL max. page memory size: 1048576Kbytes
[LuxCore][1.859] PhotonGI evaluating best radius
C:\Users\Alessandro\Desktop\luxcorerender-latest-win64-cuda\scenes\LuxCore2.1Benchmark\LuxCoreScene>
Re: v2.3 Vs v2.4 performance
A 970 GTX can not be 7 times slower than a 2070 RTX

Re: v2.3 Vs v2.4 performance
Indeed! I dont' understand if it can be something broken on my system or not...
I tested both with Azure-built and self-built binaries, results are the same.
Re: v2.3 Vs v2.4 performance
I should have fixed this problem.acasta69 wrote: Fri May 01, 2020 1:06 pm P.S. With the very latest build, LuxCoreUI exits without any error message while evaluating PhotonGI best radius: