Re: Viewport render crash on GPU Cuda
Posted: Thu Jul 02, 2020 6:30 pm
As an add-on to my post above, I had a LuxCore 2.4 alpha install and tried it and the issue does not happen in that.
Show your work, get help, participate in development
http://forums.luxcorerender.org/
When you enable light tracing, BlendLuxCore will use PATHOCL render engine instead of RTPATHOCL. PATHOCL is the render engine for final render (so there is no difference between view port rendering and final rendering in this case).
RTPATHOCL equally splits the image across available GPUs so your 2070 renders half of the image while the 1060 renders the other half (i.e. the 1060 is slowing down your rendering with RTPATHOCL, you should use it only the 2070 for view port rendering).
Code: Select all
[Engine/Final] Rendering layer "View Layer"
[Exporter] Creating session
[SDL][164.640] Define ImageMap: NamedObject
[SDL][164.640] Camera type: perspective
[SDL][164.640] Camera position: Point[1.70621, -5.51184, 0.688014]
[SDL][164.640] Camera target: Point[1.41258, -4.56328, 0.56961]
[SDL][164.640] Camera clipping plane disabled
WARNING: No material defined
[SDL][164.640] Material definition: __CLAY__
[SDL][164.640] Scene objects count: 1
[SDL][164.640] Light definition: __WORLD_BACKGROUND_LIGHT__
[SDL][164.640] Reading texture map: W:\TEXTURES\HDRI\517\Indoor\aft_lounge_8k.hdr
[LuxCore][166.109] Configuration:
[LuxCore][166.109] path.pathdepth.total = 7
[LuxCore][166.109] path.pathdepth.diffuse = 5
[LuxCore][166.109] path.pathdepth.glossy = 5
[LuxCore][166.109] path.pathdepth.specular = 6
[LuxCore][166.109] path.hybridbackforward.enable = 0
[LuxCore][166.109] path.hybridbackforward.partition = 0
[LuxCore][166.109] path.hybridbackforward.glossinessthreshold = 0.048999998718500137
[LuxCore][166.109] opencl.cpu.use = 0
[LuxCore][166.109] opencl.gpu.use = 1
[LuxCore][166.109] opencl.devices.select = "0011"
[LuxCore][166.109] film.noiseestimation.warmup = 8
[LuxCore][166.109] film.noiseestimation.step = 32
[LuxCore][166.109] sampler.sobol.adaptive.strength = 0.89999997615814209
[LuxCore][166.109] sampler.sobol.bucketsize = 16
[LuxCore][166.109] sampler.sobol.tilesize = 16
[LuxCore][166.109] sampler.sobol.supersampling = 1
[LuxCore][166.109] sampler.sobol.overlapping = 1
[LuxCore][166.109] renderengine.type = "PATHOCL"
[LuxCore][166.109] sampler.type = "SOBOL"
[LuxCore][166.109] film.width = 1920
[LuxCore][166.109] film.height = 1080
[LuxCore][166.109] film.filter.type = "BLACKMANHARRIS"
[LuxCore][166.109] film.filter.width = 1.5
[LuxCore][166.109] lightstrategy.type = "LOG_POWER"
[LuxCore][166.109] scene.epsilon.min = 9.9999997473787516e-06
[LuxCore][166.109] scene.epsilon.max = 0.10000000149011612
[LuxCore][166.109] film.opencl.enable = 1
[LuxCore][166.109] film.opencl.device = 2
[LuxCore][166.109] path.forceblackbackground.enable = 0
[LuxCore][166.109] context.cuda.optix.enable = 1
[LuxCore][166.109] renderengine.seed = 1
[LuxCore][166.109] film.outputs.0.type = "RGB_IMAGEPIPELINE"
[LuxCore][166.109] film.outputs.0.index = 0
[LuxCore][166.109] film.outputs.0.filename = "RGB_IMAGEPIPELINE_0.png"
[LuxCore][166.109] film.outputs.1.type = "ALBEDO"
[LuxCore][166.109] film.outputs.1.filename = "ALBEDO.exr"
[LuxCore][166.109] film.outputs.2.type = "AVG_SHADING_NORMAL"
[LuxCore][166.109] film.outputs.2.filename = "AVG_SHADING_NORMAL.exr"
[LuxCore][166.125] film.outputs.3.type = "RGB_IMAGEPIPELINE"
[LuxCore][166.125] film.outputs.3.index = 1
[LuxCore][166.125] film.outputs.3.filename = "RGB_IMAGEPIPELINE_1.png"
[LuxCore][166.125] film.outputs.4.type = "RGB_IMAGEPIPELINE"
[LuxCore][166.125] film.outputs.4.index = 2
[LuxCore][166.125] film.outputs.4.filename = "RGB_IMAGEPIPELINE_2.png"
[LuxCore][166.125] film.imagepipelines.001.0.type = "INTEL_OIDN"
[LuxCore][166.125] film.imagepipelines.001.0.oidnmemory = 6000
[LuxCore][166.125] film.imagepipelines.001.0.sharpness = 0
[LuxCore][166.125] film.imagepipelines.001.1.type = "NOP"
[LuxCore][166.125] film.imagepipelines.001.2.type = "TONEMAP_LINEAR"
[LuxCore][166.125] film.imagepipelines.001.2.scale = 1
[LuxCore][166.125] film.imagepipelines.001.radiancescales.0.enabled = 1
[LuxCore][166.125] film.imagepipelines.001.radiancescales.0.globalscale = 1
[LuxCore][166.125] film.imagepipelines.001.radiancescales.0.rgbscale = 1 1 1
[LuxCore][166.125] film.imagepipelines.002.0.type = "NOP"
[LuxCore][166.125] film.imagepipelines.002.1.type = "TONEMAP_LINEAR"
[LuxCore][166.125] film.imagepipelines.002.1.scale = 1
[LuxCore][166.125] film.imagepipelines.002.radiancescales.0.enabled = 1
[LuxCore][166.125] film.imagepipelines.002.radiancescales.0.globalscale = 1
[LuxCore][166.125] film.imagepipelines.002.radiancescales.0.rgbscale = 1 1 1
[LuxCore][166.125] film.imagepipelines.002.2.type = "GAMMA_CORRECTION"
[LuxCore][166.125] film.imagepipelines.002.2.value = 2.2000000000000002
[LuxCore][166.125] film.noiseestimation.index = 2
[LuxCore][166.125] film.imagepipelines.000.0.type = "NOP"
[LuxCore][166.125] film.imagepipelines.000.1.type = "TONEMAP_LINEAR"
[LuxCore][166.125] film.imagepipelines.000.1.scale = 1
[LuxCore][166.125] film.imagepipelines.000.radiancescales.0.enabled = 1
[LuxCore][166.125] film.imagepipelines.000.radiancescales.0.globalscale = 1
[LuxCore][166.125] film.imagepipelines.000.radiancescales.0.rgbscale = 1 1 1
[LuxCore][166.125] batch.haltspp = 1,000 0
[LuxCore][166.125] batch.halttime = 0
[LuxCore][166.125] File Name Resolver Configuration:
Export took 1.5 s
====================================================================
Step: 0/2
Creating kernel cache entry with configuration properties:
renderengine.type = "PATHOCL"
sampler.type = "SOBOL"
scene.epsilon.min = 9.9999997473787516e-06
scene.epsilon.max = 0.10000000149011612
opencl.devices.select = "0011"
And scene properties:
[SDL][166.140] Define ImageMap: NamedObject
[SDL][166.140] Define ImageMap: image.png
[SDL][166.140] Camera type: perspective
[SDL][166.140] Camera position: Point[1, 6, 3]
[SDL][166.140] Camera target: Point[0, 0, 0.5]
[SDL][166.140] Camera clipping plane disabled
[SDL][166.140] Light definition: infinite_light
[SDL][166.140] Material definition: matte_mat
[SDL][166.140] Scene objects count: 1
[SDL][166.140] Texture definition: constfloat3_tex
[SDL][166.140] Material definition: constfloat3_tmat
[SDL][166.140] Scene objects count: 1
[LuxCore][166.140] Configuration:
[LuxCore][166.140] renderengine.type = "PATHOCL"
[LuxCore][166.140] sampler.type = "SOBOL"
[LuxCore][166.140] scene.epsilon.min = 9.9999997473787516e-06
[LuxCore][166.140] scene.epsilon.max = 0.10000000149011612
[LuxCore][166.140] opencl.devices.select = "0011"
[LuxCore][166.140] film.outputs.1.type = "RGB_IMAGEPIPELINE"
[LuxCore][166.140] film.outputs.1.filename = "image.png"
[LuxCore][166.140] File Name Resolver Configuration:
[LuxCore][166.140] Film resolution: 640x640
[SDL][166.140] Film output definition: RGB_IMAGEPIPELINE [image.png]
[SDL][166.140] Film output definition: RGB_IMAGEPIPELINE [image.png]
[LuxRays][166.140] OpenCL support: enabled
[LuxRays][166.140] OpenCL Platform 0: NVIDIA CUDA
[LuxRays][166.140] CUDA support: enabled
[LuxRays][166.140] CUDA support: available
[LuxRays][166.140] CUDA driver version: 11.0
[LuxRays][166.140] CUDA device count: 2
[LuxRays][166.140] Optix support: available
[LuxRays][166.140] Device 0 name: Native
[LuxRays][166.140] Device 0 type: NATIVE_THREAD
[LuxRays][166.140] Device 0 compute units: 1
[LuxRays][166.140] Device 0 preferred float vector width: 4
[LuxRays][166.140] Device 0 max allocable memory: 17592186044415MBytes
[LuxRays][166.140] Device 0 max allocable memory block size: 17592186044415MBytes
[LuxRays][166.140] Device 0 has out of core memory support: 0
[LuxRays][166.140] Device 1 name: GeForce GTX 1080 Ti
[LuxRays][166.140] Device 1 type: OPENCL_GPU
[LuxRays][166.140] Device 1 compute units: 28
[LuxRays][166.140] Device 1 preferred float vector width: 1
[LuxRays][166.140] Device 1 max allocable memory: 11264MBytes
[LuxRays][166.140] Device 1 max allocable memory block size: 2816MBytes
[LuxRays][166.140] Device 1 has out of core memory support: 0
[LuxRays][166.140] Device 2 name: GeForce GTX 780
[LuxRays][166.140] Device 2 type: OPENCL_GPU
[LuxRays][166.140] Device 2 compute units: 12
[LuxRays][166.140] Device 2 preferred float vector width: 1
[LuxRays][166.156] Device 2 max allocable memory: 3072MBytes
[LuxRays][166.156] Device 2 max allocable memory block size: 768MBytes
[LuxRays][166.156] Device 2 has out of core memory support: 0
[LuxRays][166.156] Device 3 name: GeForce GTX 1080 Ti
[LuxRays][166.156] Device 3 type: CUDA_GPU
[LuxRays][166.156] Device 3 compute units: 128
[LuxRays][166.156] Device 3 preferred float vector width: 1
[LuxRays][166.156] Device 3 max allocable memory: 11264MBytes
[LuxRays][166.156] Device 3 max allocable memory block size: 17592186044415MBytes
[LuxRays][166.156] Device 3 has out of core memory support: 1
[LuxRays][166.156] Device 3 CUDA compute capability: 6.1
[LuxRays][166.156] Device 4 name: GeForce GTX 780
[LuxRays][166.156] Device 4 type: CUDA_GPU
[LuxRays][166.156] Device 4 compute units: 192
[LuxRays][166.156] Device 4 preferred float vector width: 1
[LuxRays][166.156] Device 4 max allocable memory: 3072MBytes
[LuxRays][166.156] Device 4 max allocable memory block size: 17592186044415MBytes
[LuxRays][166.156] Device 4 has out of core memory support: 1
[LuxRays][166.156] Device 4 CUDA compute capability: 3.5
[LuxRays][166.156] Creating 10 intersection device(s)
[LuxRays][166.156] Allocating intersection device 0: GeForce GTX 1080 Ti (Type = CUDA_GPU)
[LuxRays][166.218] Allocating intersection device 1: GeForce GTX 780 (Type = CUDA_GPU)
[LuxRays][166.250] Allocating intersection device 2: Native (Type = NATIVE_THREAD)
[LuxRays][166.250] Allocating intersection device 3: Native (Type = NATIVE_THREAD)
[LuxRays][166.265] Allocating intersection device 4: Native (Type = NATIVE_THREAD)
[LuxRays][166.265] Allocating intersection device 5: Native (Type = NATIVE_THREAD)
[LuxRays][166.265] Allocating intersection device 6: Native (Type = NATIVE_THREAD)
[LuxRays][166.265] Allocating intersection device 7: Native (Type = NATIVE_THREAD)
[LuxRays][166.265] Allocating intersection device 8: Native (Type = NATIVE_THREAD)
[LuxRays][166.265] Allocating intersection device 9: Native (Type = NATIVE_THREAD)
[LuxCore][166.265] CUDA devices used:
[LuxCore][166.265] [GeForce GTX 1080 Ti CUDAIntersect (Optix enabled: 0)]
[LuxCore][166.265] [GeForce GTX 780 CUDAIntersect (Optix enabled: 0)]
[LuxCore][166.265] OpenCL devices used:
[LuxCore][166.265] Native devices used: 8
[LuxCore][166.265] Configuring 2 OpenCL render threads
[LuxCore][166.265] Configuring 8 native render threads
[LuxRays][166.265] Preprocessing DataSet
[LuxRays][166.265] Total vertex count: 48
[LuxRays][166.265] Total triangle count: 24
[LuxRays][166.265] Preprocessing DataSet done
[LuxRays][166.265] Adding DataSet accelerator: BVH
[LuxRays][166.265] Total vertex count: 48
[LuxRays][166.265] Total triangle count: 24
[LuxRays][166.265] BVH Dataset preprocessing time: 0ms
[LuxRays][166.265] BVH builder: EMBREE_BINNED_SAH
[LuxRays][166.265] BVH build hierarchy time: 0ms
[LuxRays][166.265] BVH total build time: 0ms
[LuxRays][166.265] Total BVH memory usage: 1Kbytes
[LuxRays][166.265] Adding DataSet accelerator: EMBREE
[LuxRays][166.265] Total vertex count: 48
[LuxRays][166.265] Total triangle count: 24
[LuxRays][166.265] EmbreeAccel build time: 0ms
[LuxRays][166.265] [Device GeForce GTX 1080 Ti CUDAIntersect] BVH mesh vertices buffer size: 576bytes
[LuxRays][166.265] [Device GeForce GTX 1080 Ti CUDAIntersect] BVH nodes buffer size: 1248bytes
[LuxRays][166.265] [BVHKernel] Compiler options: -D LUXRAYS_OPENCL_KERNEL -D PARAM_RAY_EPSILON_MIN=1e-05f -D PARAM_RAY_EPSILON_MAX=0.1f -D LUXRAYS_CUDA_DEVICE -D LUXRAYS_OS_WINDOWS --use_fast_math
[LuxRays][166.265] [BVHKernel] Compiling kernels
[LuxRays][166.265] [BVHKernel] Program cached
[LuxRays][166.265] [Device GeForce GTX 780 CUDAIntersect] BVH mesh vertices buffer size: 576bytes
[LuxRays][166.265] [Device GeForce GTX 780 CUDAIntersect] BVH nodes buffer size: 1248bytes
[LuxRays][166.265] [BVHKernel] Compiler options: -D LUXRAYS_OPENCL_KERNEL -D PARAM_RAY_EPSILON_MIN=1e-05f -D PARAM_RAY_EPSILON_MAX=0.1f -D LUXRAYS_CUDA_DEVICE -D LUXRAYS_OS_WINDOWS --use_fast_math
[LuxRays][166.265] [BVHKernel] Compiling kernels
[LuxRays][166.265] [BVHKernel] Program cached
[LuxCore][166.281] [PathOCLRenderEngine] OpenCL task count: 131072
[LuxCore][166.281] [PathOCLBaseRenderEngine] OpenCL max. page memory size: 18014398509481983Kbytes
[LuxCore][166.281] Compile Geometry
[LuxCore][166.281] Scene geometry compilation time: 0ms
[LuxCore][166.281] Compile 2 Textures
[LuxCore][166.281] Texture evaluation ops count: 6
[LuxCore][166.281] Texture evaluation max. stack size: 3
[LuxCore][166.281] Textures compilation time: 0ms
[LuxCore][166.281] Compile 2 Materials
[LuxCore][166.281] Material evaluation ops count: 14
[LuxCore][166.281] Material evaluation max. stack size: 8
[LuxCore][166.281] Material compilation time: 0ms
[LuxCore][166.281] Compile Lights
[LuxCore][166.281] Lights compilation time: 0ms
[LuxCore][166.281] Compile ImageMaps
[LuxCore][166.281] Image maps page(s) count: 1
[LuxCore][166.281] RGB channel page 0 size: 3264Kbytes
[LuxCore][166.281] Image maps compilation time: 0ms
[LuxCore][166.281] Always enabled OpenCL code:
[LuxCore][166.281] Compile Geometry
[LuxCore][166.281] Scene geometry compilation time: 0ms
[LuxCore][166.281] Compile 2 Textures
[LuxCore][166.281] Texture evaluation ops count: 6
[LuxCore][166.281] Texture evaluation max. stack size: 3
[LuxCore][166.281] Textures compilation time: 0ms
[LuxCore][166.281] Compile 2 Materials
[LuxCore][166.281] Material evaluation ops count: 14
[LuxCore][166.281] Material evaluation max. stack size: 8
[LuxCore][166.281] Material compilation time: 0ms
[LuxCore][166.281] Compile Lights
[LuxCore][166.297] Lights compilation time: 0ms
[LuxCore][166.297] Compile ImageMaps
[LuxCore][166.297] Image maps page(s) count: 1
[LuxCore][166.297] RGB channel page 0 size: 3264Kbytes
[LuxCore][166.297] Image maps compilation time: 0ms
[LuxCore][166.297] Starting 2 OpenCL render threads
[LuxRays][166.297] [Device GeForce GTX 1080 Ti CUDAIntersect] RADIANCE_PER_PIXEL_NORMALIZEDs[0] buffer size: 6400Kbytes
[LuxRays][166.297] [Device GeForce GTX 1080 Ti CUDAIntersect] NOISE buffer size: 1600Kbytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] RADIANCE_PER_PIXEL_NORMALIZEDs[0] buffer size: 6400Kbytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Camera buffer size: 5488bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Triangle normals buffer size: 288bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Vertices buffer size: 576bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Triangles buffer size: 288bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Mesh description buffer size: 624bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] ImageMap descriptions buffer size: 64bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] ImageMaps buffer size: 3264Kbytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Textures buffer size: 592bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Texture evaluation ops buffer size: 48bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Texture evaluation stacks buffer size: 1536Kbytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Materials buffer size: 456bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Material evaluation ops buffer size: 168bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Material evaluation stacks buffer size: 4096Kbytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Scene objects buffer size: 48bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Lights buffer size: 332bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Env. light indices buffer size: 4bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Light offsets (Part I) buffer size: 8bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Env. light distributions buffer size: 516Kbytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] LightsDistribution buffer size: 16bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] InfiniteLightSourcesDistribution buffer size: 16bytes
[LuxRays][166.312] [Device GeForce GTX 1080 Ti CUDAIntersect] Ray buffer size: 6144Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] RayHit buffer size: 2560Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] GPUTaskConfiguration buffer size: 328bytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] GPUTask buffer size: 84992Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] GPUTaskDirectLight buffer size: 7680Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] GPUTaskState buffer size: 50176Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] GPUTask Stats buffer size: 512Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] SamplerSharedData buffer size: 1608Kbytes
[LuxCore][166.328] [PathOCLBaseRenderThread::0] Size of a Sample: 40bytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] Sample buffer size: 5120Kbytes
[LuxCore][166.328] [PathOCLBaseRenderThread::0] Size of a SampleData: 8bytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] SampleData buffer size: 1024Kbytes
[LuxCore][166.328] [PathOCLBaseRenderThread::0] Size of a SampleResult: 428bytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] SampleResult buffer size: 54784Kbytes
[LuxRays][166.328] [Device GeForce GTX 1080 Ti CUDAIntersect] PathInfo buffer size: 13824Kbytes
[LuxRays][166.343] [Device GeForce GTX 1080 Ti CUDAIntersect] DirectLightVolumeInfo buffer size: 5632Kbytes
[LuxRays][166.343] [Device GeForce GTX 1080 Ti CUDAIntersect] Pixel Filter Distribution buffer size: 33Kbytes
[LuxCore][166.343] [PathOCLBaseRenderThread::0] Compiling kernels
[LuxRays][166.343] [PathOCL kernel] Compiler options: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D RENDER_ENGINE_PATHOCL -D PARAM_RAY_EPSILON_MIN=1e-05f -D PARAM_RAY_EPSILON_MAX=0.1f -D LUXRAYS_CUDA_DEVICE -D LUXRAYS_OS_WINDOWS --use_fast_math
[LuxRays][166.343] [PathOCL kernel] Compiling kernels
around 30 minutes. Not specifically in what I copied from the console above. That time I only ran a render so I could copy what was happening for this post and I only waited a few minutes after copying, but I tried well over 30 multiple times. When it was working it was taking around a minute or so on my 1080ti on a large scene so I know it takes a minute but it really LOCKS UP locks up now. Before I could see the progress bar moving in blender, now it just stays at 0. Compiling kernel would usually take a few moments and then it would add parenthesis and tell me what it was compiling, nothing now. It just sits there for half an hour laughing at me lol but in all seriousness after using luxcore for the last couple months I have not had a problem like this that completely locksup blender over a cube and an IBL even if I walk away and watch a show and come back.