Reminds me of an approach that Troy Sobotka mentioned - Arri Alexa uses to get better natural tonality, which is also the base for Filmic tone mapping.
1. A colour in the physical light quantity sense, ignoring the magic deep complexity of the psycho side of psychophysical, is expressed as ratios between three lights in an RGB system.
2. The purity of a heavily saturated colour is expressed by wide ratios between the lights. A massive intensity, coupled with two low to minimal intensities, is a very saturated primary, for example.
3. No matter how hard you try, a logarithmic or otherwise transfer function will not compress the ratios of a saturated colour. This means it will appear to “float” in your camera photo; the colour seems to persist too strongly in relation to the other colours.
4. The points above are exacerbated by the idea that sensors, and transfer functions, will always have a limited range, which is technically a “good thing”. The downside is that as the light intensities hit that limit, the original ratios will become broken and the famous colour skews take over. Skin turns nasty yellow, skies turn nasty cyan, and ever so often you’ll spot the other member of the Notorious Six, magenta.
In this case, considered as a sort of parallel to human perception of noise...
Otherwise, IMO, present solution feels like working with photography.
So to me, it's nothing wrong - dark, enclosed areas will always produce more noise, unless you:
1. render (gather light) longer
2. use finer film (better sampler)
3. set, balance the scene lighting.
Of course, all this comes valuable if & only after the scene is structured & constructed properly - in compliance with physical reality or so called ground truth... and this is only one. //
discretion advised and tolerance acceptable