Color Science differences

link

JACS
Members 878 posts

March 23, 2024, 4:50 p.m. March 23, 2024, 4:50 p.m.
link

[deleted]
link

bobn2
Team 2240 posts

March 23, 2024, 5:54 p.m. March 23, 2024, 5:54 p.m.
link

@JACS has written:

It is "all" from a purist point of view as I said. Next, you are just rephrasing what I wrote in the parentheses, and clarified in other posts. If you are talking about CFA same as our spectral sensitivities, this is too restrictive, and it is probably even sub-optimal for noise. The IL condition asks for three linearly independent combinations of those sensitivities

Great if we agree. I possibly interpreted what you said differently from how you intended. I didn't talk about CFA the same as our spectral sensitivities, though that would be the only way to get entirely 'accurate' colour - simply because any other response would sift the spectral energies differently, and thus sort the photons differently than a human viewer would. The LIM conditions refer to a model that differs from the ground truth of human perception for reasons of mathematical convenience. The model's close enough to have served us well for 90+ years, but shouldn't be confused with the 'purist' actuality.

@JACS has written:

No. First, for an image displayed on the screen, the ambient light does not matter much for what we view on the screen.

Actually, it does. It doesn't appear to do so because our display devices are designed to work with typical interior illumination conditions. View them in different conditions and the colours go off.

@JACS has written:

Next, I said nothing about prints.

Sure, I didn't say that you did. I was just pointing out that what we call 'colour science', the CIE work was originally about prints and printing.

@JACS has written:

In the sense I used it, it is a mismatch between the spectral sensitivity of the camera, and the observer. Then the camera would distinguish spectral distributions we cannot; but then it would fail to distinguish ones that we can. Those two go together. This is basic linear algebra

I can understand why your DPReview interlocutor quibbled about it, because i think that's quite an idiosyncratic definition. On the 'basic linear algebra', the point I made above applies. That it is true only of the CIE model, not the ground truth - though as I said there, the CIE model has served us well.

@JACS has written:

Quoted message:

The key thing here is that colour photographe does not try to capture the spectral disributions of the objects in the scene. What it's trying to capture is the three human visual stimuli that scene would produce, or at least sufficient information to allow them to be reproduced.

This is more or less what I said.

Good.
link

Andrew564
Members 86 posts

March 23, 2024, 6:23 p.m. March 23, 2024, 6:23 p.m.
link

@JACS has written:

Quoted message:

[quote="@Andrew564"]

JACS seems to be talking about preserving the integrity of the input data and so if you matched that in the output then the eye would have the same response and so you could forget about it.

But how does this work when your output device is a computer screen? Which is an optical illusion calibrated specifically to the human eye. The nature of the light emitted from your pic of a daffodil on screen is completely different to the yellow reflected of a daffodil in sunlight and so the human eye's response being mainly chemical will not be the same. We are also talking about an output device that must be calibrated to how a human eye perceives colour, and to be relevant it must also be calibrated to how it sees colour under reference ambient light. This is the only way you can have calibrated colour on an output device that uses additive colour.

It does not need to worry about the ambient light (of the scene, I guess). It might be a white rose in red light or a red one in neutral light, it still looks red, and you may want to present it to the viewer as that. After all, we do not want to undo color on a movie screen, right? The screen is white, is that what you want to see in a photo? 😃

I have a real problem understanding your point sometimes, there was a half sentence a while back that I misinterpreted. I also missed the point of your laser narrow, my knowledge about screens needs updating. If we are talking about the perception of red then rather than the whole, yes. If we are talking about perception of colour and the accuracy of that system being defined by reference colour viewed and not by measurement and preserving actual recorded wavelengths, yes. Accurate colour also becomes a necessity of the theoretical model rather than a truth that must be preserved on output, yes let the eye take care of the resulting inconsistencies that exist in the system. You see I'm not a purist and don't think accurate colour has any real importance outside corporate logos. I'm also deliberately avoiding any mathematical model fo colour as I would much rather develop a more abstract connection. Not really sure what the second sentence means.
link

Andrew564
Members 86 posts

March 23, 2024, 6:39 p.m. March 23, 2024, 6:39 p.m.
link

@bobn2 has written:

@JACS has written:

Quoted message:

The key thing here is that colour photographe does not try to capture the spectral disributions of the objects in the scene. What it's trying to capture is the three human visual stimuli that scene would produce, or at least sufficient information to allow them to be reproduced.

This is more or less what I said.

Good.

I'm good with this as well.
link

ArvoJ
Team 1067 posts

March 23, 2024, 7:39 p.m. March 23, 2024, 7:39 p.m.
link

@JACS has written:

@ArvoJ has written:

@JACS has written:

@ArvoJ has written:

@JACS has written:

If there is metamerism at capture, then there will be in the output, no matter what.

Please explain.

Joofa at dpreview had a good sketch, which I cannot find now. Say that we have a 2D space of spectral densities, and our eyes see an 1D projection only. Project first to the x-axis, then to a slightly slanted one (perpendicularly to it). Some points which projected to distinct points before, project to the same one now (metamers). Also, some points which you could not distinguish before, are distinguishable now.

As an example, assume a 4D space of spectral densities. Say that we have (1,2,3,4) but we can only see (1,2,3). Your camera sees (2,3,4) though. You can only say that the actual "color" was (x,2,3) and you have no idea what x is. Garbage in - garbage out, as they say. You also have that "4" but your eyes would get no info about it if they look at the scene, and it does not help you to find that x anyway.

I see...

You have developed terminology shift. Metamerism itself is purely physiological term, which describes human perception - seeing different 'spectral combinations' as same color. What you describe, could be called 'metameric failure', 'metamerism errors' or something similar.

We are arguing what the meaning of the words "is" is now? All those are connected, one leads to the other. Also, there is the notion of device metamerism (wikipedia).

Actually yes, we are arguing about terminology. Look, I just want to expand my limited knowledge and if your words meaning contradicts with what I have known previously, then I have question - is it problem with your words or my understanding? Thereby I asked for explanation, and from your explanation I concluded that you were talking about metamerism errors.
(About device metamerism - I agree I used this 'metamerism' term in a limited sense; this doesn't change basis of my confusion.)

Quoted message:

Quoted message:

Then - your examples of 4D are not relevant. All photography and imaging talk about metamerism is limited to visual range wihout any other, invisible dimension. Simplest model is 1D - spectral distribution of received light, next iteration would be 2D - spectrum plus light intensity (human eye receptors sensitivities depend on intensity in little different way).

This is fundamentally wrong. The functions (say, smooth) of one variable form an infinitely dimensional space.

Do you mean 'single' wavelength as a dimension? Even then I cannot get the idea of your second example.

Algebra is one of my weakest points. All those 'infinite dimension property spaces' and similar things do not talk to me :(

Quoted message:

Quoted message:

In other words - problem is not in frequencies, what we can not see, problem is that eye response to different spectral composition is not the same as sensor response. And even if it would the same (technically not impossible, but like you said, has some side effects), then what we could do with recorded information? Send directly into our eye nerves? This could work :)

So you are saying that the IL condition is nonsense?

Where did I say that? You implied it from "not the same" part? Was not intended this way.

Quoted message:

Quoted message:

But we need to reproduce it with help of some visualisation medium, like computer screen or printed paper - and here we have much bigger problems - there is no set of three independent color channels, able to generate similar response in our eyes as original does.

Of course, there is, withing the limits of the gamut. In fact, you are looking at such a set right now by reading this.

How do we construct this set of channels (say RGB screen) then? I know that we have screens with enough good color reproduction, but can those devices (screens) made ideal (create exactly similar eye response as the recorded scene creates, be it within the gamut then)?
link

Andrew564
Members 86 posts

March 23, 2024, 10:21 p.m. March 23, 2024, 10:21 p.m.
link

@ArvoJ has written:

Quoted message:

Quoted message:

But we need to reproduce it with help of some visualisation medium, like computer screen or printed paper - and here we have much bigger problems - there is no set of three independent color channels, able to generate similar response in our eyes as original does.

Of course, there is, withing the limits of the gamut. In fact, you are looking at such a set right now by reading this.

How do we construct this set of channels (say RGB screen) then? I know that we have screens with enough good color reproduction, but can those devices (screens) made ideal (create exactly similar eye response as the recorded scene creates, be it within the gamut then)?

I think you mean that you can't generate the exact same light as the eye saw, you can certainly trigger a similar response in the eye. But the limitations of the capture device, or the way it differs from the human eye, combined with the limitations of the display device introduces limits that doesn't quite match that of the eye. It is only possible to reproduce the colour within the limits of the devices, or their colour gamut.

[EDIT] If you are talking about the light from the actual scene then you can only talk about approximate because colour in combination can produce non-linear responses in the eye and real scenes are often lit by several sources with varying WB rather than the global one assumed.

Colour is always multiple wavelengths never single ones.
link

JACS
Members 878 posts

March 24, 2024, 1:19 a.m. March 24, 2024, 1:19 a.m.
link

[deleted]
link

bobn2
Team 2240 posts

March 24, 2024, 8:56 a.m. March 24, 2024, 8:56 a.m.
link

@JACS has written:

OK, some basic linear algebra. Take the functions f_1=1, and f_2=x on whatever fixed interval. Thery are linearly independent because no non-trivial (meaning having zero coefficients) linear combinations is zero. Now, add f_3=x^3 to the set. Still linearly independent. You can go like this forever, which implies that the dimension of the set of all functions on that interval is infinite.

Some terminology for the non-mathematicians
'interval' is a set of values across which you're looking at the function. So 'on whatever fixed interval' means we're comparing the functions over the same range.
'Linearly independent' - cannot be derived from each other by a fixed multiplier.

@JACS has written:

In our case, those functions are the spectral densities. Their space is infinitely dimensional but your cameras sees a 3D subspace. So do your eyes. Unfortunately, they are not the same subspaces but as I said earlier, they are kind of close.

Conceptually colour vision could have as many 'channels' as you want. For humans it is 3, hence a '3D subspace'. For most mammals it would be 2 (2D subspace). For most birds it would be 4 (4D subspace). For some birds and turtles it would be 5 (5D subspace). The dimensionality of the space reflects the number of colour stimuli in teh organism for which you're trying to provide colour reproduction.

@JACS has written:

Yes, they can, if you had the right information (the IL condition).

Usually called the 'Luther-Ives-Maxwell' (LIM) conditions, sometimes 'LI', independently derived by these three scientists. They say that you can use any three functions, so long as the ones that you actually want can be derived from them using linear combinations (simple multiplication and addition) of the functions that you have.

Quoted message:

It is a pretty generic property, actually but if you randomly chose three RGB guns, you may get a very small gamut since you can have linear combinations with non-negative coefficients only (additive colors). See the diagram I posted from Wikipedia.

See this monitor spectrum, for example (taken from a pretty old paper, BTW: www.researchgate.net/figure/Color-spectrum-of-an-example-LCD-monitor-LCD04_fig3_228988571)

Put whatever coefficients on the three curves and add them together. You get a horribly spiky curve. Yet, it may create very nice images within its gamut.

It's not 'horribly spiky'. It's spiky by design, and works much better than would a non-spiky curve that satisfied the LIM conditions (these don't).
The point about this spectrum is that each of the three stimuli should excite as nearly as possible a single type of cone in the human eye. The response of the eye looks like this (from Wikipedia)

The best stimulus would be a monochromatic (hence 'spiky') source targeting a point on the cone response curve which selected only one kind of cone. THis is why the best OLP projectors use laser light sources. It's relatively easy to stimulate (more or less) only the S cones, because that response is well separated from the others. The other two are more problematic, because they overlap. On the other hand, that overlap means that in real-world use they will be stimulated together by most sources, so the aim is to provide maximum differential control of the stimuli. The stimulus for the M cones fits the peak response, but that for the L cones doesn't - it's shifted to longer wavelengths to get more separation.
In short, the spectrum of the exciting illuminant need have no direct relationship to that in the original scene. All that matters is that the correct cones are stimulated. All the brain knows is which cones are stimulated and how much - it has no way of measuring the exact spectrum of the applied stimulus.

Deleted likes this.

favorite 1
link

Andrew564
Members 86 posts

March 24, 2024, 10:33 a.m. March 24, 2024, 10:33 a.m.
link

@bobn2 has written:

In short, the spectrum of the exciting illuminant need have no direct relationship to that in the original scene. All that matters is that the correct cones are stimulated. All the brain knows is which cones are stimulated and how much - it has no way of measuring the exact spectrum of the applied stimulus.

Thanks for the clarification of the terms.

So...

In terms of colour reproduction the device recording the scene responds to real wavelength, but this information is not preserved. Somewhere along the line this is transformed into the sensation of colour. This is the maths you are talking about? How colour recorded by the sensor is transformed to a model of absolute colour and then how that is transformed into the output. It is not about collecting light and preserving the absolute wavelength, or really about absolute wavelength at all past a certain point. It's about preserving the sensation of colour.

Question: And I hope I use your terms correctly, if we move to a 4D system (4 sensors on a camera) and had 4 colours on an output screen, does this increase accuracy or really just push the gamut closer to saturated colour? And given most people's confusion between accurate and pleasing colour would it really be noticed by the human eye?

I've always been confused by some, even on camera forums, photographers perception that it is what the camera recorded and so is the real colour of the scene, "SOOC, I've not changed the colours!" There is a thread on another photo forum about unexpected "natural colours revealed" in highly processed images taken at night. I know about night vision and the nature of moonligh, but you have to see the image. I found the whole concept quite absurd. Just wondering what you impression is about the general understainging of colour on photo forums?
link

JACS
Members 878 posts

March 24, 2024, 2:37 p.m. March 24, 2024, 2:37 p.m.
link

[deleted]
link

bobn2
Team 2240 posts

March 24, 2024, 3:46 p.m. March 24, 2024, 3:46 p.m.
link

@Andrew564 has written:

@bobn2 has written:

In short, the spectrum of the exciting illuminant need have no direct relationship to that in the original scene. All that matters is that the correct cones are stimulated. All the brain knows is which cones are stimulated and how much - it has no way of measuring the exact spectrum of the applied stimulus.

Thanks for the clarification of the terms.

So...

In terms of colour reproduction the device recording the scene responds to real wavelength, but this information is not preserved. Somewhere along the line this is transformed into the sensation of colour. This is the maths you are talking about? How colour recorded by the sensor is transformed to a model of absolute colour and then how that is transformed into the output. It is not about collecting light and preserving the absolute wavelength, or really about absolute wavelength at all past a certain point. It's about preserving the sensation of colour.

Question: And I hope I use your terms correctly, if we move to a 4D system (4 sensors on a camera) and had 4 colours on an output screen, does this increase accuracy or really just push the gamut closer to saturated colour? And given most people's confusion between accurate and pleasing colour would it really be noticed by the human eye?

It's probably best working this one back form first principles. On of the issues about 'colour science' is that people come at it from the middle, with a lot of abstruse maths. Devoid of context, it's hard to understand.
Ultimately, what we want to do is provide the same data to the visual cortext that would be the case if the eyes were actually looking at the scene. The data comes from the rods and cones in the eye, so what we want to do is to 'fire†' the rods and cones in the same way as would be done by the eye looking at the actual scene. To do this we need to arrange for the eye to be looking at a device with models the relative light intensity of the scene, and also ensure that the light has a wavelength characteristic which triggers the same set (L,M or H) of cones as would looking at the scene. All that matters to our perception is that the right set of cones gets fired, not at all whether the spectral pattern of the light firing them is the same as the scene was originally. So, in a light emitting display (as opposed to a reflective print) the best way to do this is to choose narrow band light sources with wavelengths chosen to give maximum differentiation between the three different sets of cones‡.

So to capture a scene to provide the correct stimuli we need to 'record' how a retina would have reacted to the light - hence we use three channels. There are some confounding issues. One is that the reception pattern of cones varies between individuals - but not too much and it turns out that an average works for teh vast majority of people. The second is that it's very difficult to manufacture dyes or colour separators which have exactly the same spectral profile as do the human cones. Again, we can approximate - and that is what has been done. The CIE XYZ 31 (31 because it was released in 1931) provides a model of human colour vision good enough to work for most people. It's deliberately inaccurate, because it's been designed to be mathematically convenient, of which more later.
For engineering reasons using colour channels that replicate the eye is not the best solution. It doesn't make best use of the optical capture technology available. To get the best quality images (in terms of noise) we want to make most use of the wavelengths at which our capture devices (these days, silicon sensors) are most efficient. One of the advantages of using a mathematically convenient (rather than strictly accurate) model of colour perception is that we can use simple mathematical operators to translate between them. The LIM conditions are quite strict - it means that we must be able to generate XYZ simply by multiplying by some constants and then adding the three channels together. This puts quite a constraint on the colour channels of the camera. There's a looser condition whereby we can get to XYZ by multiplying by some translation functions - that vary by wavelength - rather than constants, and this is what Foveon sensors use to get good colour from 'filters' that are determined by physics rather than design.

So, in answer to your question. Three channels is enough so long as they are a good three channels (LIM or equivalent). In teh case that you can't get a 'good' three channels it might be possible to use a fourth 'helper' channel to generate useful XYZ. It has been attempted with four channel modifications to Bayer, but it didn't provide a clear advantage and no-one's doing it now.

† the word 'fire' here suggest that it is a binary on/off response. This isn't quite true. In fact it appears that there is a mixture of binary type cones and ones with a more gradual but graded response - see here
‡ narrow band sources cannot satisfy the LIM conditions because at most visible wavelengths the emittance is zero, and however much you multiply a zero, you won't get a spectral contribution to match the XYZ spectra. But this doesn't matter, because LIM applies to capture and not reproduction. For reflective (print) reproduction it is important to have pigments which satisfy LIM, since we can't control the illumination, which is why modern colour printers end up depositing so many different inks.
link

bobn2
Team 2240 posts

March 24, 2024, 3:55 p.m. March 24, 2024, 3:55 p.m.
link

@JACS has written:

@bobn2 has written:

'Linearly independent' - cannot be derived from each other by a fixed multiplier.

Here is the actual definition: en.wikipedia.org/wiki/Linear_independence .

Equivalent, and I was trying to provide an explanation for non-mathematicians. I thought that I might lose them at 'there exists no nontrivial linear combination of the vectors that equals the zero vector'.

@JACS has written:

I presumed that my post would be read by humans only when I said "your eyes."

No harm in clarifying for people that missed the presumption.

@JACS has written:

Well, you would not want to sit on it, it would hurt. And, as I said, it can still create nice images. You seem to take this remark as a claim that the spectrum us suboptimal, while my point was exactly the opposite - it looks horrible but it works.

The emphasis that I would put on it is different from 'it can still create nice images'. It's because it's like that it creates nice images. Monochromatic sources are optimum for an emissive display. They control the stimulation of the required sets of cones as precisely as is possible, given the overlap of the bands.

@JACS has written:

A lot of words to say that we are restricted to linear combinations with non-negative coefficients, which I said already; but then it reduces to an optimization problem: how to get the widest gamut with not-so extreme coefficients, and which part of the gamut matters more.

One of the great advantages of mathematical parlance is that it can convey precise concepts concisely. When trying to communicate with non-mathematicians a lot of words might be necessary if they are to understand what you're saying.
link

Deleted Removed user

March 24, 2024, 4:09 p.m. March 24, 2024, 4:09 p.m.
link

@bobn2 has written:

One of the great advantages of mathematical parlance is that it can convey precise concepts concisely. When trying to communicate with non-mathematicians a lot of words might be necessary if they are to understand what you're saying.

Hear, hear!
link

JACS
Members 878 posts

March 24, 2024, 4:38 p.m. March 24, 2024, 4:38 p.m.
link

[deleted]
link

bobn2
Team 2240 posts

March 24, 2024, 4:54 p.m. March 24, 2024, 4:54 p.m.
link

@JACS has written:

Well, for 3 or more vectors, your definition does not apply.

Yes, but again that leads to a discussion that possibly most non-mathematicians would not follow.

@JACS has written:

Not quite. "Spiky" is not the same as monochrome. Put more than one spike, and you limit the gamut.

Again, monochrome is spiky, it's not the only spiky.

@JACS has written:

The example I posted had more than one spike in a channel. Hardly by design, looks more like a technological compromise.

Design is always a technological compromise. Clearly the design intent was a single spike in each channel. Frankly, knowing how this technology works, and how old the paper was, I'm very surprised that it's that good.
Anyhow, I'm getting the impression that you're seeing this more as a mutual urination contest between you and I than an attempt to clarify this whole matter for people who perhaps don't have the mathematical background that you do.
link

JACS
Members 878 posts

March 24, 2024, 5:49 p.m. March 24, 2024, 5:49 p.m.
link

[deleted]
link

bobn2
Team 2240 posts

March 24, 2024, 5:52 p.m. March 24, 2024, 5:52 p.m.
link

@JACS has written:

@bobn2 has written:

Anyhow, I'm getting the impression that you're seeing this more as a mutual urination contest between you and I [...]

It is not? 🤨

It isn't. Sorry if that disappoints you.

MarshallG likes this.

favorite 1
link

ArvoJ
Team 1067 posts

March 24, 2024, 6:35 p.m. March 24, 2024, 6:35 p.m.
link

Thank @JACS and @bobn2 for explanations!

Actually I know, what is linear transformation, what means linearly independent and so on, I just get mental block everytime when I read about infinite dimensional spaces :) Or super unitary goups and similarly abstract algebraic concepts.

I'll attempt to explain in few words, what I gathered from this discussion (assuming linear model for all operations).

(1) image (incl. color information) is recorded by sensor in three channels, whose sensitivity is designed to [approximately] satisfy LI condition (linear combination of cone response); this process is subject to metamerism errors (or metamerism according JACS terminolgy)
(2) recorded data is converted into some RGB space, using linear transformations
(3) RBG data is used to produce output (screen or print), which (within device gamut) generates similar eye response as original scene

In non-linear case conversions (2) are a bit more difficult, but general idea is same.
'Color science differences' are mostly (a) different metamerism errors in different cameras/sensors in step 1 and (b) manufacturer preferred color tweaking models in step 2.
(I recall reading article from one Nokia designer, who told about tweaking Nokia's colors for months to get really pleasant image.)

PS. About Foveon - in first approximation image data is processed using linear transformations, it is seen in dcraw code for example. (I've coded Foveon software for older cameras after all :))

Deleted, Bryan and AlanSh like this.

favorite 3