I think a simplified rule of thumb is all you need to control perspective as a photographer:
"Select a focal length that magnifies the background to whatever size you need. Then move your feet towards or away from the foreground to get that the size you need"
You call yourself a scientist yet you are unable to produce a rational scientific argument to support the claims that Ansel Adams made. If you go back to the full quote I gave in the OP, it is very clear that he was claiming that telephoto perspective and wide-angle perspective were due to the camera position and not the viewer's position (which he never mentions).
You are blustering and trying to divert attention rather than giving a rational scientific argument.
The truth is that telephoto compression and wide-angle perspective distortion depend primarily on how the image is viewed relative to its centre of perspective (which depends on the focal length and degree of enlargement).
I have edited my previous post to use the terms wide-angle perspective and telephoto perspective instead of wide-angle distortion and telephoto compression which are often used to refer to the same effects. It does not change my argument.
I am not going to discuss this with you further. If you think you can present a credible scientific explanation for what people colloquially refer to as telephoto compression, then please do so in a thread of your own. I look forward to reading it.
Don't be so sure. Did you try? But at 10 meters? (Assuming you can see any details then...)
But human perception is involved too, this I don't oppose. We have used to typical image sizes and scenes and often correct some [perspective] distortions unconciously. We may not process scene, visible through window, in the same way - thereby I actually can't say, will your tiny flat image on faraway phone screen and exactly same scene, seen through small window at correct distance, preceived similarly. Or the same scene, viewed without window in its entirety...
Geometrically all they should be identical, perceptually they probably are not.
So in the above, if any size of print is made from the film, then the optimum viewing distance is that which subtends the same object angle as from the pinhole to the film, if I understand correctly. And by similar triangles, the ratio (object size / object distance) is all one needs to figure the viewing distance for any size of print (or any zoom on one's monitor).
Say a 30 ft tree at 100 yds subtends about 0.1 rad. Ergo, if the tree shows up as 2 in. on a monitor, then the optimum viewing distance is about 20 in ... but, if I zoom out to see more of the scene, I should move closer! Who actually does that, I wonder?
I played a bit more - I dowsized image (Lisa 200 in fact - trees are clearer there) and attempted to place it into one of NCV architecture shots, like a small window.
I'll attach resulting image, but not as preview - I have not asked NCV about reposting rights :(
There appears very interesting effect.
Initially this attempt-to-be-a-window looked like TV set and this way it was perceived flat.
Then I attempted to imagine that this is window - and now objects shifted in perspective (only in my mind!); image feels more like Lisa 105 :)
(I have no good background image with real small window - if you find one, we can experiment further. It likely needs to be perceived as real window, not some TV set.)
Both Lisas are also included in attached image, they somehow helped seeing result as look through window.
Here are two more example images to exercise the little grey cells of those who are interested in perspective and how we see it.
The first is a multiple-exposure image from Wikimedia. It illustrates a number of common cues that our brains use to see depth in a 2-D image.
The size of the last girl is about one fifth the size of the first girl, which information our brain uses to perceive the last girl as about five times as far away as the first.
In the next image I have created a small inset which is a copy of the original image at around 1/7th the size. The perspective matches sufficiently well to create a credible image of ten girls running along the path. The 6th girl is about 1/7th the size of the first and the last girl is less than 1/30th the size of the first. The distance between the 6th and 10th girls appears to be about 7 times as much as the distance between the 1st and 5th girls. Yet the inset image is an exact copy of the original image, just reduced in size.
Here is another little puzzle about the perspective: why does the 6th girl look as if she is veering off to her left, rather than running straight along the path?
Because the track is not parallel to the line of sight of the camera, each point on the track has a different angle to the camera. The closer the girl, the greater the angle between her and the line of sight. The 6th girl, being a copy, is still pointing in a direction with the same angle as the first, while if she was real, her angle would actually be on the other side of the 90 degree point of intersection.
Question: If there were really 10 girls roughly equally spaced, and not a resized copy, would the distance between the 6th and 10th still appear to be much greater than the 1st and 5th?
It is a pity that the example is not something like a railroad line or simulation thereof. Then the effects of zooming, resizing and pasting, cropping or windowing, etc. would be more easily understood ...
Thanks!! Took me a minute to two notice the inset which preserves the viewpoint lines of perspective but breaks another rule or two e.g. foreshortening.