>Is this a workaround to let us see “what it would look like”, or are there optical reasons why this produces an image that is inherently artificial, and could never really be perceived that way?
Both in a way. When you look at a landscape, your eyes and brain are constantly adjusting everything so what you look at "directly" is sharp, and you don't really realize most of what is in your field of view is low resolution, maybe a bit blurry. Same when looking at something really close.
When you look at a picture, if some parts of it are blurry, your eyes/brain can't adjust so that it becomes sharp, because it was captured blurry. Even if you had a camera that exactly reproduces your eye, the pictures would look nothing like what your eyes see, because your eyes and brain are a very different system from a camera.
In photo there is something called "depth of field", which is "the distance between the nearest and the farthest objects that are in acceptably sharp focus in an image captured with a camera" [1]. You can see on the wikipedia page that there's an equation for approximating depth of field, that has in it 2u², where u is the distance to the subject. That means the closer the subject, the smaller the size of depth of field. You can test this with your eye. Take an object 30cm away, put your finger between your eye and the object, and you can change the focus of your eye between your finger and the object. When you focus on your finger the object is a bit blurry, when you focus on the object your finger is a bit blurry [2]. Now take two object that are 15cm away from each other, but 2m or more away from you. Changing the focus from one object to the other won't make the first object as blurry as when you did that close. This is because your depth of field is larger, as the distance increases.
Finally macro. In macro photography, you're often extremely close, so depth of field is extremely thin. When I say extremely thin, I mean "it can take 10 or more pictures to cover a whole fly". A solution in that case, to get all your subject in focus (sharp), is to take lots of different pictures, focusing a tiny bit closer/farther away each time, and then taking all the sharp parts of each picture. That's the technique used here, often called "focus stacking".
[1]: https://en.wikipedia.org/wiki/Depth_of_field
[2]: This might be harder if you're older, as we age we slowly lose the ability to adjust focus, hence the need for reading glasses (cameras can also use "reading glasses" when they can't focus close enough, they're called "close-up filters" and work the same).
>and you don't really realize most of what is in your field of view is low resolution, maybe a bit blurry
PBS did a great video on this
https://www.youtube.com/watch?v=HU6LfXNeQM4&t=514s
This links to the section in question, but it's well worth watching all of it to see and example of how your brain tricks you. The computer doing eye tracking and blurring everything else out to the user really points out how much your brain lies to you about reality.