You can use the same framing for human reasoning, except it's over visual/auditory/spatial data and not just text.
You don't remember every detail of what you've seen, correct? You store some lossy compression of it, like "I went to a park."