Latecomers to the field may be tempted to write this off as antiquated (though updated to cover transformers, attention, etc.) but a better framing would be that it is _grounded_. Understanding the range of related approaches is key to understanding the current dominant paradigm.