To me, files could get hard to read if events that happen simultaneously aren't lined up horizontally on the same row, like this:
2.0 voice1 | voice2 | ...
https://youtu.be/eclMFa0mD1c
POS | TRACK #1 | TRACK #2 | ...
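
For what it's worth, here is a rough sketch of how a tool could produce that kind of row-per-position layout from a flat event list. Everything in it is my own assumption for illustration: the `(pos, track, text)` event tuples, the `render_rows` name, and the fixed column `width` are hypothetical and not part of any existing format.

```python
from collections import defaultdict


def render_rows(events, tracks, width=8):
    """Return one text row per position, with one padded column per track.

    events: iterable of (pos, track, text) tuples (hypothetical shape).
    tracks: track names in the column order you want.
    """
    # Collect every event that shares a position into the same row.
    by_pos = defaultdict(dict)
    for pos, track, text in events:
        by_pos[pos][track] = text

    # Header row: POS followed by the track names.
    header = ["POS".ljust(width)] + [t.ljust(width) for t in tracks]
    rows = [" | ".join(header).rstrip()]

    # One row per position, in ascending order; empty cells stay blank
    # so the column separators still line up.
    for pos in sorted(by_pos):
        cells = [str(pos).ljust(width)]
        cells += [by_pos[pos].get(t, "").ljust(width) for t in tracks]
        rows.append(" | ".join(cells).rstrip())
    return rows


if __name__ == "__main__":
    demo = [(2.0, "voice1", "C4"), (2.0, "voice2", "E4"), (1.0, "voice1", "G3")]
    print("\n".join(render_rows(demo, ["voice1", "voice2"])))
```

Cells longer than `width` would push the separators out of line, so a real tool would probably compute the width from the longest cell instead of hard-coding it.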