1. The predictions get better with more data, and we don't seem to be anywhere near diminishing returns.
2. The thing we care about is generalization between people. For this, less data from more people is much better.
I noticed you tracked sessions per person, implying a subset of people have many hours of data collected on them. Are predictions for this subset better than the median?
For a given amount of data, is it better to have more people with less data per person or fewer people with more data per person?