They pre-train with all data up to 1900 and then fine-tune with 1900-1913 data.
Where does it say that? I tried to find more detail. Thanks.
See pretraining section of the prerelease_notes.md:
https://github.com/DGoettlich/history-llms/blob/main/ranke-4...
See pretraining section of the prerelease_notes.md:
https://github.com/DGoettlich/history-llms/blob/main/ranke-4...