Woah, I'm impressed! The voice cloning also worked much better than expected! Will there be separate models for other languages? I know the National Library of Norway has done a good job curating speech datasets that cover many different dialects [1][2].
[1] https://data.norge.no/en/datasets/220ef03e-70e1-3465-a4af-ed...