ByteDance looks like it's eager to make up for lost time when it comes to scraping the web for data needed to train its ...