Split large data set - guidelines and recommendations

Just as informative info, see below of runs that have not finished yet. Data is 2788 images, each roughly 6MB.

Using --split 500

Using --split 1000

Using --split 1500

Also --split-overlap 100 and all other params are the defaults.