Building a Reproducible ML Pipeline with Datasets.do and PyTorch