The Importance of Data Versioning in Reproducible AI