Hi! Am I correct in assuming that if we add a repartition board to the transformations on an incremental dataset and then replay that dataset on deploy, the entire dataset (from the start of the data) should be repartitioned?
If so, that is not the behavior I'm seeing. I have an 8.2 GB dataset backed by 50,000 files. I followed the process described above, set the number of partitions to 50, and replayed the dataset, but the repartitioning didn't happen. What might be the cause?
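For context, here is a rough back-of-the-envelope calculation of the file sizes involved. It is only a sketch: it assumes decimal units (1 GB = 10^9 bytes) and evenly sized files, neither of which I have verified, and the helper name is just illustrative.

```python
# Rough file-size arithmetic for the dataset described above.
# Assumptions (not verified): 1 GB = 10**9 bytes, files are evenly sized.
DATASET_BYTES = 8_200_000_000   # 8.2 GB
CURRENT_FILES = 50_000          # files currently backing the dataset
TARGET_PARTITIONS = 50          # partition count set on the repartition board


def avg_file_size_bytes(total_bytes: int, file_count: int) -> float:
    """Average size per file if the data were spread evenly."""
    return total_bytes / file_count


current_avg = avg_file_size_bytes(DATASET_BYTES, CURRENT_FILES)
target_avg = avg_file_size_bytes(DATASET_BYTES, TARGET_PARTITIONS)

print(f"current: ~{current_avg / 10**3:.0f} KB per file")  # ~164 KB
print(f"target:  ~{target_avg / 10**6:.0f} MB per file")   # ~164 MB
```

So the intent is to go from many files of roughly 164 KB each to 50 files of roughly 164 MB each.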
Thanks!