I have a pipeline with 10 inputs, each of which updates at a different time near the end of the month. Is there a way to run the pipeline so that only the most recently updated input flows through it, while the other 9 inputs are ignored?
All inputs are given to me as snapshots, so I cannot build an incremental pipeline directly.
One possible approach is to create a downstream dataset for each of your snapshot inputs that is incremental and contains only new records (note: if any of these datasets are joined together, their new records need to be processed concurrently). You can then feed those incremental datasets into your pipeline as inputs and process them incrementally. This community post walks through how to convert a snapshot dataset into an incremental one: How can I process a dataset by chunk?
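The core of the snapshot-to-incremental step is diffing the current snapshot against the previously seen one and keeping only rows that are new. A minimal sketch of that idea in plain Python (in a real Spark-based transform this would typically be a left-anti join on the key columns; the function name, record shape, and `id` key here are illustrative assumptions, not part of any specific API):

```python
# Sketch: derive an append-only "new records" stream from successive
# snapshots by diffing against the previous snapshot on a key column.
# Assumes rows are dicts and `key` uniquely identifies a record.

def new_records(previous, current, key="id"):
    """Return rows of `current` whose key was absent from `previous`."""
    seen = {row[key] for row in previous}
    return [row for row in current if row[key] not in seen]

prev_snapshot = [{"id": 1, "value": "a"}, {"id": 2, "value": "b"}]
curr_snapshot = [{"id": 1, "value": "a"},
                 {"id": 2, "value": "b"},
                 {"id": 3, "value": "c"}]

print(new_records(prev_snapshot, curr_snapshot))
# Only the row with id=3 is emitted downstream.
```

If a snapshot can also modify existing rows (not just append), compare whole rows or a row hash rather than just the key, so updated records are re-emitted as well.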