When using incremental state on a JDBC sync, how is the new value of that state determined?

michaello · March 6, 2025, 1:06pm

Is it:

The largest value present in the specified column anywhere in the table?
The largest value present in that column which was ingested during the previous run?
Something else?

Context is that I’ve got a query that pulls in rows from the source system, where last_update_timestamp > ? . However the table is large and the update_timestamp column has no index so this query is very slow.

I’m hoping I can batch my initial ingest such that each run is limited to only pull in a single month of data at a time. I’ll run it a few dozen times to get hold of the full historical data, and then once it’s caught up to today then I can swap it back to normal incremental behaviour.

blaunet · March 6, 2025, 1:52pm

Hi,

As explained in our docs the behavior is such that after an incremental run, the new value of the incremental state is the largest value that was present during that run.

In any subsequent run, the wildcard will be replaced with the maximum synced value of the incremental column from the previous run.

That means, if you apply a filter in your SQL query with a max value on that same incremental column, the value of the incremental state after the run will effectively be that max value.

system · March 20, 2025, 1:52pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.