Row Level Data Expectations

Is my understanding of how row-level data expectation expressions work correct?

I want to add a check so that if a column called aip_summary has a length greater than 1000, the row will fail.

I checked the data itself, and currently the maximum length is around 750. Originally I had the check set up as: if Length(aip_summary) > 1000 then Fail. I would expect no failures, given that the summaries are at most around 750 characters and the condition being checked returns False for every row.

Hi @ggsmith842, a couple of things to note here:

  • If any row fails the expectation, your entire build will fail. You may already be aware of this, but I wanted to flag that failing rows are not filtered out; the build itself fails.
  • The condition you add in the expectation is the condition a row must satisfy to pass, not the condition that makes it fail. In the case you shared, where you want the expectation to pass when the length is <= 1000, the condition should be “Less than or equals” rather than “Greater than”.
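
To make the two points above concrete, here is a minimal sketch of the semantics in plain Python. It does not use the platform's expectation API; the helper names (passes, inverted, check_expectation) and the sample rows are made up for illustration, and only the aip_summary column and the 1000 threshold come from the question.

```python
# Illustrative sketch only: plain Python, not the platform's expectation API.
# All function names and sample rows below are hypothetical.

MAX_LEN = 1000  # threshold from the question


def passes(row: dict) -> bool:
    """Pass condition: the row is valid when the summary length is <= 1000."""
    return len(row["aip_summary"]) <= MAX_LEN


def inverted(row: dict) -> bool:
    """The original setup: 'greater than 1000' used as the pass condition."""
    return len(row["aip_summary"]) > MAX_LEN


def check_expectation(rows: list, condition) -> None:
    """Raise if any row violates the condition; rows are never filtered out."""
    failing = [i for i, row in enumerate(rows) if not condition(row)]
    if failing:
        raise ValueError(f"{len(failing)} row(s) failed the expectation; build fails")


# Sample data resembling the question: summaries of at most about 750 characters.
rows = [{"aip_summary": "a" * 750}, {"aip_summary": "b" * 400}]

check_expectation(rows, passes)  # every row satisfies the condition, no error

try:
    check_expectation(rows, inverted)  # no row is longer than 1000, so all rows fail
except ValueError as err:
    print(err)
```

With the inverted condition, the check returns False for every row, which is why every row is reported as failing even though no summary exceeds 1000 characters.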

You can test that the expectations are working as expected by previewing the dataset. You will see a green check if your output passes the expectations, and a red x otherwise.

Thank you for explaining it!