I’m using Python functions as UDFs in Pipeline Builder, via this guide: https://www.palantir.com/docs/foundry/functions/python-functions-builder/
The examples given in those docs are all row-by-row UDFs which take scalar inputs and produce an output per row. Is it possible instead to pass in something like a DataFrame so that I can compute my own aggregation across rows? I believe PySpark calls this a “UDTF”.
I’ve seen cases where an array is stored per row in order to pass an array of values to a UDF to emulate this, but I’m curious whether it’s possible to directly operate across rows in a UDF in Pipeline Builder. Thank you!