I would like to create new columns in a dataset (or a new dataset) based on existing column values. So if an existing column of interest has 3 unique values, I would like to create 3 new columns who’s names correspond to the 3 unique values. The new columns’ values would then correspond to another existing column who’s value would be part of the same row.
In the example below (data is notional, from dev-stack), I would like to create new columns where the values in p__type
would be the new column names, and the corresponding p_propertValue_...
would be the column values.
So after transforming this dataset, ideally there would be new columns that look like:
-
col_name:
com.palantir.property.Name
w/ valuesoakley
,elijah
,baker
, andaaa
-
col_name:
com.palantir.property.Language
w/ valueshouse
,above
,itself
-
etc.
I’m wondering if this is possible using built in PipelineBuilder transforms or if I’ll need to write a UDF / External Transform.