Dear Community !
Need support to have a clarity over, how to import & read any unstructured data file (for example: .pdf, .rpt etc.) from Foundry to code repository. Further get the file output to be picked up in dataset / data frame using python code?
Many thanks in advance for your support with the same.
You should be able to upload the files to a dataset (you will not be able to see anything in the preview, but if you go into the files (details > files), you should be able to see the files that are included. You can then iterate and parse the raw files in a transform as @manu flagged above by looking at the individual files and transforming them into a new output dataframe.
It might be worth taking a look at our docs for building with unstructured data to understand more on how to read from a dataset with no schema and transform it into a dataframe.