I am trying to ingest data from an xlsx file hosted on a public website (something like https://acme.org/downloads/my_excel_file.xlsx). What is the best way to do this ?
I could not find a data connector that allows doing this directly (did I miss it ?), so I did the following:
- run a script that fetches it on a scheduled basis and drops it in an S3 bucket
- Import it in foundry using a data connection to this bucket
I now have a dataset containing xslx files, but I don’t know how I can parse its content into a tabular dataset.
Any suggestion ? Thanks