Hi All,
Is there a way to include .xlsx files in to mediaset? so that later we can use them in pipeline builder for a rag application. I am currently working with docx, pptx and pdf files.
Hi All,
Is there a way to include .xlsx files in to mediaset? so that later we can use them in pipeline builder for a rag application. I am currently working with docx, pptx and pdf files.
Hi-
can do in below ways based on our UC.
1.we can drag-and-drop your .xlsx files directly into a media set direclty.
2.Upload files from your computer into Foundry to pipeline Builder(as input datasets or media sets).
3)Can set up a Data Connection Sync to push those .xlsx files into a media set if it is coming from external systems.
Do you see any observations?
You need to set your media set schema type to spreadsheet.
The mediaset you are currently using is of schema type “Document” whereas you need a mediaset of schematype “Spreadsheet”.
XLSX is supported by mediasets, see here.
It will store as media items once the schema type as “media set”.
Hi Venkat,
Currently I am getting files from an external system to foundry and using code repo to convert them in mediaset files.
Pdf is primary format, docx and pptx are additional inputs. But I dont see xlsx as additional format.
So is there a way I can convert them to mediaset pdf - do I need to write a separate script for this?
Currently I am using. Code repo to convert files in to mediaset. Is there a way to setup this via code repo?
hi currently document media sets (pdf) do not support xlsx as an additional input format.
as @ecitaku mentioned you can utilize a spreadsheet media set and run transformations to get the content within the spreadsheet, as well as preview it in platform.
additionally spreadsheet media sets allow you to utilize spreadsheet annotations via the https://www.palantir.com/docs/foundry/workshop/widgets-spreadsheet-display widget in workshop.
hope that helps!