Errors when uploading PDFs to Files in Project

I’m trying to mass upload PDFs into Foundry so AIP + OCR can extract information for a project, but some files fail to upload with “unexpected error.”

Things I’ve already verified (PowerShell):

  • All Excel files removed (PDF-only)

  • No special characters or nested subfolders

  • No duplicate filenames (case-normalized)

  • PDFs open normally in browser

  • Ran qpdf on every file

Behavior:

  • Some individual PDFs upload

  • Some folders upload partially

  • Some folders appear empty after upload

How do I properly fix the files so I can upload everything to Foundry or mass upload each folder as a media set (is this appropriate here?).

Thank you!

Hi @brandon,

Based on your use-case I would recommend you to use Media Sets with Pipeline Builder
Here’s the recommended approach:

  • Create a media set in Pipeline Builder
  • Upload PDFs to the media set
  • Use Pipeline Builder to process the media set for OCR and text extraction
  • Reference this tutorial: Create Batch Pipeline with Media Set

Best Regards,
Sanjeeth