Hey there @cc36b9a87e50cd105460, there’s a way to configure the spark profiles on the data sync itself. These steps were provided by Palantir’s support when we struggled with a sync that was having OOM issues (no transforms, but the sync itself).
As far as I know, a native “GUI” way of doing this via the data sync config is coming sometime in the future but in the meantime, you should be able to follow these steps to configure the Spark profile of the sync.
What you need: Foundry data sync, PowerShell or other command line tool
Step-by-step instructions
1 – Open the specific data sync you want to configure the spark profile for.
2 – In the browser, enable developer tools (Ctrl+Shift+I in Microsoft Edge), or under: (More Tools → Developer Tools in the settings menu).
3 – Navigate to “Network” on the developer tools.
4 – Edit any setting on the data sync, press “Save” and refresh the page.
5 – Look for a network package starting with something like: ri.maggritte…
This is the identifier for the configuration of the data sync.
6 – Once in PowerShell, paste the contents and look for the sparkProfiles parameter (should be last)
7 – Edit the contents to the selected spark profile. The default is DRIVER_MEMORY_MEDIUM. Try increasing it to DRIVER_MEMORY_LARGE, like so:
8 – Press enter and wait…
9 – If the return status code is 204, the process worked successfully. This will set this spark profile for that data sync to be the selected one (in this case, DRIVER_MEMORY_LARGE). You can validate this configuration by repeating steps 1-5 and seeing that when you paste the new configuration, the spark profile is already set.
10 – Revert any temporary changes in the sync and run it with the updated spark profile.
Hope this helps!
Best,