Feature Request: Allow AI FDE to Submit Internal Product Feedback


Feature Request: Allow AI FDE to Submit Internal Product Feedback

Summary

When AI FDE encounters platform limitations, missing APIs, or has to fall back to manual workarounds, it should have the ability to automatically file structured product feedback tickets to Palantir’s internal teams. This would create a continuous, high-quality feedback loop between real user workflows and platform development.

The Problem Today

AI FDE works across thousands of user sessions, building pipelines, writing functions, configuring ontologies, and more. When it hits a limitation — like a missing DSL feature, an unsupported tool capability, or an API gap — the only options are:

  1. Tell the user to do it manually (breaks the automation promise)
  2. The user files a forum post (requires effort, often doesn’t happen)
  3. The friction is silently absorbed (Palantir never hears about it)

Most of the time, option 3 wins. The feedback is lost.

The Opportunity

AI FDE is arguably the highest-volume, most context-rich source of product feedback Palantir has. Every time it hits a wall, it knows:

  • Exactly what it tried to do (the API call, the tool, the DSL method)
  • Why it failed (missing feature, unsupported parameter, tool not available)
  • What the user was trying to achieve (the business goal behind the request)
  • What workaround it had to use (manual UI steps, alternative approach, or giving up)
  • A proposed solution (what the API/tool should look like)

This is far richer than a typical support ticket or forum post.

Proposed Design

What gets filed

A structured feedback record containing:

Field Example
Category Missing DSL API / Tool gap / Documentation gap / UX friction
Component Pipeline Builder, Functions, Ontology Editing, etc.
What was attempted Add data expectations to Pipeline Builder output via DSL
What happened No expectations parameter exists on PB.outputs.dataset()
Workaround given Manual UI instructions provided to user
Proposed fix Add expectations array to the output DSL (with example API sketch)
Frequency signal Number of times this limitation has been encountered across sessions

Safeguards

  • Deduplication — Don’t file the same issue hundreds of times. Aggregate frequency and add “+1” signals to existing tickets instead.
  • Quality threshold — Only file when there’s a clear, reproducible gap (not “I tried something weird and it didn’t work”).
  • No user data — Tickets should describe the capability gap, not the user’s specific data or business context.
  • Transparency — Users could see a summary like: “I noticed this platform limitation and flagged it to Palantir’s product team.”
  • Feedback loop — When issues are resolved, AI FDE should be updated so it stops suggesting workarounds for fixed problems.

Why This Matters

For users: Friction points get addressed faster without requiring them to write forum posts. The tool they’re already using becomes their advocate.

For Palantir: Instead of relying on users to self-report issues (which happens rarely), the platform gets continuous, structured, high-context feedback from real workflows at scale. Product teams can prioritize based on actual frequency data rather than guessing.

For AI FDE itself: Each resolved issue makes it more capable. It’s a virtuous cycle — feedback → fix → better AI → discovers next friction point → feedback.

Real-World Example

In a single session today, AI FDE:

  • :white_check_mark: Built a 7-step data pipeline (dedup, primary key, type casting, HTML cleanup, null filling, schema ordering)
  • :white_check_mark: Deployed and validated the output dataset
  • :cross_mark: Could not add data expectations because Pipeline Builder’s DSL doesn’t expose them
  • :memo: Had to write manual UI instructions as a workaround

With this feature, that gap would have been automatically captured, categorized, and routed — along with a frequency count showing how often it comes up across all users.

In Short

AI FDE talks to more Foundry users than any solutions architect. Let it file tickets.

+++ I will add on some additional thoughts in the hopes one of these helps.

Ultimately AI FDE uses a superset of the foundry-mcp (not the 18ish ones you see in continue, but all 60+ and the other 100 or so that could be run using the exposed API catalog endpoints.. ik cuz I made this https://github.com/anand-testcompare/opencode-palantir/ and went down a rabbit hole of all rabbit holes to understand the other 200+ platform APIs that control all other behavior.

  1. point being each of those can be anonymously instrumented and you can create these rich trace runs (I have examples in sentry and posthog I can provide privately specific to the palantir mcps and I’ll include a generic example below) that can be used to create a user/agent journey
  2. The 1 click ticket approach is a realistic one as the data is often customer/confidential, but this could be an even more frictionless point of feedback if there was a one click option to then do a foundry branch, the user finishes the work, merges back in and combined w/ 1 that I will argue is sufficient to understand not just frequency of a need, not just what/why something failed, but what the human fix was. Give some of these pro users complimentary spend/limit bumps and you have the one thing labs would give both their kidneys… human, labeled data of what success is.
  3. Once the AI FDE → local / pro code exchange is seamless, enough of the error is exposed to quickly address manually then hand back off.. I and I suspect others will be more likely to adopt for more ‘real work’ or at least let it do the first attempt. The vision is there, the data to realize the vision could already be there and we don’t know or a feedback mechanism like Michael suggested could be established. Even the way the underlying MCPs are written so they are not so hyper granular they just bloat your context was clearly well thought through, but to really start seeing exponential improvements something like this is needed as I’ve currently only been able to really use it for the same kinds of things I use databricks genie for +20% of use cases

Hope this helps,

P.S. Here is a screenshot of the kind of instrumentation depth I’ve found required either for finding where the agent gets stuck or churns (this is via claude agents sdk just using the standard gen_ai OTEL traces/spans that has become standard for datadog, sentry, posthog, etc… i’ve had trees 2000 events long so it needs to be de-noised to create a play by play of what the agent attempted then fedback into gemini 3.1 flash lite or similar fast/cheap llm to bubble up the culprits.