Access Siglip2 embedding model in Code Repo

9eab477d6eb76c06c4f6 · October 3, 2025, 8:44am

Hello,

Happy to see the new image embedding using Siglip2 available in Pipeline Builder.

I could not find any documentation on it (too early?). Is there any plan to support this function natively in Code Repositories as well? Maybe with transforms.mediasets?

Note: It would be good if Palantir could be more transparent about upcoming functionality like this, so that we avoid building logic that becomes available natively a few weeks later!

Cheers,

yash · October 3, 2025, 6:57pm

Hi!

Happy to see organic usage of the new Image to Embeddings board already! You are right that there is no public documentation yet. We are planning to ship similar functionality on the querying side of the ontology and then announce an image semantic search package.

For the Code Repositories part: yes, it should already be available as a generateEmbedding transform!

Let us know if you have any questions.

9eab477d6eb76c06c4f6 · October 6, 2025, 1:06am

Sorry, would you mind sharing where this function is located?

I have looked into palantir_models.models and palantir_models.transforms, but I could only find what I believe is the text embedding one (“GenericEmbeddingModel”), which relies on language_model_service_api.

9f598c9f9216f2e19a57 · October 22, 2025, 6:23pm

Here is the new public documentation for the method in Code Repositories: https://www.palantir.com/docs/foundry/transforms-python/media-set-transforms-api#generate_image_embeddings
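
For anyone wondering what the image semantic search mentioned earlier in the thread could look like once embeddings exist, here is a minimal sketch in plain Python. It does not call any Foundry API: the file names, the toy 3-dimensional vectors, and the helper names cosine_similarity and rank_by_similarity are all hypothetical, and real embeddings have far more dimensions. It only illustrates ranking images by cosine similarity to a query embedding.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # A zero vector has no direction, so treat it as dissimilar to everything.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, embeddings):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in embeddings.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings, standing in for the output of an embedding step.
image_embeddings = {
    "cat.png": [0.9, 0.1, 0.0],
    "dog.png": [0.7, 0.3, 0.1],
    "car.png": [0.0, 0.2, 0.9],
}
query_embedding = [1.0, 0.0, 0.0]

ranking = rank_by_similarity(query_embedding, image_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In practice the vectors would come from the embedding step described in the linked documentation, and a vector index would replace the linear scan for large media sets.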