Code Repository (Function) + Webhook Integration

tanavt · December 19, 2024, 11:05am

Hey,

So I currently have a Code Repository set up, and within this I have imported a REST API as a source (called “Gdelt2Retrieve”) . Within this REST API, I also have a webhook called ("Gdelt2) configured. I can run this in the data connection tab and successfully see the correct results from the webhook/API. I also have the egress policies set up.

I am trying to configure this webhook in a code repository, with the end goal of having this function imported into pipeline builder (This would be the input for the rest of the pipeline). I also have the transform-external-systems setting enabled from the documentation. From my understanding, I have to call the webhook, and also have an output as a dataset.

Could somebody tell me if I am on the right track with this code? I tried to mimic the documentation:

from palantir.datasets.core import Dataset
from palantir.datasets.webhooks import WebhookClient
from pyspark.sql import function


@function(sources=["Gdelt2Retrieve"])  
def call_webhook() -> str:
    # Create a WebhookClient instance
    webhook_client = WebhookClient()

    # Execute the webhook
    try:
        response = webhook_client.execute("Gdelt2")
    except Exception as e:
        return f"Error: Webhook call failed due to an exception: {e}"

    # Check if the webhook execution was successful
    if response.status_code != 200:
        return f"Error: Webhook call failed with status code {response.status_code}, response: {response.text}"

    # Process the response data
    try:
        data = response.json()  # Extract JSON data
    except ValueError:
        return "Error:"


    return str(data)

Also, after importing my API: ‘Gdelt2Retrieve’, In the resources side tab it gives me this starter code:

import requests
@function(sources=["Gdelt2Retrieve"])
def my_function() -> String:
    # TODO: specify endpoint url
    response = requests.get(...)
    if response.status_code != 200:
        # Handle error
    data = response.json()
    # Use response data

I don’t necessarily need to use the function, but I figured this would be the easiest way to get a webhook integrated into the pipeline builder.

eiriklt · December 20, 2024, 10:21am

Hey!

I do think you are on the right track, but I don’t believe python functions support the generated webhook client (yet). This means you will have to use a source (as you already are), but instead of a webhook you will have to call the endpoint manually as described here.

Let me know if this doesn’t work for you!

-Eirik

tanavt · December 24, 2024, 10:17am

Hey Eirik,

Thanks for the response.

I did try that approach in another notebook but I’m not sure if I did it right. Here is the way I implemented my code.

from functions.api import function
from functions.sources import get_source


@function(sources=["Gdelt2Retrieve1"])
def my_external_function() -> str:
    source = get_source("Gdelt2Retrieve1")
    url = source.get_https_connection().url
    client = source.get_https_connection().get_client()
    response = client.get(url)
    return response.txt

The error I was getting is shown below. I also tried using the RID where source name should be, and that didn’t work for me. In the screenshot, at the top left it looks like the API name is in the code is correct, but I’m wondering if I’m missing a step.

Thanks,
Tanav

The only thing in the log is:

WARN [2024-12-23T17:48:42.058443785Z] could not get source parameters, unknown runtime environment

2c17e0fe21fe49c8714d · December 30, 2024, 4:26am

Hey!

I took at look at the documentation linked above and one thing I saw was “It is not currently possible to make API calls in function live preview, you will have to publish your function to test it.”

From what I can see, it looks like you are testing out this function in Live Preview. You may have to publish it to see any desired results.

system · February 28, 2025, 4:27am

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.