How many retries are done (to ensure structure) in the backend when using the LLM board in the builder?
And is that configurable?
Today we retry the output-type coercion up to three times. That retry count isn't configurable at the moment.
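For anyone curious what that looks like, here's a minimal sketch of the retry-on-coercion pattern, assuming a hypothetical `call_llm` helper and a Pydantic model standing in for the expected structure (neither is the actual backend API):

```python
# Sketch of retrying structured-output coercion a fixed number of times.
# `call_llm` and `Task` are hypothetical stand-ins, not the real backend code.
import json
from pydantic import BaseModel, ValidationError

MAX_COERCION_RETRIES = 3  # fixed today; not exposed as a setting


class Task(BaseModel):
    title: str
    priority: int


def call_llm(prompt: str) -> str:
    """Placeholder for a single LLM call returning raw model text."""
    raise NotImplementedError


def generate_task(prompt: str) -> Task:
    last_error: Exception | None = None
    for _ in range(MAX_COERCION_RETRIES):
        raw = call_llm(prompt)
        try:
            # Coerce the raw output into the expected structure.
            return Task.model_validate(json.loads(raw))
        except (json.JSONDecodeError, ValidationError) as err:
            last_error = err
            # Feed the error back so the next attempt can self-correct.
            prompt = f"{prompt}\n\nPrevious output was invalid ({err}); return valid JSON only."
    raise RuntimeError("Output coercion failed after retries") from last_error
```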
Hey @drew, follow-up: are we batching requests in the backend? Asking since batched API calls are cheaper, and cost is becoming a concern at scale.
We’re not batching calls today, but we do parallelize individual LLM calls.
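To illustrate the distinction, here's a rough sketch of "parallel, not batched": each prompt is still its own API request, just issued concurrently. `call_llm_async` is a hypothetical helper, not the actual backend function:

```python
# Each prompt is a separate request, run concurrently with asyncio;
# no requests are combined into a single batched API call.
import asyncio


async def call_llm_async(prompt: str) -> str:
    """Placeholder for one async LLM API call."""
    raise NotImplementedError


async def run_prompts(prompts: list[str]) -> list[str]:
    # One request per prompt, awaited together.
    return await asyncio.gather(*(call_llm_async(p) for p in prompts))
```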