Is it possible to ingest data from a Kafka topic as batch datasets instead of streams?

I am ingesting data from a Kafka topic into Foundry Streams, but I don’t actually need the near real-time update capabilities offered by Foundry Streams in this particular case.

Is there a way to ingest data from a Kafka topic on a schedule (e.g. every 5, 15, or 30 minutes) into regular datasets using batch compute (without ever utilizing Flink)?

hey @acapras

Consuming Kafka as a stream is the most efficient way to ingest it, which is why we only support this ingestion method.

If you don’t need all of your downstream pipelines to be real-time, our recommendation is to use the stream’s archive dataset (also called the cold buffer) as a batch input to your pipeline, as in the sketch below.
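For illustration, here is a minimal sketch of what that could look like as a Foundry Python transform. The dataset paths (`/Company/streams/kafka-topic/archive` and `/Company/datasets/kafka_topic_batch`) are hypothetical placeholders; the point is simply that the archive dataset can be consumed like any other batch input.

```python
from transforms.api import transform_df, Input, Output


@transform_df(
    # Hypothetical output path for the derived batch dataset.
    Output("/Company/datasets/kafka_topic_batch"),
    # Hypothetical path to the stream's archive (cold buffer) dataset.
    archive=Input("/Company/streams/kafka-topic/archive"),
)
def compute(archive):
    # The archive is a regular dataset, so it can be read and transformed
    # with plain batch Spark; here the rows are simply passed through.
    return archive
```

You can then put the resulting transform on a time-based schedule so the batch dataset refreshes at whatever cadence you need (e.g. every 15 minutes).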
