2025-06-16 – Frannz Salon
With Apache NiFi, a multimodal data pipelining tool, you can assemble existing and/or custom Java & Python processors into a variety of flows. Watch as a rich data pipeline is constructed that sources data from Kafka, stores it in the Apache Iceberg table format, and serves it to consumers via Trino.
A cornerstone requirement of an Icehouse (Iceberg + Trino) is data ingestion. One approach is to leverage Apache NiFi. NiFi, a multimodal data pipelining tool, offers a multitude of processors that can be assembled into a flow to address your specific scenarios. NiFi's low-code/no-code approach allows data engineers to rapidly build, deploy, and monitor their data ingestion & transformation pipelines. NiFi also supports custom processor development in a variety of languages, including Java and Python.
This presentation will walk through a few common approaches and ultimately demonstrate a rich data pipeline that sources data from Kafka, performs typical transformation processing (including enrichment), and loads data into a high-performance Iceberg table that will be consumed via Trino.
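To give a flavor of the enrichment step described above, here is a minimal Python sketch of the kind of record-level logic a custom NiFi Python processor might apply between Kafka and an Iceberg table. The lookup table, field names, and function are hypothetical illustrations, not NiFi's actual processor API:

```python
import json

# Hypothetical static lookup table used for enrichment.
REGION_LOOKUP = {"DE": "EMEA", "US": "AMER", "JP": "APAC"}

def enrich_record(raw: bytes) -> bytes:
    """Parse a Kafka message, add a derived field, and re-serialize it."""
    record = json.loads(raw)
    # Enrichment: map a country code to a sales region, defaulting to UNKNOWN.
    record["region"] = REGION_LOOKUP.get(record.get("country"), "UNKNOWN")
    return json.dumps(record).encode("utf-8")
```

In a NiFi flow, equivalent logic would live inside a custom Python processor's transform method and run against each FlowFile as it moves from the Kafka consumer toward the Iceberg writer.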
Stream, Store, Scale
Level: Beginner
Lester Martin is a seasoned developer advocate, trainer, blogger, and data engineer focused on data pipelines & data lake analytics using Trino, Iceberg, Hive, Spark, Flink, Kafka, NiFi, NoSQL databases, and, of course, classical RDBMSs. Check out Lester's blog at https://lestermartin.blog.