kafka-spark-data-pipeline
Building a Data Pipeline with Kafka, Spark Streaming and Cassandra 1. Overview Apache Kafka is a scalable, high performance, low latency platform that allows reading and writing streams of data like a messaging system. We can start with Kafka in Java fairly easily. Spark Streaming is part of the Apache Spark platform that… Continue Reading kafka-spark-data-pipeline