This is the 55th edition of my blog series blog series around Stream Processing and Analytics!
Every week I’m also updating the following two lists with the presentations/videos of the current week:
As usual, find below the new blog articles, presentations, videos and software releases from last week:
News and Blog Posts
General
- Hazelcast Launches an Open Source In-Memory Stream Processing Engine by Susan Hall
- Evaluation Guide to Streaming Analytics by evam
Spark Streaming
- Streaming Wikipedia edits with Spark and Clojure by Joel Wilsson
Apache Kafka / Kafka Streams / Confluent Platform
- Streaming databases in realtime with MySQL, Debezium, and Kafka by Chris Riccomini
- Join us for Kafka Summit Hackathon in New York City by Michael Noll
- Connecting ICS and Apache Kafka via REST Proxy API by Ricardo Ferreira
- Apache Kafka: Multiple ways for Produce or Push Message to Kafka topics by Harmeet Singh
- Apache Kafka: Multiple ways for Consume or Read messages from Kafka Topic by Harmeet Singh
- Create a VM Image With Apache Kafka Configured Using Vagrant and Ansible by Andy Boyle
Apache Storm
Apache Flink
- Stream Processing with Apache Flink by Fabien Hueske & Vasiliki Kalavri
StreamSets
- Running Scala Code in StreamSets Data Collector by Pat Patterson
Apache NiFi / Hortonworks HDF
- List/Fetch pattern and Remote Process Group in Apache NiFi by Pierre Villard
New Presentations
- The Data Dichotomy- Rethinking the Way We Treat Data and Services by Ben Stopford
- Containerizing Distributed Pipes by Hagen Toennies
- Kafka as message broker by Haluan Irsad
- Part 1: Lambda Architectures: Simplified by Apache Kudu by Michael Crutcher
- Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic Databases by Alex Gutow & Ryan Lippert
- Kick-Start with SMACK Stack by Sandeep Purohit
- Reactive Fast Data & the Data Lake with Akka, Kafka, Spark by Todd Fritz
- Scalable Real-time Complex Event Processing at Uber by Shuyi Chen
- Streaming all the things with akka streams by Johan Andren
- Apex & Geode: In-memory streaming, storage & analytics by Ashish Tadose
- Complex Event Processing with Esper by Ted Won
- Streaming Data Analytics with Amazon Kinesis Firehose and Redshift by Ray Zhu
New Videos
- The Data Dichotomy- Rethinking the Way We Treat Data and Services by Ben Stopford
- Asynchronous Processing and Multithreading in Apache Samza by Xinyu Liu
- Developing Fast Data Architectures with Streaming Applications by Karl Wehden
- Batching to Streaming Analytics at Optimizely by Mike Davis & Hao Xia
- SSD Benchmarks on Kafka by Mingmin Chen
New Releases
New Books
- Stream Processing with Apache Flink by Fabian Hueske & Vasiliki Kalavri
- Spark: The Definitive Guide – Big data processing made simple by Bill Chambers & Matei Zaharia
- Learning Apache Flink by Tanmay Deshpande
Upcoming Events
- 1.3.2017 (Phoenix, US) – Bio-manufacturing Optimization using Apache NiFi, Kafka and Spark (Meetup)
- 1.3.2017 (Leeds, UK) – Kafka’s Role in Implementing Oracle’s Big Data Reference Architecture (YoDB#7)
- 2.3.2017 (Amsterdam, NL) – Apache Flink’s Stateful Operators And Table SQL Api (Meetup)
- 2.3.2017 (San Francisco, US) – Streaming with MapR and StreamSets Data Collector (Meetup)
- 7.3.2017 (Santa Monica, US) – Stream processing with R in AWS (Meetup)
- 7.3.2017 (Austin, US) – Stream All the Things! (Meetup)
- 21.3.2017 (online) – Getting Started with Streaming Data and Stream Processing with Apache Kafka (KPI Webinar)
- 23.3.2017 (San Francisco, US) – Bay Area Apache Spark Meetup @ Intel in Santa Clara (Meetup)
- 28.3.2017 (Princeton, US) – Apache NiFi: Ingesting Enterprise Data @ Scale (Meetup)
- 29.3.2017 (online) – Part 2: Building Event-Driven Services with Apache Kafka (Confluent Webinar)
- 25.4.2017 (online) – Part 3: Putting the Micro into Microservices with Stateful Stream Processing (Confluent Webinar)
Please let me know if that is of interest. Please tweet your projects, blog posts, and meetups to @gschmutz to get them listed in next week’s edition!