This is the 42nd installment of my blog series on Stream Processing and Analytics.
As usual, find below the new blog articles, presentations, videos and software releases from last week:
News and Blog Posts
General
- Real-time data visualization and machine learning for London traffic analysis by Justin Kestelyn
- Stream Processing Myths Debunked by Kostas Tzoumas
- An Introduction to stream processing systems: Kafka, AWS Kinesis and Azure Event Hubs by Jason Smith
- Drizzle: Fast and Adaptable Stream Processing at Scale by Shivaram Venkataraman et al.
- 15 Questions and Answers From Apache NiFi, Kafka, and Storm: Better Together
Apache Kafka
- Kafka Connect – java.lang.IncompatibleClassChangeError by Robin Moffatt
- Processing Twitter Data with Kafka Streams by Sönke Liebau
Apache Storm
- Apache Storm – What? When? Why? by Sakshi Kumari
Apache Spark Streaming
- Monitoring Real-Time Uber Data Using Spark Machine Learning, Streaming, and the Kafka API (Part 1) by Carol McDonald
Apache NiFi / Hortonworks DataFlow (HDF)
- Apache NiFi Class Loading by Bryan Bende
- First Impressions of Apache NiFi by Eric Pugh
- Enterprise NiFi: Implementing Reusable Components and a Software Development Lifecycle by Greg Keys
New Presentations
- Kappa Architecture, IoT of the cars by Juan Tomás García
- Streaming Engines for Big Data by Stavros Kontopoulos
- How to use Kafka to understand the English Premier League by Talend
- Data Stream Analytics – Why they are important by Paris Carbone
- Building Resilient Log Aggregation Pipeline with Elasticsearch & Kafka by Rafał Kuć
New Videos
- Streaming, Databases & Distributed Systems – Bridging the Divide by Ben Stopford
- Structured Streaming for Machine Learning in Apache Spark by Holden Karau & Seth Hendrickson
- Streaming Stock Market Data with Apache Spark and Kafka by John O’Neill
New Releases
Upcoming Events
- 11/29/2016 (Berlin, DE) – Apache Flink Meetup @ data Artisans (Meetup)
- 11/29/2016 (Vienna, AT) – Spark 2.0 and Couchbase (Meetup)
- 11/30/2016 (Tel Aviv, IL) – Introducing StreamSets – Creating Scalable and Resilient Data Pipelines (DataZone Event)
- 12/1/2016 (online) – A Practical Guide to Selecting a Stream Processing Technology (Confluent Online Talk series)
- 12/5/2016 (New York, US) – Taking DataFlow Management to the Edge with Apache NiFi/MiNiFi (Meetup)
- 12/6/2016 (Lisbon, PT) – A Deep-dive into Structured Streaming (Meetup)
- 12/8/2016 (New York, US) – Streaming Analytics in a Flash – presented by Cask Data (Meetup)
- 12/8/2016 (Atlanta, US) – Building Event Data Pipelines with Kafka and Hadoop (Meetup)
- 12/8/2016 (Singapore, SG) – Stream Processing and TensorFlow 101 (Google Meetup)
- 12/8/2016 (Vancouver, CA) – Connecting All Things with Apache Kafka (Meetup)
- 12/15/2016 (online) – Streaming in Practice: Putting Apache Kafka in Production (Confluent Online Talk series)
- 12/17/2016 (Fremont, US) – Spark Saturday – Hands-on Workshop with Apache Spark 2.x on Databricks (Meetup)
- 1/18/2017 (London, UK) – Instrumenting Apache Kafka (Meetup)
Please let me know if this is of interest. Please tweet your projects, blog posts, and meetups to @gschmutz to get them listed in next week’s edition!