This is the 75th edition of my blog series blog series around Stream Processing and Analytics!
As every week I was also updating the following two lists with the presentations/videos of the current week:
As usual, find below the new blog articles, presentations, videos and software releases from last week:
News and Blog Posts
General
- Key Requirements for Streaming Platforms: A Micro-Services Advantage – Whiteboard Walkthrough (Part 1) by Ted Dunning
- Streaming Data: How to Move from State to Flow – Whiteboard Walkthrough (Part 2) by Ted Dunning
- Real-Time Anomaly Detection Streaming Microservices with H2O and MapR – Part 2: Modeling by Mathieu Dumoulin
- Streaming for Personalization Datasets at Netflix by
Apache Kafka / Kafka Streams / Confluent Platform
- How to extract change data events from MySQL to Kafka using Debezium by Vlad Mihalcea
- Chain Services with Exactly-Once Guarantees by Ben Stopford
- What does Kafka’s exactly-once processing really mean? by Adam Warski
- Apache Kafka consumer groups … don’t use them in the “wrong” way ! by Paolo Patierno
- Exactly-once, once more by Jay Kreps
- Exactly-once or not, atomic broadcast is still impossible in Kafka – or anywhere by Henry Robinson
- Self-Learning Kafka Streams With Scala: Part 1 by Himanshu Gupta
- All your streaming data are belong to Kafka by Matt Asay
Spark Streaming
Apache Flink
- On Designing a Stream Processing Benchmark by Stephan Ewen, Kostas Tzoumas and Michael Winters
Apache NiFi / Hortonworks HDF
- Data Ingestion using Apache NiFi for Building Data Lakes – Twitter Data – Part 1 Using Apache Spark,Flink,Beam,Redshift by Navdeep Singh Gill
New Presentations
- Apache Kafka in Adobe Ad Cloud’s Analytics Platform by Michael Schiff & Vikram Patankar
- Interactive Realtime Dashboards on Data Streams using Kafka, Druid and Superset by Nishant Bangarwa
- Couchbase and Apache Kafka – Bridging the gap between RDBMS and NoSQL by Tyler Mitchell & David Tucker
- Powering Microservices with Docker, Kubernetes, Kafka, & MongoDB by Andrew Morgan
- Real-time Streaming Applications on AWS, Patterns and Use Cases by Ryan Nienhuis
New Videos
- Apache Flink – Stateful Stream Processing (dataArtisans) & CEP (GetInData) Kostas Kloudas & Dawid Wysakowicz
- Building a community fountain around your data stream by Maria Patterson
- Data processing with Apache Beam by Sourabh Bajaj
- Batch and Streaming Processing in the World of Data Engineering and Data Science by Keira Zhou
- Emerging Prevalence of Data Streaming in Analytics and it’s Business Significance by Mike Gualtieri
- Online Change Point Detection Using Spark Streaming by Michal Monselise
New Releases
Upcoming Events
- 3.8.2017 (online) – How Yelp Leapt to Microservices with More than a Message Queue (Confluent Webinar)
- 9.8.2017 (Bellevue, US) – Spark Structured Streaming : Introduction and Internals (Meetup)
- 9.8.2017 (Sydney, AU) – Data-in-Motion: Recent advances in Apache projects for streaming data (Meetup)
- 10.8.2017 (online) – Why VR Needed Stream Processing to Survive (Confluent Webinar)
- 15.8.2017 (Milwaukee, US) – Sensors, Spark and Kafka: Applied Machine Learning (Meetup)
- 16.8.2017 (online) – Pandora Plays Nicely Everywhere with Real-Time Data Pipelines (Confluent Webinar)
- 16.8.2017 (online) – Data Warehouse Modernization: Accelerating Time-to-Action (MapR Webinar)
- 24.8.2017 (San Francisco, US) – Visualizing and Analyzing Salesforce Data with StreamSets and Neo4j (Meetup)
- 27.8.2017 (online) – Embrace Streaming Analytics and Transform Your Business (AWS Webinar)
- 16.9.2017 (McLean, US) – Spark Saturday DC (Meetup)
- 19.9.2017 (online) – Real-time marketing analytics with stream processing (O’Reilly Webcast)
Please let me know if that is of interest. Please tweet your projects, blog posts, and meetups to @gschmutz to get them listed in next week’s edition!