This is the 94th edition of my blog series blog series around Stream Processing and Analytics!
As every week I was also updating the following two lists with the presentations/videos of the current week:
As usual, find below the new blog articles, presentations, videos and software releases from last week:
News and Blog Posts
General
- Comparing Pulsar and Kafka: how a segment-based architecture delivers better performance, scalability, and resilience by Sijie Guo
Apache Kafka / Kafka Streams / Confluent Platform
- Validating Topic Configurations in Apache Kafka by Florian Trossbach
- Kafka stream processing via Lenses SQL, scale with Kafka – part 3 by Andrew Stevenson
- Handling GDPR: How does a log forget? by
- If You Want To Access Kafka From Hive, Then Read This by Melliyal Annamalai
Apache Pulsar
Spark Streaming
- Monitoring Spark Streaming with InfluxDB and Grafana by Christian Gügi
StreamSets
- Generate your Avro Schema – Automatically! by Pat Patterson
New Presentations
- What is Kafka & why is it Important? by Lucas Jellema
- Improving Kafka at-least-once performance at Uber by Ying Zheng
- Scan and go with the flow: how I met Kafka by Matteo Ferroni
- Stream Processing using Samza SQL by Srinivasulu Punuru
- Unified Stream Processing at Scale with Apache Samza by Jake Maes
- Anomaly Detection and Spark Implementation by Maxim Shkarayev & Anand Venugopal & Punit Shah
- Netflix Keystone SPaaS by
- Serverless Architecture and Best Practices by Itzik Paz
- How Netflix Monitors Applications in Near Real-time with Amazon Kinesis by John Bennett & Roy Ben-Alta
- Dealing with Drift: Building an Enterprise Data Lake by Pat Patterson
- You Can’t Search Without Data by Bryan Bende
- Nifi by Julio Castro
- Design patterns and best practices for data analytics with Amazon EMR by Jonathan Fritz & Anya Bida
- Regain Control Thanks To Prometheus by Guillaume Lefevre & Etienne Coutaud
- Building IoT applications with Apache Spark and Apache Bahir by Luciano Resende
New Videos
- Improving Kafka at-least-once performance at Uber by Ying Zheng
- Event Bus as Backbone for Decoupled Microservice Choreography by Lucas Jellema
- A journey from batch to streaming with Kafka Streams by Sander Uiterkamp & Jeroen Resoort
- Stream Processing using Samza SQL by Srinivasulu Punuru
Upcoming Events
- 12.12.2017 (online) – Living on the Edge Ultra Lightweight Data Movement for IoT (StreamSets Webinar)
- 18.12.2017 (Frankfurt, GE) – Real-Time Data-Ingest with Apache NiFi for IoT and Streaming Analytics (Meetup)
- 19.12.2017 (Melbourne, AU) – First Kafka Meetup (Meetup)
- 17.1.2018 (Princeton, US) – Advanced Apache NiFi Flows and Updates (Meetup)
Please let me know if that is of interest. Please tweet your projects, blog posts, and meetups to @gschmutz to get them listed in next week’s edition!