This is the 79th edition of my blog series blog series around Stream Processing and Analytics!
As every week I was also updating the following two lists with the presentations/videos of the current week:
As usual, find below the new blog articles, presentations, videos and software releases from last week:
News and Blog Posts
General
- How to forge a Real-Time Link Between Manufacturing and Actionable Data by
- 14 Tips To Ensure The Deployment Of Your IoT Project Is A Success by Forbes Technology
- No, SQL Isn’t Disappearing by Nick Heudecker
- How We Designed CrateDB as a Realtime SQL DBMS for the Internet of Things by Jodok Batlogg
Apache Kafka / Kafka Streams / Confluent Platform
- Getting Started With Kafka by
- Brewing in Beats: Monitor Kafka logs with Filebeat by Monica Sarbu
- Understanding Kafka Failover by
- Integration Testing for Kafka by Jesse Anderson
- The Simplest Useful Kafka Connect Data Pipeline In The World … or thereabouts—Part 2 by
- Why streaming data is the future of big data, and Apache Kafka is leading the charge by Matt Asay
- How to process streams with Kafka Streams? by Yağız Demirsoy
- Introducing KSQL: Open Source Streaming SQL for Apache Kafka by Neha Narkhede
- A Streaming SQL Engine for Apache Kafka by Tom Smith
- Today is the day! Welcome to Kafka Summit San Francisco 2017! by Tim Berglund
- Kafka Detailed Design and Ecosystem by
- Open Sourcing Kafka Cruise Control by Jiangjie Qin
- Confluent Brings SQL Querying to Kafka Streaming Data by Alex Handy
Apache Pulsar / Apache Heron / Streamlio
- Introduction to Apache Pulsar (incubating) by Matteo Merli & Karthik Ramasamy
- Getting started with the Streamlio Sandbox by Streamlio
- Why Apache Pulsar? Part 1 by Matteo Merli & Karthik Ramasamy
- Why Apache Pulsar? Part 2 by Matteo Merli & Karthik Ramasamy
Apache Flink
Apache Beam
- Powerful and modular IO connectors with Splittable DoFn in Apache Beam by Eugene Kirpichov
Spark Streaming
- Introducing Spark Structured Streaming Support in ES-Hadoop 6.0 by James Baiera
- Anthology of Technical Assets on Apache Spark’s Structured Streaming by
StreamSets
Apache NiFi / Hortonworks HDF
- Using PartitionRecord (GrokReader/JSONWriter) to Parse and Group Log Files by Andrew Lim
- Python Scripts in Apache NiFi by Vasilis Vagias
New Presentations
- Efficient Schemas in Motion with Kafka and Schema Registry by Pat Patterson
- One Data Center is not enough by Gwen Shapira
- State Management in Apache Flink : Consistent Stateful Distributed Stream Processing by Paris Carbone
- Best Practices for Running Kafka on Docker Containers by Nanda Vijaydev
- Multi Cluster, Multi-tenant and Hierarchical Kafka Messaging Service by Allen Wang
- Confluent 3.3 Update and Partner Preview by Robin Moffatt & Gehrig Kunz
- From SMACK to SMAACK by Jörg Schad & Adit Mandan
- Efficient Migration of Very Large Distributed State for Scalable Stream Processing by Bonaventura Del Monte
- Unified Processing with the Samza High-level API by Yi Pan
- Azure Stream Analytics Project: On-demand real-time analytic by Stratos Gounidellis
- Building Event Driven Services with Stateful Streams by Ben Stopford
- On Sampling from Massive Graph Streams by Nesreen Ahmed
- Real-Time Big Data by Handaru Sakti
- Big Data Hadoop Apex App for Device to Mobile, GPS Tracking by Venkatesh Kottapalli & Vikram Patil
New Videos
- Raw Livestream: Kafka Summit San Francisco 2017 Keynote by Confluent
- KSQL from Confluent | Streaming SQL for Apache Kafka by Confluent
- Microservices Explained by Confluent by Confluent
- Visualizing and Analyzing Salesforce Data with StreamSets and Neo4j by Pat Patterson
New Releases
Upcoming Events
- 30.8.2017 (Dublin, IR) – The SMACK Stack (Meetup)
- 31.8.2017 (online) – Simplifying real-time architectures for IoT with Apache Kudu (Cloudera Webinar)
- 31.8.2017 (San Francisco, US) – TensorFlow +Kafka +GPU +Mesos +Kubernetes: Build Google ML Engine from Scratch! (Meetup)
- 5.9.2017 (San Francisco, US) – Kafka Steams: Are your streams keeping up? Monitoring for a streaming world (Meetup)
- 7.9.2017 (Berlin, GE) – Apache Flink Meetup Berlin @SAP (Meetup)
- 13.9.2017 (Tel Aviv, IL) – Apache Kafka from zero to hero – Our first TLV Kafka Meetup! (Meetup)
- 14.9.2017 (online) – Microservices and the Future of Data (Confluent Webinar)
- 16.9.2017 (McLean, US) – Spark Saturday DC (Meetup)
- 18.9.2017 (Zurich, CH) – First Kafka Meetup Zurich (Meetup)
- 19.9.2017 (online) – Real-time marketing analytics with stream processing (O’Reilly Webcast)
- 20.9.2017 (Denver, US) – Production grade container orchestration for fast data applications (Meetup)
Please let me know if that is of interest. Please tweet your projects, blog posts, and meetups to @gschmutz to get them listed in next week’s edition!