This is the 20th installment of my blog series around Stream Processing and Analytics.
As usual, find below the new blog articles, presentations, videos and software releases from last week:
News and Blog Posts
General
- Big Data or Bad Data? Survey Shows Enterprises Struggle to Manage Big Data Flows by Brittney Danon
- Avoid These Five Big Data Governance Mistakes by Alex Woodie
- RESTful, Real-Time, Streaming Analytics and IoT by Jordan Thomas-Green
- Making Sense of Stream Processing by Martin Kleppmann
- Survey Shows Enterprises Struggling with Bad Data by
- 4 Predictions For How Data Transforms Everything by Scott Gnau
- OpsClarity survey reveals 92% of companies are increasing investment in real-time analysis of human and machine-generated data by SD Times
- RESTful, Real-Time, Streaming Analytics and IoT by Jordan Thomas-Green
Apache Flink
Apache Spark Streaming
- Manjeet Chayel Analyze Realtime Data from Amazon Kinesis Streams Using Zeppelin and Spark Streaming by Manjeet Chayel
- How-to: Detect and Report Web-Traffic Anomalies in Near Real-Time by
- Monitoring Apache Spark Streaming: Understanding Key Metrics by
Apache Kafka
- Distributed, Real-time Joins and Aggregations on User Activity Events using Kafka Streams by Michael Noll
- Data Ingestion with Apache Flume sending events to Apache Kafka by Rafael Salerno
- Oracle GoldenGate Big Data Adapter: Apache Kafka Producer by Loren Penton
- Just Enough Kafka For The Elastic Stack, Part 2 by Suyog Rao
- Why Streaming? with Apache Kafka by Lewis Gavin
- Stream Processing Hard Problems – Part 1: Killing Lambda by Kartik Paramasivam
- Build and monitor Kafka pipelines with Confluent Control Center by Joseph Adler
- How to read to the end of a Kafka topic by James Cheng
Apache Beam / Google Dataflow
- A Quick Demo of Apache Beam with Docker by Emanuele Cesena
Apache NiFi
- Prescient Transforms 48,000+ Data Sources in Real Time with Apache NiFi by Anna Yong
- Using Solr’s Extracting Request Handler with Apache NiFi by Bryan Bende
StreamSets
- Ingesting Sensor Data on the Raspberry Pi with StreamSets Data Collector by Pat Patterson
- How To Install StreamSets On The MapR Sandbox by C. Warman
New Presentations
- Fast Data Overview by Chuck Scyphers
- Getting Started with Amazon Kinesis by Amazon
- Dataflow with Apache NiFi by Aldrin Piri
New Videos
- Building a Real-time Streaming Platform by Neha Narkhede
- Streaming ETL for All by Joey Echeverria
- Monitoring Big Data Streaming Applications & Apache Apex (Hadoop) operations byDavid Yan
New Releases
Upcoming Events
- 6/27/2016 (San Jose, US) – Building Big Data applications with Apache Beam and Apache Apex (Meetup)
- 6/27/2016 (San Jose, US) – NiFi Meetup at Hadoop Summit (Meetup)
- 6/28/2016 (London, UK) – Crowdmix – An Event Based Social Music Platform & Kafka 0.10 New Features (Meetup)
- 7/5/2016 (San Francisco, US) – Building (and running) Netflix’s Data Pipeline using Apache Kafka (Meetup)
- 7/6/2016 (Atlanta, US) – StreamSets, For The Coding Minimalist In All of Us (Meetup)
- 7/14/2016 (Princeton, US) – Apache NiFi (Meetup)
- 7/14/2016 (Austin, US) – Kafka-Streams talk (Meetup)
- 7/14/2016 (San Francisco, US) – Expert Panel on Streaming Analytics Technologies (Meetup)
- 7/19/2016 (Munich, GE) – Apache Apex: Stream Processing Architecture and Applications (Meetup)
- 7/19/2016 (Taipei, TW) – Stream Processing with Apache Flink (Meetup)
- 7/20/2016 (online) – Streaming data ingest and processing with Kafka (Confluent Webinar)
- 7/21/2016 (San Francisco, US) – Apache Spark Streaming with Apache NiFi and Apache Kafka (Meetup)
- 7/28/2016 (Toronto, CA) – Apache NiFi presentation by Joe Witt (Meetup)
- 8/2(2016 (online) – Streaming Analytics for Real-Time Action – Best Practices for Getting Started (TDWI Webinar)
- 8/18/2016 (New York, US) – Apache NiFi – MiNiFi: Taking Dataflow Management to the Edge (Meetup)
Please let me know if that is of interest. Please tweet your projects, blog posts, and meetups to @gschmutz to get them listed in next week’s edition!