Structured Streaming is a scalable and fault-tolerant stream processing engine built on the Spark SQL engine. inshort Structure streaming rebuild streaming functionalities on top of Catalyst and Data Frames.
Structured Streaming is the first API to build stream processing on top of SQL engine,in...
Friday, 18 August 2017
Saturday, 12 August 2017
Apache Spark Broadcast vs Accumulators
Standard
Broadcast variable :
Broadcast variable is a read-only variable that is made available from the driver program that runs the SparkContext object to the nodes that will execute the computation. This is very useful in applications that need to make the same data available to the worker nodes in an...
Sunday, 8 January 2017
ApacheGeode/GemFire Real-time and Historical Metrics with Grafana-Visualize and Analyse
Standard
Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing.
Geode is a data management platform that provides real-time, consistent access to data-intensive...
Subscribe to:
Posts (Atom)