
Abstract

Many big data applications require real-time analysis of continuous data streams. Stream Processing Systems (SPSs) are designed to act on real-time streaming data using continuous queries consisting of interconnected operators. The dynamic nature of data streams, for example, fluctuations in data arrival rates and uneven data distribution, can turn an operator into a bottleneck. Scalability is an important factor in SPSs, but correctly detecting a bottleneck operator and scaling it without affecting application execution are challenging. A stateful operator, such as an aggregation or a join, makes the scaling operation more difficult because it involves state management. Current research does not address the efficient scaling of stateful operators, as most approaches stop the application to handle state, which adds significant performance overhead. In this article, the key idea is to detect the bottleneck operator correctly using a runtime bottleneck detection approach and then to scale out this operator and manage its internal state in a way that achieves almost zero latency. For the bottleneck detection process, we define two parameters: alarming_threshold, which marks operators that may become bottlenecks in the future, and scale_out_threshold, which marks an operator that is a bottleneck. To scale out, we present two techniques, active backup and checkpointing. The former starts a Secondary Execution (SE) in the back end at alarming_threshold by partitioning the state and input streams across multiple nodes; this SE replaces the primary node at scale_out_threshold. In the latter, a State Manager (SM) module starts checkpointing state to an external store at alarming_threshold and performs the scale out by managing the state and input stream at scale_out_threshold. The first approach helps achieve the goal of almost zero latency, while the second is a resource-efficient technique. Our results show that both techniques work and meet the desired goals of reducing overall latency during scale out and improving resource utilization.
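The two-threshold scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the load metric (a normalized value between 0 and 1), the threshold values 0.7 and 0.9, the class name `ScaleOutController`, and the action names are all hypothetical and chosen only for illustration. The sketch shows preparation starting once the load reaches alarming_threshold and the actual scale out happening once it reaches scale_out_threshold, for either technique.

```python
from dataclasses import dataclass, field
from enum import Enum


class Level(Enum):
    NORMAL = "normal"
    ALARMING = "alarming"      # operator may become a bottleneck
    BOTTLENECK = "bottleneck"  # operator is a bottleneck


# Hypothetical action names: (preparation step, scale-out step) per technique.
ACTIONS = {
    "active_backup": ("start_secondary_execution",
                      "replace_primary_with_secondary"),
    "checkpointing": ("start_checkpointing",
                      "scale_out_from_checkpoint"),
}


@dataclass
class ScaleOutController:
    strategy: str                      # "active_backup" or "checkpointing"
    alarming_threshold: float = 0.7    # assumed value, not from the article
    scale_out_threshold: float = 0.9   # assumed value, not from the article
    prepared: bool = False
    actions: list = field(default_factory=list)

    def __post_init__(self):
        if self.strategy not in ACTIONS:
            raise ValueError(f"unknown strategy: {self.strategy}")
        if self.alarming_threshold >= self.scale_out_threshold:
            raise ValueError("alarming_threshold must be below scale_out_threshold")

    def classify(self, load: float) -> Level:
        """Map an observed operator load to a detection level."""
        if load >= self.scale_out_threshold:
            return Level.BOTTLENECK
        if load >= self.alarming_threshold:
            return Level.ALARMING
        return Level.NORMAL

    def observe(self, load: float) -> Level:
        """Record the actions triggered by one load measurement."""
        prepare, scale_out = ACTIONS[self.strategy]
        level = self.classify(load)
        if level is not Level.NORMAL and not self.prepared:
            # Preparation begins at alarming_threshold (or later, if the
            # load jumps straight past it).
            self.actions.append(prepare)
            self.prepared = True
        if level is Level.BOTTLENECK:
            # The scale out itself happens at scale_out_threshold.
            self.actions.append(scale_out)
            self.prepared = False
        return level
```

Feeding a rising load series such as 0.5, 0.75, 0.95 to `observe` records one preparation action followed by one scale-out action in `actions`. What happens when the load falls back below alarming_threshold after preparation has started is not covered by the abstract and is left out of the sketch.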



Efficient State Management for Scaling Out Stateful Operators in Stream Processing Systems
