Using radial basis function neural network to predict dynamic resource availability in heterogeneous distributed environments

Abstract

Today’s large scale distributed platforms comprise thousands of resources from production, educational, and ad hoc environments including Clouds, Grids, P2P, etc. However, finding suitable resources from such a large pool to store large amounts of data and run multi-resource, long-running data processing applications (usually with few or no fault tolerance capabilities) is restricted by the dynamic availability of distributed resources. In addition to resource failures, the resources may be unavailable due to their owners’ policies for sharing their resources as well as the nature of domain they belong to (e.g. P2P systems, non-dedicated desktop Grids etc.). As a result, the availability-aware selection of distributed resources has become a challenging problem for data management, resource provisioning and job scheduling services. To this end, we present a novel resource availability characterization and prediction method for dynamic heterogeneous distributed environments. We identified 14 availability attributes that can be effectively used to model resource availability in dynamic distributed environments. Three data mining methods (particularly the neural network) are proposed to model and predict resource availability using our identified availability attributes. The availability of a resource is predicted for an instant of time as well as for a time duration. Our experiments for 28 different resources in Austrian Grid show that the predictions through the proposed approach are 18% and 31% (on average) more accurate than those by so far the best method (Naive Bayes’ Classifier) for instant and duration availability, respectively.

Keywords

Distributed systems dynamic resource availability resource availability characterization resource availability predictions

Get full access to this article

View all access options for this article.

References

Acharya

, Edjlali

and Saltz

, The utility of exploiting idle workstations for parallel computation, Sigmetrics Perform Eval Rev 25(1) (1997), 225–234.

Alsoghayer

and Djemame

, Resource failures risk assessment modelling in distributed environments, Journal of Systems and Software 88 (2014), 42–53.

Anderson

D.P.

, Boinc: A system for public-resource computing and storage, In IEEE/ACM International Workshop on Grid Computing, Washington, DC, USA, 2004, pp. 4–10.

Andrzejak

, Kondo

and Anderson

D.P.

, Ensuring collective availability in volatile resource pools via forecasting, In Proceedings of the 19th IFIP/IEEE International Workshop on Distributed Systems: Operations and Management: Managing Large-Scale Service Deployment, DSOM ’08, Berlin, Heidelberg, Springer–Verlag, 2008, pp. 149–161.

Bhagwan

, Savage

and Voelker

G.M.

, Understanding availability, In Peer-to-Peer Systems II, Second International Workshop, Berkeley, CA, USA, 2003, pp. 256–267.

Bouyer

, Mohebi

and Abdullah

A.H.

, Using self-announcer approach for resource availability detection in grid environment, In Proceedings of the Fourth International Multi-Conference on Computing in the Global Information Technology, ICCGI ’09, IEEE Computer Society, 2009, pp. 151–156.

Brevik

, Nurmi

and Wolski

, Automatic methods for predicting machine availability in desktop grid and peer-to-peer systems, In Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid, CCGRID ’04, Washington, DC, USA, IEEE Computer Society, 2004, pp. 190–199.

Chun

, et al., Planetlab: An overlay testbed for broad-coverage services, Sigcomm Comput Commun Rev 33(3) (2003), 3–12.

De Salve

, Guidi

and Mori

, Predicting the availability of users’ devices in decentralized online social networks, Concurrency and Computation: Practice and Experience, pp. e4390–n/a.

10.

Finger

, Bezerra

G.C.

and Conde

D.R.

, Resource use pattern analysis for predicting resource availability in opportunistic grids, Concurrency and Computation: Practice and Experience 22(3) (2015), 295–313.

11.

, Hu

, Tang

and Che

, Grid resource prediction based on support vector regression and genetic algorithms, In Fifth International Conference on Natural Computation, ICNC ’09, volume 1, 2009, pp. 499–505.

12.

Iosup

, Jan

, Sonmez

and Epema

D.H.J.

, On the dynamic resources availability in grids, In Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, GRID ’07, Washington, DC, USA, IEEE Computer Society, 2007, pp. 26–33.

13.

Kondo

, Andrzejak

and Anderson

D.P.

, On correlated availability in internet-distributed systems, In Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing, GRID ’08, Washington, DC, USA, 2008, pp. 276–283.

14.

Kondo

, Taufer

, Brooks

, Casanova

and Chien

, Characterizing and evaluating desktop grids: An empirical study, In Proceedings of Parallel and Distributed Processing Symposium 2004, pp. 26–34.

15.

Lerida

J.L.

, Solsona

, Hernandez

, Gine

, Hanzich

and Conde

, State-based predictions with self-correction on enterprise desktop grid environments, Journal of Parallel and Distributed Computing 73(6) (2013), 777–789.

16.

Long

, Muir

and Golding

, A longitudinal survey of internet host reliability, In Proceedings of the 14TH Symposium on Reliable Distributed Systems, SRDS ’95, Washington, DC, USA, IEEE Computer Society, 1995, pp. 2–10.

17.

Mahato

D.P.

and Singh

R.S.

, Maximizing availability for task scheduling in on-demand computingŰbased transaction processing system using ant colony optimization, Concurrency and Computation: Practice and Experience, pp. e4405–n/a.

18.

Nadeem

, Ranking grid-sites based on their reliability for successfully executing jobs of given durations, International Journal of Computer Network and Information Security 5 (2015), 9–15.

19.

Nadeem

, Alghazzawi

, Mashat

, Fakeeh

, Almalaise

and Hagras

, Modeling and predicting execution time of scientific workflows in the grid using radial basis function neural network, Cluster Computing 20(3) (2017), 2805–2819.

20.

Nadeem

, Prodan

and Fahringer

, Characterizing, modeling and predicting dynamic resource availability in a large scale multipurpose grid, In 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008, pp. 348–357.

21.

Nadeem

, Prodan

, Fahringer

and Iosup

, A framework for resource availability characterization and online prediction in the grids, In Grid Computing, Springer US, 2008, pp. 209–224.

22.

Nadeem

, Prodan

, Fahringer

and Keller

, An evaluation of availability comparison and prediction for optimized resource selection in the grid. In Priol

and Vanneschi

, editors, From Grids to Service and Pervasive Computing, Springer US, 2008, pp. 63–76.

23.

Prakash

and Vidyarthi

D.P.

, Maximizing availability for task scheduling in computational grid using genetic algorithm, Concurrency and Computation: Practice and Experience 27(1) (2015), 193–210.

24.

Quinlan

J.R.

, Induction of decision trees, Machine Learning 1(1) (1986), 81–106.

25.

Rahman

, Hassan

M.R.

and Buyya

, Jaccard index based availability prediction in enterprise grids, In Proceedings of International Conference on Computational Science, number 1 in ICCS’10, 2010, pp. 2707–2716.

26.

Ramachandran

, Lutfiyya

and Perry

, Decentralized resource availability prediction for a desktop grid, In 10th IEEE/ACM International ConferenRence on Cluster, Cloud and Grid Computing (CCGrid), 2010, pp. 643–648.

27.

Ramachandran

, Lutfiyya

and Pery

, Decentralized approach to resource availability prediction using group availability in a p2p desktop grid, Future Generation Computer Systems 28(6) (2012), 854–860.

28.

RapidMiner Inc. RapidMiner. http://rapidminer.com/products/rapidminer-studio/, Last accessed on Sep. 2014.

29.

Ren

, Lee

, Eigenmann

and Bagchi

, Prediction of resource availability in fine-grained cycle sharing systems empirical evaluation, Journal of Grid Computing 5(2) (2007), 173–195.

30.

Rood

and Lewis

M.J.

, Multi-state grid resource availability characterization, In Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, GRID ’07, Austin, TX, 2007, pp. 42–49.

31.

Rubio

J.D.J.

, Ricardo Cruz

, Elias

, Ochoa

, Balcazarand

and Aguilar

, Anfis system for classification of brain signals, Journal of Intelligent & Fuzzy Systems, (Preprint) (2019), 1–9.

32.

Schroeder

and Gibson

G.A.

, A large-scale study of failures in high-performance computing systems, In Proceedings of the International Conference on Dependable Systems and Networks, DSN ’06, Washington, DC, USA, IEEE Computer Society, 2006, pp. 249–258.

33.

Shang

, Wang

, Zhou

, Huang

and Cheng

, TM-DG: A trust model based on computer users’ daily behavior for desktop grid platform, In Proceedings of the 2007 Symposium on Component and Framework Technology in High-performance and Scientific Computing, CompFrame ’07, New York, NY, USA, 2007, pp. 59–66. ACM.

34.

Shearer

, The crisp-dm model: The new blueprint for data mining, Journal of Data Warehousing 5(4) (2000).

35.

Singh

and Kaur

, Resource grouping in grid environment towards the availability and reliability of computing service, Journal of Advanced Computing 1(1) (2013), 1–8.

36.

Srivastava

and Banicescu

, Robust resource allocations through performance modeling with stochastic process algebra, Concurrency and Computation: Practice and Experience 29(7) (2017), e3894–n/a. e3894 cpe.3894.

37.

Tan

P.-N.

, Steinbach

and Kumar

, Introduction to Data Mining, (First Edition), Addison-Wesley Longman Publishing Co, Inc., Boston, MA, USA, 2005.

38.

Tang

and Iyer

R.K.

, Dependability measurement and modeling of a multicomputer system, IEEE Trans Comput 42(1) (1993), 62–75.

39.

Vrignat

, Avila

, Duculty

and Kratz

, Failure event prediction using hidden markov model approaches, IEEE Transactions on Reliability 64(3) (2015), 1038–1048.

40.

Xiaojuan

and Rudolf

, Empirical studies on the behavior of resource availability in fine-grained cycle sharing systems, In 2006 International Conference on Parallel Processing (ICPP’06), 2006, pp. 3–11.

41.

Zhang

and Han

, State estimation for static neural networks with time-varying delays based on an improved reciprocally convex inequality, IEEE Transactions on Neural Networks and Learning Systems 29(4) (2018), 1376–1381.

42.

Zheng

, Grid resource prediction based on support vector regression and simulated annealing algorithms, Modern Applied Science 4(11) (2010), 97–104.