Authors
Alexandra Olteanu, Florin Pop, Ciprian Dobre, Valentin Cristea
Publication date
2012/5/1
Journal
Computers & Mathematics with Applications
Volume
63
Issue
9
Pages
1409-1423
Publisher
Pergamon
Description
Scheduling is a key component for performance guarantees in distributed applications running in large-scale heterogeneous environments. Another function of the scheduler in such systems is to implement resilience mechanisms that cope with possible faults. In this case, resilience is best approached using dedicated rescheduling mechanisms. The performance of rescheduling is very important in large-scale distributed systems with dynamic behavior. The paper proposes a generic rescheduling algorithm that can use a wide variety of scheduling heuristics, which users select in advance depending on the system’s structure. The rescheduling component is designed as a middleware service that aims to increase the dependability of large-scale distributed systems. The system was evaluated in a real-world implementation for a Grid system. The proposed approach …
Total citations
[Chart: citations per year, 2013 to 2024]