Authors
Shlomi Dolev, Ronen I Kat, Elad M Schiller
Publication date
2010/12/31
Journal
Journal of Computer and System Sciences
Volume
76
Issue
8
Pages
884-900
Publisher
Academic Press
Description
This paper presents a shared-memory self-stabilizing failure detector, asynchronous consensus and replicated state-machine algorithm suite, the components of which can be started in an arbitrary state and converge to act as a virtual state-machine. Self-stabilizing algorithms can cope with transient faults. Transient faults can alter the system state to an arbitrary state and hence, cause a temporary violation of the safety property of the consensus. Started in an arbitrary state, the long lived, memory bounded and self-stabilizing failure detector, asynchronous consensus, and replicated state-machine suite, presented in the paper, recovers to satisfy eventual safety and eventual liveness requirements. Several new techniques and paradigms are introduced. The bounded memory failure detector abstracts away synchronization assumptions using bounded heartbeat counters combined with a balance–unbalance …
Total citations
2011201220132014201520162017201820192020202120222023202432445245444212
Scholar articles
S Dolev, RI Kat, EM Schiller - Journal of Computer and System Sciences, 2010