
Distributed Computing (1987) 2:161–175

© Springer-Verlag 1987

Algorithms for distributed termination detection*

Friedemann Mattern

Department of Computer Science, SFB124, University of Kaiserslautern, P.O. Box 3049, D-6750 Kaiserslautern, Federal Republic of Germany

Abstract. The termination problem for distributed computations is analyzed in the general context of asynchronous communication. In the underlying computational model it is assumed that messages take an arbitrary but finite time and do not necessarily obey the FIFO rule. Time diagrams are used as a graphic means of representing the overall communication scheme, giving a clear insight into the difficulties involved (e.g., lack of global state or time, inconsistent time cuts) and suggesting possible solutions. Several efficient algorithms for the solution of the termination problem are presented. They are all based on the idea of message counting but have a number of different characteristics. The methods are discussed and compared with other known solutions.

Key words: Distributed termination – Termination detection – Asynchronous communication systems – Distributed programming – Decentralized control – Atomic model of computation – Global snapshots – Communication deadlock detection – Global quiescence – Diffusing computation

* This work has been supported by the Deutsche Forschungsgemeinschaft (DFG) as part of the SFB124 research project “VLSI-Design and Parallelism”.
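The difficulty that the abstract attributes to inconsistent time cuts can be illustrated with a small sketch. It is not one of the algorithms of the paper; it only shows, under assumed names (`Process`, `naive_wave_demo`) and a hand-picked schedule of three processes, why comparing the totals of sent and received messages collected by a single wave may be misleading when the processes are visited at different times.

```python
from dataclasses import dataclass


@dataclass
class Process:
    """Hypothetical process holding only its two message counters."""
    sent: int = 0
    received: int = 0


def naive_wave_demo():
    """Replay one fixed schedule and return what a single wave observes."""
    p1, p2, p3 = Process(), Process(), Process()
    in_transit = []  # messages sent but not yet delivered
    snapshot = []    # (sent, received) pairs recorded by the wave, in visit order

    # P2 sends m1 to P3; m1 is slow and stays in transit for the whole run.
    p2.sent += 1
    in_transit.append("m1")

    # The wave visits P1 first and records its counters.
    snapshot.append((p1.sent, p1.received))

    # After being visited, P1 sends m2 to P2, which receives it immediately.
    p1.sent += 1
    in_transit.append("m2")
    in_transit.remove("m2")
    p2.received += 1

    # The wave then visits P2 and finally P3.
    snapshot.append((p2.sent, p2.received))
    snapshot.append((p3.sent, p3.received))

    total_sent = sum(s for s, _ in snapshot)
    total_received = sum(r for _, r in snapshot)
    return total_sent, total_received, in_transit


if __name__ == "__main__":
    sent, received, pending = naive_wave_demo()
    print("wave totals:", sent, received)
    print("still in transit:", pending)
```

In this schedule the wave counts the reception of m2 but not its sending, which hides the fact that m1 has been sent and not yet received. The collected totals agree although a message is still under way, so equality of the two sums along such a cut does not by itself establish termination.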