The Design of a Group Communication System for Distributed and Complex Systems

F. Lassoued, R. Bouallegue

Abstract


Distributed and complex systems are increasingly vulnerable to failures because of their growing complexity and distribution. This problem motivates the need for a fault tolerance mechanism in such systems. We consider two approaches: (1) distributed algorithms, and (2) the development of component systems that make failure detection and repair much easier. This paper focuses on the first approach, and specifically on the problem of a reliable replication mechanism in grid environments. As a first task, we define a new group communication system that extends the FSR protocol developed at INRIA/EPFL. Because FSR gives interesting results on a cluster of homogeneous machines, this work adapts it to grid environments. The following part of the paper presents the architecture of a fault-tolerant distributed system. To reach this objective, the paper is organized as follows: (1) an algorithmic review of group communication systems; (2) a definition of what a group communication system is; (3) a presentation of the group communication system FSR [2]; (4) an introduction to our group communication system for grid environments, denoted FSRG; and (5) a simulation in J-Sim to evaluate the performance of FSRG.

Keywords


Fuzzy Cognitive Maps; Fuzzy Inference; Complex System; Analysis of Impacts; Ci (Configuration Item).

References


R. Ekwall and A. Schiper. Modeling and Validating the Performance of Atomic Broadcast Algorithms in High Latency Networks. In Euro-Par 2007.

R. Guerraoui, R. R. Levy, B. Pochon, and V. Quéma. High Throughput Total Order for Cluster Environments. In IEEE International Conference on Dependable Systems and Networks (DSN 2006), June 2006.

R. Ekwall, A. Schiper, and P. Urbán. Token-based atomic broadcast using unreliable failure detectors. In Proc. of the 23rd Symposium on Reliable Distributed Systems (SRDS 2004), Florianopolis, Brazil, Oct. 2004.

A. Mostefaoui and M. Raynal. Solving Consensus using Chandra-Toueg’s Unreliable Failure Detectors: A Synthetic Approach. In 13th. Intl. Symposium on Distributed Computing (DISC’99). Springer Verlag, LNCS 1693, September 1999.

T. Anker, D. Dolev, G. Greenman, and I. Shnayderman. Evaluating total order algorithms in WAN. In Proc. International Workshop on Large-Scale Group Communication, Florence, Italy, October 2003.

T. D. Chandra and S. Toueg. Unreliable failure detectors for reliable distributed systems. Journal of the ACM, 43(2):225–267, 1996.

X. Défago, A. Schiper, and P. Urbán. Comparative performance analysis of ordering strategies in atomic broadcast algorithms. IEICE Trans. on Information and Systems, E86-D(12):2698–2709, 2003.

V. Hadzilacos and S. Toueg. A modular approach to fault-tolerant broadcasts and related problems. TR 94-1425, Dept. of Computer Science, Cornell University, Ithaca, NY, USA, May 1994.

G. Chockler, I. Keidar, and R. Vitenberg. Group Communication Specifications: A Comprehensive Study. ACM Computing Surveys, 33(4):1–43, December 2001.

P. Urbán, I. Shnayderman, and A. Schiper. Comparison of failure detectors and group membership: Performance study of two atomic broadcast algorithms. In Proc. of the Int’l Conf. on Dependable Systems and Networks (DSN), pages 645–654, June 2003.

http://www.j-sim.org.

F. Cappello, E. Caron, M. Dayde, F. Desprez, E. Jeannot, Y. Jegou, S. Lanteri, J. Leduc, N. Melab, G. Mornet, R. Namyst, P. Primet, and O. Richard. Grid’5000: a large scale, reconfigurable, controlable and monitorable Grid platform. In Grid’2005 Workshop, Seattle, USA, November 13-14, 2005.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.