Reliable Distributed Network Management by Replication



A. L. dos Santos
Dept. Computer Science,
Federal University of Minas Gerais,
Belo Horizonte,
Minas Gerais,
Brazil
Email: aldri_AT_dcc.ufmg.br

E. P. Duarte Jr.
Dept. Informatics,
Federal University of Paraná,
Curitiba, Paraná,
Brazil
Email: elias_AT_inf.ufpr.br

G. M. Keeni
Cyber Solutions Inc.,
Aoba-ku Sendai-shi,
Miyagi, Japan
Email: glenn_AT_cysols.com



Abstract
This paper presents a new clustering architecture for SNMP agents that supports semi-active replication of managed objects. A cluster of agents provides fault-tolerant monitoring: the replicated managed objects of both crashed and working agents in a given cluster can be accessed through a peer cluster. The proposed architecture is structured in three layers. The lower layer corresponds to the managed objects at the network elements. The middle layer contains management entities called clusters, which monitor and replicate managed objects. The upper layer allows management clusters to be defined, along with the relationships between clusters. A practical tool was implemented and is presented. The impact of replication on network performance is evaluated, and a probabilistic analysis of the consistency of replicated objects is given.

Keywords: Distributed Management, Fault Management, Dependability, Replication, SNMP
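
The three layers described in the abstract can be illustrated with a small in-memory sketch. This is not the authors' tool and uses no SNMP library: the class names (Agent, Cluster), the method names, and the choice to store replicas as a dictionary inside each peer cluster are all assumptions made for illustration. It shows only the basic idea that a cluster copies the managed objects of its member agents to its peer clusters, so that the last replicated values remain readable through a peer after an agent crashes.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Agent:
    """Lower layer (hypothetical model): an agent holding managed objects."""
    name: str
    objects: Dict[str, int]
    alive: bool = True


@dataclass
class Cluster:
    """Middle layer (hypothetical model): monitors members, replicates to peers."""
    name: str
    members: List[Agent]
    peers: List["Cluster"] = field(default_factory=list)
    # Replicas received from other clusters, keyed by (agent name, object name).
    replicas: Dict[Tuple[str, str], int] = field(default_factory=dict)

    def poll_and_replicate(self) -> None:
        """Read every object of every live member and copy it to each peer."""
        for agent in self.members:
            if not agent.alive:
                continue  # a crashed agent cannot be polled; old replicas stay
            for oid, value in agent.objects.items():
                for peer in self.peers:
                    peer.replicas[(agent.name, oid)] = value

    def read_replica(self, agent_name: str, oid: str) -> int:
        """Return the last value replicated to this cluster for the object."""
        return self.replicas[(agent_name, oid)]


def define_peering(source: Cluster, target: Cluster) -> None:
    """Upper layer (hypothetical model): declare that source replicates to target."""
    source.peers.append(target)
```

A possible usage, under the same assumptions: create two clusters, call define_peering(cluster_a, cluster_b), run cluster_a.poll_and_replicate(), then mark a member of cluster_a as not alive. A call to cluster_b.read_replica(...) still returns the value observed at the last poll, which is also why replica consistency depends on how often polling happens.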

JNSM: Vol. 12, No. 2, 2004



NOTE: Only the abstract of this paper is available online; please contact your library or the authors for the full paper.
