Distributed Program Reliability Analysis

V. K.Prasanna Kumar, Salim Hariri, C. S. Raghavendra

Research output: Contribution to journalArticlepeer-review

83 Scopus citations

Abstract

The reliability of distributed processing systems can be ex- pressed in terms of the reliability of the processing elements that run the programs, the reliability of the processing elements holding the required files, and the reliability of the communication links used in file transfers. We introduce two reliability measures, namely, distributed program reliability and distributed system reliability to accurately model the reliability of distributed systems. The first measure describes the probability of successful execution of a distributed program which runs on some processing elements and needs to communicate with other processing elements for remote files, while the second measure describes the probability that all the programs of a given set can run successfully. The notion of minimal file spanning trees is introduced to efficiently evaluate these reliability measures. Graph theory techniques are used to systematically generate file spanning trees that provide all the required connections. Our technique is general and can be used in a dynamic environment for efficient reliability evaluation.

Original languageEnglish (US)
Pages (from-to)42-50
Number of pages9
JournalIEEE Transactions on Software Engineering
VolumeSE-12
Issue number1
DOIs
StatePublished - Jan 1986
Externally publishedYes

Keywords

  • Distributed program
  • distributed system
  • graph
  • reliability
  • spanning tree
  • theory

ASJC Scopus subject areas

  • Software

Fingerprint Dive into the research topics of 'Distributed Program Reliability Analysis'. Together they form a unique fingerprint.

Cite this