Hierarchical modeling of availability in distributed systems

Salim A Hariri, Hasan Mutlu

Research output: Contribution to journalArticle

27 Citations (Scopus)

Abstract

Distributed computing systems are attractive due to the potential improvement in availability, fault-tolerance, performance, and resource sharing. Modeling and evaluation of such computing systems is an important step in the design process of distributed systems. In this paper, we present a two-level hierarchical model to analyze the availability of distributed systems. At the higher level (user level), the availability of the tasks- (processes) is analyzed using a graph-based approach. At the lower level (component level), detailed Markov models are developed to analyze the component availabilities. These models take into account the hardware/software failures, congestion and collisions in communication links, allocation of resources, and the redundancy level. A systematic approach is developed to apply the two-level hierarchical model to evaluate the availability of the processes and the services provided by a distributed computing environment. This approach is then applied to analyze some of the distributed processes of a real distributed system, Unified Workstation Environment (UWE), that is currently being implemented at AT&T Bell Laboratories.

Original languageEnglish (US)
Pages (from-to)50-56
Number of pages7
JournalIEEE Transactions on Software Engineering
Volume21
Issue number1
DOIs
StatePublished - Jan 1995
Externally publishedYes

Fingerprint

Availability
Distributed computer systems
Fault tolerance
Telecommunication links
Redundancy
Hardware

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Software
  • Electrical and Electronic Engineering

Cite this

Hierarchical modeling of availability in distributed systems. / Hariri, Salim A; Mutlu, Hasan.

In: IEEE Transactions on Software Engineering, Vol. 21, No. 1, 01.1995, p. 50-56.

Research output: Contribution to journalArticle

@article{0731c386d4e242e6adc17853dcea90c2,
title = "Hierarchical modeling of availability in distributed systems",
abstract = "Distributed computing systems are attractive due to the potential improvement in availability, fault-tolerance, performance, and resource sharing. Modeling and evaluation of such computing systems is an important step in the design process of distributed systems. In this paper, we present a two-level hierarchical model to analyze the availability of distributed systems. At the higher level (user level), the availability of the tasks- (processes) is analyzed using a graph-based approach. At the lower level (component level), detailed Markov models are developed to analyze the component availabilities. These models take into account the hardware/software failures, congestion and collisions in communication links, allocation of resources, and the redundancy level. A systematic approach is developed to apply the two-level hierarchical model to evaluate the availability of the processes and the services provided by a distributed computing environment. This approach is then applied to analyze some of the distributed processes of a real distributed system, Unified Workstation Environment (UWE), that is currently being implemented at AT&T Bell Laboratories.",
author = "Hariri, {Salim A} and Hasan Mutlu",
year = "1995",
month = "1",
doi = "10.1109/32.341847",
language = "English (US)",
volume = "21",
pages = "50--56",
journal = "IEEE Transactions on Software Engineering",
issn = "0098-5589",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "1",

}

TY - JOUR

T1 - Hierarchical modeling of availability in distributed systems

AU - Hariri, Salim A

AU - Mutlu, Hasan

PY - 1995/1

Y1 - 1995/1

N2 - Distributed computing systems are attractive due to the potential improvement in availability, fault-tolerance, performance, and resource sharing. Modeling and evaluation of such computing systems is an important step in the design process of distributed systems. In this paper, we present a two-level hierarchical model to analyze the availability of distributed systems. At the higher level (user level), the availability of the tasks- (processes) is analyzed using a graph-based approach. At the lower level (component level), detailed Markov models are developed to analyze the component availabilities. These models take into account the hardware/software failures, congestion and collisions in communication links, allocation of resources, and the redundancy level. A systematic approach is developed to apply the two-level hierarchical model to evaluate the availability of the processes and the services provided by a distributed computing environment. This approach is then applied to analyze some of the distributed processes of a real distributed system, Unified Workstation Environment (UWE), that is currently being implemented at AT&T Bell Laboratories.

AB - Distributed computing systems are attractive due to the potential improvement in availability, fault-tolerance, performance, and resource sharing. Modeling and evaluation of such computing systems is an important step in the design process of distributed systems. In this paper, we present a two-level hierarchical model to analyze the availability of distributed systems. At the higher level (user level), the availability of the tasks- (processes) is analyzed using a graph-based approach. At the lower level (component level), detailed Markov models are developed to analyze the component availabilities. These models take into account the hardware/software failures, congestion and collisions in communication links, allocation of resources, and the redundancy level. A systematic approach is developed to apply the two-level hierarchical model to evaluate the availability of the processes and the services provided by a distributed computing environment. This approach is then applied to analyze some of the distributed processes of a real distributed system, Unified Workstation Environment (UWE), that is currently being implemented at AT&T Bell Laboratories.

UR - http://www.scopus.com/inward/record.url?scp=0029197586&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0029197586&partnerID=8YFLogxK

U2 - 10.1109/32.341847

DO - 10.1109/32.341847

M3 - Article

AN - SCOPUS:0029197586

VL - 21

SP - 50

EP - 56

JO - IEEE Transactions on Software Engineering

JF - IEEE Transactions on Software Engineering

SN - 0098-5589

IS - 1

ER -