FPGA based distributed self healing architecture for reusable systems

Ali Akoglu, Adarsha Sreeramareddy, Jeff G. Josiah

Research output: Contribution to journalArticle

8 Citations (Scopus)

Abstract

Creating an environment of "no doubt" for computing systems is critical for supporting next generation science, engineering, and commercial applications. With reconfigurable devices such as Field Programmable Gate Arrays (FPGAs), designers are provided with a seductive tool to use as a basis for sophisticated but highly reliable platforms. Reconfigurable computing platforms potentially offer the enhancement of reliability and recovery from catastrophic failures through partial and dynamic reconfigurations; and eliminate the need for redundant hardware resources typically used by existing fault-tolerant systems. We propose a two-level self-healing methodology to offer 100% availability for mission critical systems with comparatively less hardware overhead and performance degradation. Our proposed system first undertakes healing at the node-level. Failing to rectify the system at the node-level, network-level healing is then undertaken. We have designed a system based on Xilinx Virtex-5 FPGAs and Cirronet wireless mesh nodes to demonstrate autonomous wireless healing capability among networked node devices. Our prototype is a proof-of-concept work which demonstrates the feasibility of using FPGAs to provide maximum computational availability in a critical self-healing distributed architecture.

Original languageEnglish (US)
Pages (from-to)269-284
Number of pages16
JournalCluster Computing
Volume12
Issue number3
DOIs
StatePublished - 2009

Fingerprint

Field programmable gate arrays (FPGA)
Availability
Hardware
Recovery
Degradation

Keywords

  • Adaptive
  • FPGA
  • Partial reconfiguration
  • Reconfigurable
  • Self-healing

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Cite this

FPGA based distributed self healing architecture for reusable systems. / Akoglu, Ali; Sreeramareddy, Adarsha; Josiah, Jeff G.

In: Cluster Computing, Vol. 12, No. 3, 2009, p. 269-284.

Research output: Contribution to journalArticle

Akoglu, Ali ; Sreeramareddy, Adarsha ; Josiah, Jeff G. / FPGA based distributed self healing architecture for reusable systems. In: Cluster Computing. 2009 ; Vol. 12, No. 3. pp. 269-284.
@article{025e45b44b1d485fbdb23c77bd7f07b6,
title = "FPGA based distributed self healing architecture for reusable systems",
abstract = "Creating an environment of {"}no doubt{"} for computing systems is critical for supporting next generation science, engineering, and commercial applications. With reconfigurable devices such as Field Programmable Gate Arrays (FPGAs), designers are provided with a seductive tool to use as a basis for sophisticated but highly reliable platforms. Reconfigurable computing platforms potentially offer the enhancement of reliability and recovery from catastrophic failures through partial and dynamic reconfigurations; and eliminate the need for redundant hardware resources typically used by existing fault-tolerant systems. We propose a two-level self-healing methodology to offer 100{\%} availability for mission critical systems with comparatively less hardware overhead and performance degradation. Our proposed system first undertakes healing at the node-level. Failing to rectify the system at the node-level, network-level healing is then undertaken. We have designed a system based on Xilinx Virtex-5 FPGAs and Cirronet wireless mesh nodes to demonstrate autonomous wireless healing capability among networked node devices. Our prototype is a proof-of-concept work which demonstrates the feasibility of using FPGAs to provide maximum computational availability in a critical self-healing distributed architecture.",
keywords = "Adaptive, FPGA, Partial reconfiguration, Reconfigurable, Self-healing",
author = "Ali Akoglu and Adarsha Sreeramareddy and Josiah, {Jeff G.}",
year = "2009",
doi = "10.1007/s10586-009-0082-2",
language = "English (US)",
volume = "12",
pages = "269--284",
journal = "Cluster Computing",
issn = "1386-7857",
publisher = "Kluwer Academic Publishers",
number = "3",

}

TY - JOUR

T1 - FPGA based distributed self healing architecture for reusable systems

AU - Akoglu, Ali

AU - Sreeramareddy, Adarsha

AU - Josiah, Jeff G.

PY - 2009

Y1 - 2009

N2 - Creating an environment of "no doubt" for computing systems is critical for supporting next generation science, engineering, and commercial applications. With reconfigurable devices such as Field Programmable Gate Arrays (FPGAs), designers are provided with a seductive tool to use as a basis for sophisticated but highly reliable platforms. Reconfigurable computing platforms potentially offer the enhancement of reliability and recovery from catastrophic failures through partial and dynamic reconfigurations; and eliminate the need for redundant hardware resources typically used by existing fault-tolerant systems. We propose a two-level self-healing methodology to offer 100% availability for mission critical systems with comparatively less hardware overhead and performance degradation. Our proposed system first undertakes healing at the node-level. Failing to rectify the system at the node-level, network-level healing is then undertaken. We have designed a system based on Xilinx Virtex-5 FPGAs and Cirronet wireless mesh nodes to demonstrate autonomous wireless healing capability among networked node devices. Our prototype is a proof-of-concept work which demonstrates the feasibility of using FPGAs to provide maximum computational availability in a critical self-healing distributed architecture.

AB - Creating an environment of "no doubt" for computing systems is critical for supporting next generation science, engineering, and commercial applications. With reconfigurable devices such as Field Programmable Gate Arrays (FPGAs), designers are provided with a seductive tool to use as a basis for sophisticated but highly reliable platforms. Reconfigurable computing platforms potentially offer the enhancement of reliability and recovery from catastrophic failures through partial and dynamic reconfigurations; and eliminate the need for redundant hardware resources typically used by existing fault-tolerant systems. We propose a two-level self-healing methodology to offer 100% availability for mission critical systems with comparatively less hardware overhead and performance degradation. Our proposed system first undertakes healing at the node-level. Failing to rectify the system at the node-level, network-level healing is then undertaken. We have designed a system based on Xilinx Virtex-5 FPGAs and Cirronet wireless mesh nodes to demonstrate autonomous wireless healing capability among networked node devices. Our prototype is a proof-of-concept work which demonstrates the feasibility of using FPGAs to provide maximum computational availability in a critical self-healing distributed architecture.

KW - Adaptive

KW - FPGA

KW - Partial reconfiguration

KW - Reconfigurable

KW - Self-healing

UR - http://www.scopus.com/inward/record.url?scp=70349608063&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70349608063&partnerID=8YFLogxK

U2 - 10.1007/s10586-009-0082-2

DO - 10.1007/s10586-009-0082-2

M3 - Article

AN - SCOPUS:70349608063

VL - 12

SP - 269

EP - 284

JO - Cluster Computing

JF - Cluster Computing

SN - 1386-7857

IS - 3

ER -