Reconfiguration in robust distributed real-time systems based on global checkpoints

Download
Author
Puett, Ronnie Douglas
Date
1991-12Advisor
Shukla, Shridhar B.
Yang, Chyan
Metadata
Show full item recordAbstract
Fast, ultra-reliable, real-time computing is fundamental in today's weapons system.
Increased system throughput and reliability can be achieved by utilizing distributed
systems in which a single application program executes on multiple processors,
connected to a network. The distributed nature of such systems make it possible
to tolerate failures and react to overloads without the application level performance
degrading unacceptably. Fault tolerance in these systems typically involves fault
detection and recovery. Repair following failure involves smooth integration of the
repaired processor and subsequent reconfiguration. These actions must take place
transparently, that is without the application program noticing it. Therefore, sufficient
information must be maintained through the use of checkpointing to describe
the state of the system at any time and ensure correct operation after failure/repair. This thesis investigates a possible framework for achieving a fault- tolerant realtime
distributed system which provides transparent function-to-function message
passing, status monitoring using periodic health messages and maintains a globally
consistent system state by carrying out independent checkpointing procedures.
The proposed scheme is simulated using concurrent Ada processing for a four node,
twelve function, distributed system.
Collections
Related items
Showing items related by title, author, creator and subject.
-
Maintaining high availability in distributed mobile systems
Boitnott, Brad P. (Monterey, California. Naval Postgraduate School, 2009-09);Distributed Mobile Systems often require the ability to continue working, even when a major system component fails or there is a fault in the system. In some situations when a distributed mobile system is used, the ... -
Distributed deployment of Therminators in the network
Cheng, Kah Wai (Monterey, California. Naval Postgraduate School, 2004-12);The idea of deploying a distributed network intrusion system using Therminator is explored in this thesis. There are many advantages in having a distributed system compared to a standalone network intrusion system. The ... -
Scheduling and prototyping of distributed real-time systems (an approach using JINI/JAVASPACES)
Demirtas, Tolga. (Monterey, California. Naval Postgraduate School, 2002-03);Scheduling is one of the basic issues in building real-time applications on a distributed computing system. A distributed computing system is typically modeled as a collection of processes interconnected by a communication ...