Fault Tolerance in Distributed Systems

Author: Pankaj Jalote
Publisher: Pearson Education (US)
Edition: US ed
ISBN:

9780133013672

Pages: 448
Publication Date: 01 May 1994
Format: Paperback
Availability: In Print

Limited stock is available. It will be ordered for you and shipped pending supplier's limited stock.

Our Price $197.13 Quantity:

Share |

Customer Reviews (0)

Overview

Fault tolerance is an approach by which reliability of a computer system can be increased beyond what can be achieved by traditional methods. While hardware supported fault tolerance has been well-documented, the newer, software supported fault tolerance techniques have remained scattered throughout the literature. Comprehensive and self-contained, this book organizes that body of knowledge with a focus on fault tolerance in distributed systems. (The uniprocess case is treated as a special case of distributed systems.) KEY TOPICS: Treats fault tolerant distributed systems as consisting of levels of abstraction, providing different tolerant services. For researchers/practitioners working in the area of fault tolerance.

Full Product Details

Author: Pankaj Jalote
Publisher: Pearson Education (US)
Imprint: Prentice Hall
Edition: US ed
Dimensions: Width: 18.40cm , Height: 2.20cm , Length: 24.10cm
Weight: 0.830kg
ISBN:

9780133013672

ISBN 10: 0133013677
Pages: 448
Publication Date: 01 May 1994
Audience: College/higher education , Tertiary & Higher Education
Format: Paperback
Publisher's Status: Out of Print
Availability: In Print

Limited stock is available. It will be ordered for you and shipped pending supplier's limited stock.

1. Introduction. Basic Concepts and Definitions. Phases in Fault Tolerance. Overview of Hardware Fault Tolerance. Reliability and Availability. Summary. 2. Distributed Systems. System Model. Interprocess Communication. Ordering of Events and Logical Clocks. Execution Model and System State. Summary. 3. Basic Building Blocks. Byzantine Agreement. Synchronized Clocks. Stable Storage. Fail Stop Processors. Failure Detection and Fault Diagnosis. Reliable Message Delivery. Summary. 4. Reliable, Atomic, and Causal Broadcast. Reliable Broadcast. Atomic Broadcast. Causal Broadcast. 5. Recovering A Consistent State. Asynchronous Checkpointing and Rollback. Distributed Checkpointing. Summary. 6. Atomic Actions. Atomic Actions and Serializability. Atomic Actions in a Centralized System. Commit Protocols. Atomic Actions on Decentralized Data. Summary. 7. Data Replication And Resiliency. Optimistic Approaches. Primary Site Approach. Resiliency with Active Replicas. Voting. Degree of Replication. Summary. 8. Process Resiliency. Resilient Remote Procedure Call. Resiliency with Asynchronous Communication. Resiliency with Synchronous Message Passing. Total Failure and Last Process to Fail. Summary. 9. Software Design Faults. Approaches for Uniprocess Software. Backward Recovery in Concurrent Systems. Forward Recovery in Concurrent Systems. Summary. Bibliography.

Reviews

Author Information

Tab Content 6

Author Website:

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions

Latest Reading Guide

Shopping Cart

Your cart is empty

Mailing List

Fault Tolerance in Distributed Systems

9780133013672

Availability Information

Overview

Full Product Details

9780133013672

Table of Contents

Reviews

Author Information

Tab Content 6

Customer Reviews

Recent Reviews

Countries Available

Sign up now