Bài giảng Hệ phân tán - Chương 8: Fault tolerance
Số trang: 50
Loại file: pdf
Dung lượng: 3.33 MB
Lượt xem: 13
Lượt tải: 0
Xem trước 5 trang đầu tiên của tài liệu này:
Thông tin tài liệu:
Bài giảng Hệ phân tán - Chương 8: Fault tolerance. Chương này cung cấp cho sinh viên những nội dung gồm: introduction to fault tolerance; process resilience; reliable client-server communication; reliable group communication; distributed commit; recovery;... Mời các bạn cùng tham khảo chi tiết nội dung bài giảng!
Nội dung trích xuất từ tài liệu:
Bài giảng Hệ phân tán - Chương 8: Fault tolerance 1 Trần Hải Anh – Distributed System CHƯƠNG 8: FAULT TOLERANCETS. Trần Hải Anh Content2 1. Introduction to fault tolerance 2. Process resilience 3. Reliable client-Server Communication 4. Reliable Group Communication 5. Distributed Commit 6. Recovery Trần Hải Anh – Distributed System3 1. Introduction to fault tolerance 1.1. Basic concept 1.2. Failure models 1.3. Failure masking by redundancy Trần Hải Anh – Distributed System 1.1. Basic concept4 ¨ Being fault tolerant related to Dependable systems which cover: ¤ Availability ¤ Reliability ¤ Safety ¤ Maintainability • Fail/Fault • Fault Tolerance • Transient Faults • Intermittent Faults • Permanent Faults Trần Hải Anh – Distributed System 1.2. Failure models5 ¨ Different types of failures Typeoffailure Descrip0on Crashfailure Aserverhalts,butisworkingcorrectlyun8lithalts Omissionfailure Aserverfailstorespondtoincomingrequests Receiveomission Aserverfallstoreceiveincomingmessages Sendomission Aserverfallstosendmessages Timingfailure Aserversresponseliesoutsidethespecified8meinterval Responsefailure Aserversresponseisincorrect Valuefailure Thevalueoftheresponseiswrong Statetransi8onfailure Theserverdeviatesfromthecorrectflowofcontrol Arbitraryfailure Aservermayproducearbitraryresponsesatarbitrary8mes Fail-stopfailure Aserverstopsproducingoutputanditshal8ngcanbedetectedbyothersystems Fail-silentfailure Anotherprocessmayincorrectlyconcludethataserverhashalted Fail-safe Aserverproducesrandomoutputwhichisrecognizedbyotherprocessesasplainjunk Trần Hải Anh – Distributed System 1.3. Failure masking by redundancy6 ¨ Three possible kinds for masking failure ¤ Information redundancy ¤ Time redundancy ¤ Physical redundancy ¨ Triple Modular Redundancy (TMR) 2. Process resilience7 2.1. Design issues 2.2. Failure masking and replication 2.3. Agreement in faulty system 2.4. Failure detection Trần Hải Anh – Distributed System 2.1. Design issues (1/3)8 ¨ Process group ¤ Key approach: organize several identical processes into a group ¤ Key property: message is sent to the group itself and all members receive it ¤ Dynamic: create, destroy, join or leave Trần Hải Anh – Distributed System 2.1. Design issues (2/3)9 • Flat Groups versus Hierarchical Groups ¤ Comparison Advantages Disadvantages Symmetrical Nosinglepointoffailure FlatGroups Complicateddecisionmaking Groups8llcon8nueswhileoneoftheprocesses crashes Loss of coordinator brings the HierarchicalGroups Easydecisionmaking grouptohalt 2.1. Group membership(3/3)10 • Group Server Approach - Send request - Maintain databases of all groups - Maintain their memberships Disadvantages - A single point of failure • Distributed way Approach - each member communicates directly to all others Disadvantages - Fail-stop semantics are not appropriate - Leaving and joining must be synchronous with data messages being sent • Membership issues What happens when multiple machines crash at the same time? 2.2. Failure masking and Replication11 • Primary-based protocols - Used in form of primary-backup protocol - Organize group of processes in hierarchy - Backups execute election algorithm to choose a new primary • Rep ...
Nội dung trích xuất từ tài liệu:
Bài giảng Hệ phân tán - Chương 8: Fault tolerance 1 Trần Hải Anh – Distributed System CHƯƠNG 8: FAULT TOLERANCETS. Trần Hải Anh Content2 1. Introduction to fault tolerance 2. Process resilience 3. Reliable client-Server Communication 4. Reliable Group Communication 5. Distributed Commit 6. Recovery Trần Hải Anh – Distributed System3 1. Introduction to fault tolerance 1.1. Basic concept 1.2. Failure models 1.3. Failure masking by redundancy Trần Hải Anh – Distributed System 1.1. Basic concept4 ¨ Being fault tolerant related to Dependable systems which cover: ¤ Availability ¤ Reliability ¤ Safety ¤ Maintainability • Fail/Fault • Fault Tolerance • Transient Faults • Intermittent Faults • Permanent Faults Trần Hải Anh – Distributed System 1.2. Failure models5 ¨ Different types of failures Typeoffailure Descrip0on Crashfailure Aserverhalts,butisworkingcorrectlyun8lithalts Omissionfailure Aserverfailstorespondtoincomingrequests Receiveomission Aserverfallstoreceiveincomingmessages Sendomission Aserverfallstosendmessages Timingfailure Aserversresponseliesoutsidethespecified8meinterval Responsefailure Aserversresponseisincorrect Valuefailure Thevalueoftheresponseiswrong Statetransi8onfailure Theserverdeviatesfromthecorrectflowofcontrol Arbitraryfailure Aservermayproducearbitraryresponsesatarbitrary8mes Fail-stopfailure Aserverstopsproducingoutputanditshal8ngcanbedetectedbyothersystems Fail-silentfailure Anotherprocessmayincorrectlyconcludethataserverhashalted Fail-safe Aserverproducesrandomoutputwhichisrecognizedbyotherprocessesasplainjunk Trần Hải Anh – Distributed System 1.3. Failure masking by redundancy6 ¨ Three possible kinds for masking failure ¤ Information redundancy ¤ Time redundancy ¤ Physical redundancy ¨ Triple Modular Redundancy (TMR) 2. Process resilience7 2.1. Design issues 2.2. Failure masking and replication 2.3. Agreement in faulty system 2.4. Failure detection Trần Hải Anh – Distributed System 2.1. Design issues (1/3)8 ¨ Process group ¤ Key approach: organize several identical processes into a group ¤ Key property: message is sent to the group itself and all members receive it ¤ Dynamic: create, destroy, join or leave Trần Hải Anh – Distributed System 2.1. Design issues (2/3)9 • Flat Groups versus Hierarchical Groups ¤ Comparison Advantages Disadvantages Symmetrical Nosinglepointoffailure FlatGroups Complicateddecisionmaking Groups8llcon8nueswhileoneoftheprocesses crashes Loss of coordinator brings the HierarchicalGroups Easydecisionmaking grouptohalt 2.1. Group membership(3/3)10 • Group Server Approach - Send request - Maintain databases of all groups - Maintain their memberships Disadvantages - A single point of failure • Distributed way Approach - each member communicates directly to all others Disadvantages - Fail-stop semantics are not appropriate - Leaving and joining must be synchronous with data messages being sent • Membership issues What happens when multiple machines crash at the same time? 2.2. Failure masking and Replication11 • Primary-based protocols - Used in form of primary-backup protocol - Organize group of processes in hierarchy - Backups execute election algorithm to choose a new primary • Rep ...
Tìm kiếm theo từ khóa liên quan:
Bài giảng Hệ phân tán Hệ phân tán Fault tolerance Process resilience Reliable client-server communication Reliable group communicationGợi ý tài liệu liên quan:
-
Khảo sát, đánh giá một số thuật toán xử lý tương tranh cập nhật dữ liệu trong các hệ phân tán
7 trang 184 0 0 -
Chương 6 Thuật toán loại trừ tương hỗ và bầu cử
45 trang 89 0 0 -
Báo cáo Hệ tin học phân tán: Vấn đề bế tắc trong hệ tập trung và hệ phân tán
48 trang 35 0 0 -
Chương 1 Giới thiệu tổng quan về hệ phân tán
66 trang 27 0 0 -
NHẬP MÔN CƠ SỞ DỮ LIỆU PHÂN TÁN - TS NGUYỄN BÁ TƯỜNG
591 trang 26 0 0 -
26 trang 24 0 0
-
73 trang 22 0 0
-
Bài giảng Hệ phân tán - Chương 3: Quản lý tiến trình và luồng
70 trang 21 0 0 -
Cung cấp tài nguyên truyền thông cho hệ phân tán trong máy ảo
4 trang 21 0 0 -
Bài giảng Hệ phân tán - Chương 1: Tổng quan về hệ phân tán
32 trang 21 0 0