Instant recovery with write-ahead logging [electronic resource] : page repair, system restart, media restore, and system failover / Goetz Graefe, Wey Guy, Caetano Sauer.

Graefe, Goetz., author.
Second edition.
San Rafael, California : Morgan & Claypool, 2016.
Synthesis digital library of engineering and computer science.
Synthesis lectures on data management ; 44.
Synthesis lectures on data management ; 44
1 online resource (xvii, 113 pages) : illustrations.
Data recovery (Computer science)
Data logging.
Traditional theory and practice of write-ahead logging and of database recovery focus on three failure classes: transaction failures (typically due to deadlocks) resolved by transaction rollback; system failures (typically power or software faults) resolved by restart with log analysis, "redo," and "undo" phases; and media failures (typically hardware faults) resolved by restore operations that combine multiple types of backups and log replay. The recent addition of single-page failures and single-page recovery has opened new opportunities far beyond the original aim of immediate, lossless repair of single-page wear-out in novel or traditional storage hardware. In the contexts of system and media failures, efficient single-page recovery enables on-demand incremental "redo" and "undo" as part of system restart or media restore operations. This can give the illusion of practically instantaneous restart and restore: instant restart permits processing new queries and updates seconds after system reboot and instant restore permits resuming queries and updates on empty replacement media as if those were already fully recovered. In the context of node and network failures, instant restart and instant restore combine to enable practically instant failover from a failing database node to one holding merely an out-of-date backup and a log archive, yet without loss of data, updates, or transactional integrity. In addition to these instant recovery techniques, the discussion introduces self-repairing indexes and much faster offline restore operations, which impose no slowdown in backup operations and hardly any slowdown in log archiving operations. The new restore techniques also render differential and incremental backups obsolete, complete backup commands on a database server practically instantly, and even permit taking full up-to-date backups without imposing any load on the database server. Compared to the first version of this book, this second edition adds sections on applications of single-page repair, instant restart, single-pass restore, and instant restore. Moreover, it adds sections on instant failover among nodes in a cluster, applications of instant failover, recovery for file systems and data files, and the performance of instant restart and instant restore.
1. Introduction
2. Related prior work
2.1 System model
2.3 Restart after a system failure
2.4 Database backup and log archive
2.5 Restore after a media failure
2.6 Mirroring, log shipping, and failover
2.7 Allocation-only logging
2.8 System transactions
2.9 Summary of prior work
3. Single-page recovery
3.1 Detection of single-page failures
3.2 Recovery for logged updates
3.3 Recovery for non-logged updates
3.4 Chains of log records
3.5 Summary of single-page recovery
4. Applications of single-page recovery
4.1 Self-repairing indexes
4.2 Write elision
4.3 Read elision
4.4 Deferred undo
4.5 Summary of single-page recovery applications
5. Instant restart after a system failure
5.1 Restart techniques
5.2 Restart schedules
5.3 Optimizing log scans
5.4 Summary of instant restart
6. Applications of instant restart
6.1 Parallel "redo" and "undo"
6.2 Distributed transactions
6.3 Fast reboot
6.4 Summary of instant restart applications
7. Single-pass restore
7.1 Partially sorted log archive
7.2 Archiving logic
7.3 Restore logic
7.4 Active transactions
7.5 Summary of single-pass restore
8. Applications of single-pass restore
8.1 Pipeline extensions
8.2 Instant backup
8.3 Virtual backups
8.4 Obsolete incremental backups
8.5 Summary of single-pass restore applications
9. Instant restore after a media failure
9.1 Indexed backup and log archive
9.2 Restore techniques
9.3 Restore schedules
9.4 Summary of instant restore
10. Applications of instant restore
10.1 Pipeline extensions
10.2 Hot restore
10.3 Restore without replacement media
10.4 Online database migration
10.5 Summary of instant restore applications
11. Multiple page, system, and media failures
11.1 Single-page failure during restore
11.2 Single-page failure during restart
11.3 Multiple system failures
11.4 Multiple media failures
11.5 System failure during media restore
11.6 Media failure during system restart
11.7 Summary of recovery after multiple failures
12. Instant failover
12.1 Log shipping and log archiving
12.2 Recovery of server state in a failover
12.3 Recovery of database contents in a failover
12.4 Summary of instant failover
13. Applications of instant failover
13.1 Instant failback
13.2 Failover pools
13.3 Elasticity
13.4 Summary of instant failover applications
14. File systems and data files
14.1 Fault detection
14.2 Fault repair
14.3 Logging small updates
14.4 Logging large updates
14.5 Summary of instant recovery in file systems
15. Performance and scalability
15.1 System failure and restart
15.2 Media failure and restore
15.3 Summary of performance and scalability
16. Conclusions
Author biographies.
Part of: Synthesis digital library of engineering and computer science.
Title from PDF title page (viewed on May 13, 2016).
Includes bibliographical references (pages 109-111).
Guy, Wey., author.
Sauer, Caetano., author.
Other format:
Print version:
Publisher Number:
10.2200/S00710ED2V01Y201603DTM044 doi
Access Restriction:
Restricted for use by site license.
Location Notes Your Loan Policy
Description Status Barcode Your Loan Policy