Practical Schemes using Logs for Lightweight Recoverable DSM

Y. Kim, S. Park, S.R. Maeng (Korea)


Distributed System, Software Distributed Shared Memory, Fault-tolerance System, Message Logging


In existing Fault-Tolerant Software Distributed Shared Memory (FT-SDSM) systems that use message logging, the logs are used only to recover failed nodes. In our previous work, we implemented a lightweight logging protocol for SDSM fault tolerance, called remote logging, which incurs low logging overhead by using a fast network and remote memory for backup data. In this paper, we propose two practical schemes that put the logs to further use and enhance our base remote logging protocol. In the proposed schemes, the logs are used to reduce the stall time spent updating invalid pages, which reduces the failure-free execution time.
