Abstract: Storage systems exhibit silent data corruptions that go unnoticed until too late, potenially resulting in whole trees of lost data. To deal with this, we've integrated a checksumming mechanism into Linux's Multi-Device Software RAID layer so that we are able to detect and correct these silent data corruptions. The analysis of our naive implementation shows that this can be done with a reasonable performance overhead.
Click here to download a PDF of our paper.
Click here to download a tar of our software.
Click here to download a PDF of our presentation slides.