World of Software
Mondrian Conformal Prediction for Disk Health Scoring and Scrubbing Optimization | HackerNoon

News Room
Last updated: 2025/10/07 at 4:26 AM
News Room Published 7 October 2025

Table of Links

Abstract and 1. Introduction

  2. Motivation and design goals

  3. Related Work

  4. Conformal prediction

    4.1. Mondrian conformal prediction (MCP)

    4.2. Evaluation metrics

  5. Mondrian conformal prediction for Disk Scrubbing: our approach

    5.1. System and Storage statistics

    5.2. Which disk to scrub: Drive health predictor

    5.3. When to scrub: Workload predictor

  6. Experimental setting and 6.1. Open-source Baidu dataset

    6.2. Experimental results

  7. Discussion

    7.1. Optimal scheduling aspect

    7.2. Performance metrics and 7.3. Power saving from selective scrubbing

  8. Conclusion and References

2. Motivation and design goals

In data centers, a significant number of unhealthy drives go undetected because their failure attributes are latent, and they eventually end in fail-stop scenarios. One common way to mitigate such scenarios is disk scrubbing, a background scanning process that verifies disk data to identify bad sectors. However, scrubbing consumes energy and, depending on when it is triggered, can degrade performance. This raises concerns in the industry, especially as disk capacities increase. We notice a missing link in addressing "which disk to scrub" and "when to scrub", and in setting the scrub-cycle frequency accordingly, while minimizing the performance impact on the storage array and maximizing reliability. In this paper, we consider the following objectives and design approaches to tackle this challenge:

• Which disk to scrub? Depending on how it is implemented, scrubbing can temporarily degrade a drive's performance. To keep the drive fast and responsive, it is crucial to minimize how often it is scrubbed. Instead of scrubbing every disk in the storage array, our approach selectively scrubs only the disks that require it, thereby reducing the overall time needed to complete the process.

• When to scrub? We can optimize the disk scrubbing schedule by considering factors such as the system workload, the importance of the data on the drive, and the availability of resources. This ensures that scrubbing is performed at the most appropriate times, minimizing the impact on overall system performance.
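The two objectives above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Disk` type, the `[0, 1]` health score, the `0.8` threshold, the hourly load forecast, and the one-disk-per-hour pairing are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Disk:
    disk_id: str
    health: float  # hypothetical score in [0, 1]; higher means healthier


def select_disks(disks, threshold=0.8):
    """Which disk to scrub: keep disks below the (assumed) health threshold,
    least healthy first."""
    return sorted((d for d in disks if d.health < threshold),
                  key=lambda d: d.health)


def pick_idle_hours(hourly_load, hours_needed):
    """When to scrub: return the indices of the least-loaded hours,
    in chronological order."""
    ranked = sorted(range(len(hourly_load)),
                    key=lambda h: (hourly_load[h], h))
    return sorted(ranked[:hours_needed])


def plan_scrub(disks, hourly_load, threshold=0.8):
    """Pair each selected disk with one idle hour (one disk per hour is an
    assumption); the least healthy disk gets the earliest idle hour."""
    targets = select_disks(disks, threshold)
    hours = pick_idle_hours(hourly_load, len(targets))
    return [(d.disk_id, h) for d, h in zip(targets, hours)]


if __name__ == "__main__":
    disks = [Disk("a", 0.95), Disk("b", 0.40), Disk("c", 0.70)]
    forecast = [0.9, 0.2, 0.5, 0.1]  # hypothetical utilization per hour
    print(plan_scrub(disks, forecast))
```

Here disk `a` is skipped entirely, which is where the time and energy savings of selective scrubbing would come from. A real scheduler would also have to weigh data importance and resource availability, which this sketch ignores.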

3. Related Work

Storage device reliability has long been a critical concern in the industry, and existing solutions often rely on failure analysis of storage systems. However, traditional methods such as accelerated life tests (Cho et al., 2015) have not proven to be reliable indicators of actual failure rates in production environments. Recent machine learning-based approaches, such as multivariate time-series (Yu, 2019) and time-series classification (Ircio et al., 2022), have focused on improving model accuracy but often lack deep integration of domain knowledge. Moreover, the multi-modal approach of Lu et al. (2020), which uses performance metrics (disk-level and server-level) and disk spatial location, focuses only on fail-stop scenarios and may therefore not help in detecting latent failures. A more recent study (Lu et al., 2023) has addressed this issue by investigating grey failures (fail-slow drives), using a regression model to pinpoint and analyze fail-slow failures at the granularity of individual drives.

Another important factor in disk scrubbing is the implementation cost and power consumption. Mi et al. (2008) and Jiang et al. (2019) address the performance degradation caused by scrubbing and propose assigning a lower priority to the background process so that it runs during idle time, i.e., when the disk drive is not actively processing data or performing other tasks. Liu et al. (2010) and Oprea and Juels (2010) propose methods to mitigate power consumption and to determine when to scrub in systems with inexpensive data, but these require designing another method to identify less critical data. Pâris et al. (2010) discuss drive space management when a failed disk is replaced, along with reducing the need for frequent scrubbing. Zhang et al. (2020) propose multilevel scrubbing, using a Long Short-Term Memory (LSTM) model to detect latent sector errors in a binary classification setup. However, machine learning-based models of this kind may treat healthy and relatively less healthy disks the same, leading to unnecessary scrubbing of healthy disks.

To the best of our knowledge, our work is the first to adopt Mondrian conformal prediction to assign a health score to each individual disk drive and to use the resulting metrics to design a scrubbing cycle aligned with system idle time.
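For readers unfamiliar with the technique, the following is a minimal sketch of the label-conditional (Mondrian) conformal p-value, in which a test object is compared only against calibration examples of the same class. The function names, the `"healthy"`/`"failing"` labels, and the nonconformity scores are hypothetical. Reading the p-value of the healthy class as a health score is an assumption made here for illustration, not necessarily the authors' definition.

```python
def mondrian_p_values(cal_scores, cal_labels, test_scores):
    """Label-conditional conformal p-values.

    cal_scores:  nonconformity score of each calibration example,
                 computed with respect to its true label.
    cal_labels:  true label of each calibration example.
    test_scores: dict mapping each candidate label to the test object's
                 nonconformity score under that label.
    """
    p_values = {}
    for label, score in test_scores.items():
        # Mondrian step: only calibration examples of this class count.
        same_class = [s for s, y in zip(cal_scores, cal_labels) if y == label]
        at_least = sum(1 for s in same_class if s >= score)
        # The +1 terms account for the test object itself.
        p_values[label] = (at_least + 1) / (len(same_class) + 1)
    return p_values


def prediction_set(p_values, epsilon):
    """Labels that cannot be rejected at significance level epsilon."""
    return {label for label, p in p_values.items() if p > epsilon}


if __name__ == "__main__":
    cal_scores = [0.1, 0.2, 0.3, 0.8, 0.2, 0.6]          # hypothetical
    cal_labels = ["healthy"] * 4 + ["failing"] * 2
    test_scores = {"healthy": 0.25, "failing": 0.9}
    p = mondrian_p_values(cal_scores, cal_labels, test_scores)
    print(p, prediction_set(p, epsilon=0.4))
```

Because each class is calibrated separately, the error-rate guarantee holds per class, which matters when failing drives are much rarer than healthy ones.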

:::info
This paper is available on arXiv under the CC BY-NC-ND 4.0 Deed (Attribution-NonCommercial-NoDerivs 4.0 International) license.

:::


:::info
Authors:

(1) Rahul Vishwakarma, California State University Long Beach, 1250 Bellflower Blvd, Long Beach, CA 90840, United States ([email protected]);

(2) Jinha Hwang, California State University Long Beach, 1250 Bellflower Blvd, Long Beach, CA 90840, United States ([email protected]);

(3) Soundouss Messoudi, HEUDIASYC – UMR CNRS 7253, Université de Technologie de Compiègne, 57 avenue de Landshut, 60203 Compiègne Cedex – France ([email protected]);

(4) Ava Hedayatipour, California State University Long Beach, 1250 Bellflower Blvd, Long Beach, CA 90840, United States ([email protected]).

:::
