Data mining university project
What is S.M.A.R.T.
https://en.wikipedia.org/wiki/S.M.A.R.T.
What Backblaze has done until now and dataset information
https://www.backblaze.com/b2/hard-drive-test-data.html
Analysis example of S.M.A.R.T. data (useful thresholds in comments)
https://www.backblaze.com/blog/hard-drive-smart-stats/
More recent analysis
https://www.backblaze.com/blog/what-smart-stats-indicate-hard-drive-failures/
General statistics
https://www.backblaze.com/blog/hard-drive-reliability-stats-q1-2016/
Dataset
- https://f001.backblazeb2.com/file/Backblaze-Hard-Drive-Data/data_2015.zip
- https://f001.backblaze.com/file/Backblaze-Hard-Drive-Data/data_Q1_2016.zip
- https://f001.backblaze.com/file/Backblaze-Hard-Drive-Data/data_Q2_2016.zip
- https://f001.backblazeb2.com/file/Backblaze-Hard-Drive-Data/data_Q3_2016.zip
- https://f001.backblazeb2.com/file/Backblaze-Hard-Drive-Data/data_Q4_2016.zip
- https://f001.backblaze.com/file/Backblaze-Hard-Drive-Data/data_Q1_2017.zip
Some nice graphs from some poke around with Spark
Capacity and drive count in the 2015,2016 and Q1 2017
Stats about the temperature