Tertiary Disk: Large Scale Distributed Storage

Nisha Talagala, Satoshi Asami, Thomas Anderson and David Patterson

EECS Department
University of California, Berkeley
Technical Report No. UCB/CSD-98-989
May 1997

http://www2.eecs.berkeley.edu/Pubs/TechRpts/1998/CSD-98-989.pdf

In the past 5 years, disk costs have been falling at a factor of 2 per year. Today, terabyte capacity disk storage systems are feasible. Given the rapidly increasing areal density and disk transfer rates, these systems will have significant cost/performance advantages over tape libraries of similar capacity. If commodity hardware is used, large disk systems can avoid the high cost of custom designed disk arrays, as well as their limitations on scalability. This paper presents Tertiary Disk, a 3TB disk storage system built from commodity hardware. Tertiary Disk uses PCs and switched networks to connect 370 8GB disks. We show that even though commodity hardware is used, the overall system can be more reliable than a single disk. A cost analysis of our prototype shows that the additional infrastructure needed to create a terabyte scale storage system is a fraction of the cost of the underlying disks. In comparison, the costs of large disk arrays are many times the cost of the underlying disks. We also present performance measurements from our prototype, and show that the PC architecture is a good match for hosting a large number of disks. Overall, we show that storage systems designs like Tertiary Disk have cost/performance and reliability advantages over most choices available today for terabyte scale storage.


BibTeX citation:

@techreport{Talagala:CSD-98-989,
    Author = {Talagala, Nisha and Asami, Satoshi and Anderson, Thomas and Patterson, David},
    Title = {Tertiary Disk: Large Scale Distributed Storage},
    Institution = {EECS Department, University of California, Berkeley},
    Year = {1997},
    Month = {May},
    URL = {http://www2.eecs.berkeley.edu/Pubs/TechRpts/1997/5431.html},
    Number = {UCB/CSD-98-989},
    Abstract = {In the past 5 years, disk costs have been falling at a factor of 2 per year. Today, terabyte capacity disk storage systems are feasible. Given the rapidly increasing areal density and disk transfer rates, these systems will have significant cost/performance advantages over tape libraries of similar capacity. If commodity hardware is used, large disk systems can avoid the high cost of custom designed disk arrays, as well as their limitations on scalability. This paper presents Tertiary Disk, a 3TB disk storage system built from commodity hardware. Tertiary Disk uses PCs and switched networks to connect 370 8GB disks. We show that even though commodity hardware is used, the overall system can be more reliable than a single disk. A cost analysis of our prototype shows that the additional infrastructure needed to create a terabyte scale storage system is a fraction of the cost of the underlying disks. In comparison, the costs of large disk arrays are many times the cost of the underlying disks. We also present performance measurements from our prototype, and show that the PC architecture is a good match for hosting a large number of disks. Overall, we show that storage systems designs like Tertiary Disk have cost/performance and reliability advantages over most choices available today for terabyte scale storage.}
}

EndNote citation:

%0 Report
%A Talagala, Nisha
%A Asami, Satoshi
%A Anderson, Thomas
%A Patterson, David
%T Tertiary Disk: Large Scale Distributed Storage
%I EECS Department, University of California, Berkeley
%D 1997
%@ UCB/CSD-98-989
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/1997/5431.html
%F Talagala:CSD-98-989