An Analytic Performance Model of Disk Arrays and its Application

Edward K. Lee and Randy H. Katz

EECS Department
University of California, Berkeley
Technical Report No. UCB/CSD-91-660
November 1991

http://www2.eecs.berkeley.edu/Pubs/TechRpts/1991/CSD-91-660.pdf

As disk arrays become widely used, tools for understanding and analyzing their performance become increasingly important. In particular, performance models can be invaluable in both configuring and designing disk arrays. Accurate analytic performance models are desirable over other types of models because they can be quickly evaluated, are applicable under a wide range of system and workload parameters, and can be manipulated by a range of mathematical techniques. Unfortunately, analytic performance models of disk arrays are difficult to formulate due to the presence of queuing and fork-join synchronization; a disk array request is broken up into independent disk requests which must all complete to satisfy the original request. In this paper, we develop, validate and apply an analytic performance model for disk arrays. We derive simple equations for approximating their utilization, response time and throughput. We then validate the analytic model via simulation and investigate the accuracy of each approximation used in deriving the analytic model. Finally, we apply the analytic model to derive an equation for the optimal unit of data striping in disk arrays.


BibTeX citation:

@techreport{Lee:CSD-91-660,
    Author = {Lee, Edward K. and Katz, Randy H.},
    Title = {An Analytic Performance Model of Disk Arrays and its Application},
    Institution = {EECS Department, University of California, Berkeley},
    Year = {1991},
    Month = {Nov},
    URL = {http://www2.eecs.berkeley.edu/Pubs/TechRpts/1991/6145.html},
    Number = {UCB/CSD-91-660},
    Abstract = {As disk arrays become widely used, tools for understanding and analyzing their performance become increasingly important. In particular, performance models can be invaluable in both configuring and designing disk arrays. Accurate analytic performance models are desirable over other types of models because they can be quickly evaluated, are applicable under a wide range of system and workload parameters, and can be manipulated by a range of mathematical techniques. Unfortunately, analytic performance models of disk arrays are difficult to formulate due to the presence of queuing and fork-join synchronization; a disk array request is broken up into independent disk requests which must all complete to satisfy the original request. In this paper, we develop, validate and apply an analytic performance model for disk arrays. We derive simple equations for approximating their utilization, response time and throughput. We then validate the analytic model via simulation and investigate the accuracy of each approximation used in deriving the analytic model. Finally, we apply the analytic model to derive an equation for the optimal unit of data striping in disk arrays.}
}

EndNote citation:

%0 Report
%A Lee, Edward K.
%A Katz, Randy H.
%T An Analytic Performance Model of Disk Arrays and its Application
%I EECS Department, University of California, Berkeley
%D 1991
%@ UCB/CSD-91-660
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/1991/6145.html
%F Lee:CSD-91-660