Benchmarking DHTs with Queries

David Oppenheimer, Joseph M. Hellerstein, Ryan Huebsch and David A. Patterson

EECS Department
University of California, Berkeley
Technical Report No. UCB/CSD-03-1222
2003

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2003/CSD-03-1222.pdf

The recent proliferation of decentralized distributed hash table (DHT) proposals suggests a need for DHT benchmarks, both to compare existing implementations and to guide future innovation. We argue that a DHT-based query engine provides a unified framework for describing workloads and faultloads, injecting them into a DHT, and recording and analyzing the observed system behavior. To illustrate this argument, we describe the possibilities and challenges of using one such DHT database engine, PIER, to describe and instantiate network dataflow patterns, and to measure and report the resulting system performance. Together, these capabilities form the foundation of a benchmarking tool, while the distributed tracing and analysis facilities alone support debugging.


BibTeX citation:

@techreport{Oppenheimer:CSD-03-1222,
    Author = {Oppenheimer, David and Hellerstein, Joseph M. and Huebsch, Ryan and Patterson, David A.},
    Title = {Benchmarking DHTs with Queries},
    Institution = {EECS Department, University of California, Berkeley},
    Year = {2003},
    URL = {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2003/5439.html},
    Number = {UCB/CSD-03-1222},
    Abstract = {The recent proliferation of decentralized distributed hash table (DHT) proposals suggests a need for DHT benchmarks, both to compare existing implementations and to guide future innovation. We argue that a DHT-based query engine provides a unified framework for describing workloads and faultloads, injecting them into a DHT, and recording and analyzing the observed system behavior. To illustrate this argument, we describe the possibilities and challenges of using one such DHT database engine, PIER, to describe and instantiate network dataflow patterns, and to measure and report the resulting system performance. Together, these capabilities form the foundation of a benchmarking tool, while the distributed tracing and analysis facilities alone support debugging.}
}

EndNote citation:

%0 Report
%A Oppenheimer, David
%A Hellerstein, Joseph M.
%A Huebsch, Ryan
%A Patterson, David A.
%T Benchmarking DHTs with Queries
%I EECS Department, University of California, Berkeley
%D 2003
%@ UCB/CSD-03-1222
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2003/5439.html
%F Oppenheimer:CSD-03-1222