Authors
David J DeWitt, Jeffrey F Naughton, Donovan A Schneider, Srinivasan Seshadri
Publication date
1992
Publisher
University of Wisconsin-Madison Department of Computer Sciences
Description
We present an approach to dealing with skew in parallel joins in database systems. Our approach is easily implementable within current parallel DBMS, and performs well on skewed data without degrading the performance of the system on non-skewed data. The main idea is to use multiple algorithms, each specialized for a different degree of skew, and to use a small sample of the relations being joined to determine which algorithm is appropriate. We developed, implemented, and experimented with four new skew-handling parallel join algorithms; one, which we call virtual processor range partitioning, was the clear winner in lhigh skew cases, while traditional hybrid hash join was the clear winner in lower skew or no skew cases. We present experimental results from an implementation of all four algorithms on the Gamma parallel database machine. To our knowledge, these are the first reported skew-handling …
Total citations
199219931994199519961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024519171621179133913991278152013122025271620171181551752
Scholar articles
DJ DeWitt, JF Naughton, DA Schneider, S Seshadri - 1992
DSDJDWJ Naughton, S Seshardi - Proceeding of the International Conference on Very …, 1992