Title

Performance Of Load Balancing Techniques For Join Operations In Shared-Nothing Database Management Systems

Keywords

Parallel database system; join operation; performance evaluation; load balancing; sampling

Abstract

We investigate various load balancing approaches for hash-based join techniques popular in multicomputer-based shared-nothing database systems. When the tuples are not uniformly distributed among the hash buckets, redistribution of these buckets among the processors is necessary to maintain good system performance. Two recent load balancing techniques, which rely on sampling and incremental balancing, respectively, have been shown to be more robust than conventional methods. The two approaches, however, have not been compared with each other. In this study, we improve these two schemes and implement them, along with a conventional method and a standard join technique that does no load balancing, on an nCUBE/2 parallel computer to compare their performance. Our experimental results indicate that the sampling technique is the better approach. To further evaluate the performance of these techniques under diverse hardware conditions, we also develop a cost model and implement a simulator to perform sensitivity analyses with respect to various hardware parameters. The simulation results show that both sampling and incremental techniques provide noticeable savings over conventional methods, with the sampling approach being more scalable in supporting very large database systems. © 1999 Academic Press.

Publication Date

1-1-1999

Publication Title

Journal of Parallel and Distributed Computing

Volume

56

Issue

1

Number of Pages

17-46

Document Type

Article

Personal Identifier

scopus

DOI Link

https://doi.org/10.1006/jpdc.1998.1507

Scopus ID

0347663851 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/0347663851
