Title
Data-Partitioning Using The Hilbert Space Filling Curves: Effect On The Speed Of Convergence Of Fuzzy Artmap For Large Database Problems
Keywords
Data mining; Data-partitioning; Fuzzy-ARTMAP; Hilbert space-filling curve
Abstract
The Fuzzy ARTMAP algorithm has been proven to be one of the premier neural network architectures for classification problems. One of the properties of Fuzzy ARTMAP, which can be both an asset and a liability, is its capacity to produce new nodes (templates) on demand to represent classification categories. This property allows Fuzzy ARTMAP to automatically adapt to the database without having to a priori specify its network size. On the other hand, it has the undesirable side effect that large databases might produce a large network size (node proliferation) that can dramatically slow down the training speed of the algorithm. To address the slow convergence speed of Fuzzy ARTMAP for large database problems, we propose the use of space-filling curves, specifically the Hilbert space-filling curves (HSFC). Hilbert space-filling curves allow us to divide the problem into smaller sub-problems, each focusing on a smaller than the original dataset. For learning each partition of data, a different Fuzzy ARTMAP network is used. Through this divide-and-conquer approach we are avoiding the node proliferation problem, and consequently we speedup Fuzzy ARTMAP's training. Results have been produced for a two-class, 16-dimensional Gaussian data, and on the Forest database, available at the UCI repository. Our results indicate that the Hilbert space-filling curve approach reduces the time that it takes to train Fuzzy ARTMAP without affecting the generalization performance attained by Fuzzy ARTMAP trained on the original large dataset. Given that the resulting smaller datasets that the HSFC approach produces can independently be learned by different Fuzzy ARTMAP networks, we have also implemented and tested a parallel implementation of this approach on a Beowulf cluster of workstations that further speeds up Fuzzy ARTMAP's convergence to a solution for large database problems. © 2005 Elsevier Ltd. All rights reserved.
Publication Date
9-1-2005
Publication Title
Neural Networks
Volume
18
Issue
7
Number of Pages
967-984
Document Type
Article
Personal Identifier
scopus
DOI Link
https://doi.org/10.1016/j.neunet.2005.01.007
Copyright Status
Unknown
Socpus ID
24344450095 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/24344450095
STARS Citation
Castro, José; Georgiopoulos, Michael; and Demara, Ronald, "Data-Partitioning Using The Hilbert Space Filling Curves: Effect On The Speed Of Convergence Of Fuzzy Artmap For Large Database Problems" (2005). Scopus Export 2000s. 3772.
https://stars.library.ucf.edu/scopus2000/3772