Article - Main Track

A New Approach of Parallelism and Load Balance for the Apriori Algorithm

Author: BOLINA, A. C., PEREIRA, D. A., ESMIN, A. A. A., PEREIRA, M. R. et al.

Abstract: The main goal of data mining is to discover relevant information on digital content. The Apriori algorithm is widely used to this objective, but its sequential version has a low performance when execu- ted over large volumes of data. Among the solutions for this problem is the parallel implementation of the algorithm, and among the parallel implementations presented in the literature that based on Apriori, it highlights the DPA (Distributed Parallel Apriori) [10]. This paper presents the DMTA (Distributed Multithread Apriori) algorithm, which is based on DPA and exploits the parallelism level of threads in order to increase the performance. Besides, DMTA can be executed over heterogeneous hardware platform, using different number of cores. The results showed that DMTA outperforms DPA, presents load balance among processes and threads, and it is effective in current multicore architectures.

Key Words:Apriori, Parallelization of Algorithm, MPI, OpenMP.

Full text (in Portuguese)

Complete Reference:: BOLINA, A. C., PEREIRA, D. A., ESMIN, A. A. A., PEREIRA, M. R. et al. , "Uma Nova Proposta de Paralelismo e Balanceamento de Carga Para o Algoritmo Apriori", Revista de Sistemas de Informação da FSMA n 11(2013) pp. 33-41