Exploiting Highly Qualified Pattern with Frequency and Weight Occupancy
Document Type
Article
Publication Date
1-1-2017
Publication Title
Knowledge and Information Systems
First page number:
1
Last page number:
32
Abstract
By identifying useful knowledge embedded in the behavior of search engines, users can provide valuable information for web searching and data mining. Numerous algorithms have been proposed to find the desired interesting patterns, i.e., frequent patterns, in real-world applications. Most of those studies use frequency to measure the interestingness of patterns. However, each object may have different importance in these real-world applications, and the frequent ones do not usually contain a large portion of the desired patterns. In this paper, we present a novel method, called exploiting highly qualified patterns with frequency and weight occupancy (QFWO), to suggest the possible highly qualified patterns that utilize the idea of co-occurrence and weight occupancy. By considering item weight, weight occupancy, and the frequency of patterns, we design a new type of highly qualified pattern. A novel set-enumeration tree called the frequency-weight (FW)-tree and two compact data structures named weight-list and FW-table are designed to hold the global downward closure property and partial downward closure property of quality and weight occupancy to further prune the search space. The proposed method can exploit highly qualified patterns in a recursive manner without candidate generation. Extensive experiments were conducted on both real-world and synthetic datasets to evaluate the effectiveness and efficiency of the proposed algorithm. Results demonstrate that the obtained patterns are reasonable and acceptable. Moreover, the designed QFWO with several pruning strategies is quite efficient in terms of runtime and search space. © 2017 Springer-Verlag London Ltd.
Keywords
Data mining; Frequency; Weight occupancy; Highly qualified; Weight-list
Language
English
Repository Citation
Gan, W., Lin, J. C., Fournier Viger, P., Chao, H., Zhan, J., Zhang, J. (2017). Exploiting Highly Qualified Pattern with Frequency and Weight Occupancy. Knowledge and Information Systems, 1-32.
http://dx.doi.org/10.1007/s10115-017-1103-8