#include <Apriori_Trie.hpp>
Collaboration diagram for Apriori_Trie:
Public Member Functions | |
Apriori_Trie (const unsigned long counter_of_emptyset) | |
void | insert_frequent_items (const vector< unsigned long > &counters) |
Insert the frequent items and their counters into the trie;. | |
void | candidate_generation (const itemtype &frequent_size, Input_Output_Manager &input_output_manager) |
Generates candidates. | |
void | find_candidate (const vector< itemtype > &basket, const itemtype candidate_size, const unsigned long counter=1) |
Increases the counter of those candidates that are contained by the given basket. | |
void | delete_infrequent (const unsigned long min_occurrence, const itemtype candidate_size) |
Deletes unfrequent itemsets. | |
bool | is_there_any_candidate () const |
void | statistics () const |
~Apriori_Trie () | |
Protected Member Functions | |
bool | is_all_subset_frequent (const set< itemtype > &maybe_candidate) const |
Decides if all subset of an itemset is contained in the Apriori_Trie. | |
void | candidate_generation_two () |
Generates candidate of size two. | |
void | candidate_generation_assist (Trie *Trie, const itemtype distance_from_generator, set< itemtype > &maybe_candidate, Input_Output_Manager &input_output_manager) |
Generates candidate of size more than two. | |
void | find_candidate_two (const vector< itemtype > &basket, const unsigned long counter=1) |
Increases the counter for those itempairs that are in the given basket. | |
void | delete_infrequent_two (const unsigned long min_occurrence) |
Deletes the Tries that represent infrequent itemsets of size 2. | |
Protected Attributes | |
Trie | main_trie |
Trie to store the candidates and the frequent itemsets. | |
vector< vector< unsigned long > > | temp_counter_array |
temp_counter_array stores the occurences of the itempairs | |
vector< unsigned long > | nr_of_freq_itemsets |
Apriori_Trie is a special tree designed to provide efficient methods for the apriori algorithm. It mostly uses a regular trie except when there exist faster solution. For example for storing one and two itemset candidate where a simple vector and array gives better performance. Apriori_Trie extends the functions provided by the regular trie with a candidate generation process.
Definition at line 33 of file Apriori_Trie.hpp.
|
Definition at line 19 of file Apriori_Trie.cpp. References nr_of_freq_itemsets. |
|
Definition at line 106 of file Apriori_Trie.cpp. |
|
Generates candidates.
Definition at line 42 of file Apriori_Trie.cpp. References candidate_generation_assist(), candidate_generation_two(), itemtype, and main_trie. Referenced by Apriori::APRIORI_alg(). |
|
Generates candidate of size more than two.
Definition at line 153 of file Apriori_Trie.cpp. References Trie::add_empty_state(), Trie::counter, Trie::edgevector, is_all_subset_frequent(), itemtype, and Input_Output_Manager::write_out_basket_and_counter(). Referenced by candidate_generation(). |
|
Generates candidate of size two.
Definition at line 136 of file Apriori_Trie.cpp. References Trie::edgevector, main_trie, and temp_counter_array. Referenced by candidate_generation(). |
|
Deletes unfrequent itemsets.
Definition at line 79 of file Apriori_Trie.cpp. References Trie::delete_infrequent(), delete_infrequent_two(), itemtype, main_trie, and nr_of_freq_itemsets. Referenced by Apriori::APRIORI_alg(). |
|
Deletes the Tries that represent infrequent itemsets of size 2.
temp_counter_array[stateIndex_1] will never be used again! temp_counter_array will never be used again! Definition at line 236 of file Apriori_Trie.cpp. References Trie::edgevector, main_trie, nr_of_freq_itemsets, and temp_counter_array. Referenced by delete_infrequent(). |
|
Increases the counter of those candidates that are contained by the given basket.
Definition at line 62 of file Apriori_Trie.cpp. References Trie::find_candidate(), find_candidate_two(), itemtype, and main_trie. Referenced by Apriori::support(). |
|
Increases the counter for those itempairs that are in the given basket.
Definition at line 216 of file Apriori_Trie.cpp. References temp_counter_array. Referenced by find_candidate(). |
|
Insert the frequent items and their counters into the trie;.
Definition at line 29 of file Apriori_Trie.cpp. References Trie::add_empty_state(), main_trie, and nr_of_freq_itemsets. Referenced by Apriori::APRIORI_alg(). |
|
Decides if all subset of an itemset is contained in the Apriori_Trie.
Definition at line 114 of file Apriori_Trie.cpp. References Trie::is_included(), and main_trie. Referenced by candidate_generation_assist(). |
|
Definition at line 92 of file Apriori_Trie.cpp. References Trie::edgevector, and main_trie. Referenced by Apriori::APRIORI_alg(). |
|
Definition at line 97 of file Apriori_Trie.cpp. References nr_of_freq_itemsets. Referenced by Apriori::APRIORI_alg(). |
|
Trie to store the candidates and the frequent itemsets.
Definition at line 89 of file Apriori_Trie.hpp. Referenced by candidate_generation(), candidate_generation_two(), delete_infrequent(), delete_infrequent_two(), find_candidate(), insert_frequent_items(), is_all_subset_frequent(), and is_there_any_candidate(). |
|
Definition at line 98 of file Apriori_Trie.hpp. Referenced by Apriori_Trie(), delete_infrequent(), delete_infrequent_two(), insert_frequent_items(), and statistics(). |
|
temp_counter_array stores the occurences of the itempairs We can use a simple array to determine the support of itemset of size two. This requires less memory than the trie-based supportcount. temp_counter_array[i][j-i] stores the occurence of the itempair (i,j). Definition at line 97 of file Apriori_Trie.hpp. Referenced by candidate_generation_two(), delete_infrequent_two(), and find_candidate_two(). |