Questions tagged [association-rules]

106 questions
22 votes, 3 answers

What is the practical difference between association rules and decision trees in data mining?

Is there a really simple description of the practical differences between these two techniques? Both seem to be used for supervised learning (though association rules can also handle unsupervised learning). Both can be used for prediction. The closest I've…
Tumbledown
11 votes, 2 answers

Maximal & closed frequent -- Answer Included

My dataset: 1: {A, B, C, E}; 2: {A, C, D, E}; 3: {B, C, E}; 4: {A, C, D, E}; 5: {C, D, E}; 6: {A, D, E}. I want to find out the maximal frequent item sets and the closed frequent item sets. Frequent item set $X \in F$ is…
Mike John
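For readers following along, here is a minimal brute-force sketch in Python that enumerates the frequent itemsets of the six transactions in the excerpt above and then filters them into maximal and closed sets. The excerpt is cut off before it states a minimum support, so `MIN_COUNT = 3` is purely an assumption for illustration, and the function names are hypothetical. An itemset is treated as maximal if it has no frequent proper superset, and as closed if no proper superset has the same support count.

```python
from itertools import combinations

# The six transactions from the question excerpt.
transactions = [
    {"A", "B", "C", "E"},
    {"A", "C", "D", "E"},
    {"B", "C", "E"},
    {"A", "C", "D", "E"},
    {"C", "D", "E"},
    {"A", "D", "E"},
]

# Assumption: the excerpt does not give a threshold, so 3 is a made-up value.
MIN_COUNT = 3


def frequent_itemsets(transactions, min_count):
    """Brute force: count every possible itemset and keep the frequent ones."""
    items = sorted(set().union(*transactions))
    frequent = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            candidate = frozenset(combo)
            count = sum(candidate <= t for t in transactions)
            if count >= min_count:
                frequent[candidate] = count
    return frequent


def maximal(frequent):
    """Frequent itemsets with no frequent proper superset."""
    return {x for x in frequent if not any(x < y for y in frequent)}


def closed(frequent):
    """Frequent itemsets with no proper superset of equal support.

    Checking only frequent supersets is enough: a superset with the same
    support as a frequent itemset is itself frequent.
    """
    return {
        x
        for x, count in frequent.items()
        if not any(x < y and frequent[y] == count for y in frequent)
    }


freq = frequent_itemsets(transactions, MIN_COUNT)
print("maximal:", sorted("".join(sorted(s)) for s in maximal(freq)))
print("closed: ", sorted("".join(sorted(s)) for s in closed(freq)))
```

Every maximal itemset is also closed, so the second list always contains the first. The exhaustive enumeration is only practical for a toy dataset like this one; real data would call for Apriori, Eclat, or FP-growth.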
11 votes, 1 answer

Finding suitable rules for new data using arules

I am using R (and the arules package) to mine transactions for association rules. What I wish to do is construct the rules and then apply them to new data. For example, say I have many rules, one of which is the canonical {Beer=YES} ->…
B_Miner
10 votes, 3 answers

What are the statistical methods I can use to find popular or common combinations of categorical variables?

I am doing a study on polydrug use. I have a data set of 400 drug addicts, who each stated the drugs that they abuse. There are more than 10 drugs and hence a large number of possible combinations. I have recoded most of the drugs that they consume into…
7 votes, 1 answer

Association rules - support, confidence and lift

I am trying to mine association rules from my transaction dataset and I have questions regarding the support, confidence and lift of a rule. Assume we have a rule like {X} -> {Y}. I know that support is P(XY), confidence is P(XY)/P(X) and lift is…
BigData
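As a rough illustration of the quantities named in the excerpt above, the sketch below computes support as P(XY) and confidence as P(XY)/P(X) for a rule {X} -> {Y}. The excerpt is truncated before it defines lift, so the code falls back on the commonly used definition P(XY)/(P(X)P(Y)); treat that as an assumption rather than the asker's wording. The baskets and the function name `rule_metrics` are made up for the example.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    Probabilities are estimated as relative frequencies over the transactions.
    Assumes the antecedent and the consequent each occur at least once.
    """
    n = len(transactions)
    x, y = set(antecedent), set(consequent)
    p_x = sum(x <= t for t in transactions) / n
    p_y = sum(y <= t for t in transactions) / n
    p_xy = sum((x | y) <= t for t in transactions) / n

    support = p_xy                 # P(XY), as in the excerpt
    confidence = p_xy / p_x        # P(XY) / P(X), as in the excerpt
    lift = confidence / p_y        # assumed standard form: P(XY) / (P(X) P(Y))
    return support, confidence, lift


# Hypothetical baskets, invented purely for illustration.
baskets = [
    {"beer", "diapers"},
    {"beer", "diapers", "chips"},
    {"beer"},
    {"diapers"},
    {"chips"},
]

support, confidence, lift = rule_metrics(baskets, {"beer"}, {"diapers"})
print(f"support={support:.3f} confidence={confidence:.3f} lift={lift:.3f}")
```

Under this definition, a lift above 1 means X and Y co-occur more often than they would if they were independent, and a lift below 1 means they co-occur less often.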
6 votes, 4 answers

What is the difference between Apriori and Eclat algorithms?

What is the difference between Apriori and Eclat algorithms in association rule mining?
Kavipriya
5 votes, 1 answer

How is the support calculated using hash trees for Apriori algorithm?

I started studying association rules, and especially the Apriori algorithm, through this free chapter. I understood most of the points relating to this algorithm except the one on how to build the hash tree in order to optimize support…
ChiPlusPlus
5 votes, 3 answers

Best algorithm for association rule mining

I am working on an application where I have to extract or identify associations / correlations between different sets of items. For example, if a person buys shoes at a store, would he/she also buy socks? So, I will have to find the…
Jason
5 votes, 3 answers

Finding association rules / frequent Itemsets - what are the application restrictions

What are the restrictions on the fields of application when searching for association rules (finding frequent itemsets)? All examples I came across cover the topic of 'true' basket analysis in the sense of using a list of products which a sample of customers…
Marta Karas
4 votes, 1 answer

Estimating the influence of different features on the outcome

I have been trying, with no luck, to find the correct algorithm for the following 2 scenarios and I can't seem to get it right. First scenario: Every day I get data like the following: +---------+----------+----------+----------+----------+---------+ |…
Yak
4 votes, 1 answer

How do I express this hypothesis using symbols?

I'm just making this up to understand the rules... Claim: People wearing roller blades can get to places in less time than those who don't. (Assuming on foot) Now, I'd like to express this using symbols (that's what I call them, unless there's…
itsols
4 votes, 2 answers

Which statistical models are suitable for prediction with clickstream data?

I'm a Statistics student, and I'm thinking of writing my master's thesis on clickstream data analysis. For my analysis I have a pretty big dataset (80 million rows), each row being a click "impression". The dataset is from a news website and…
4 votes, 1 answer

For association rules, does it make sense to plot a learning curve using the lift measure?

As I understand, the learning curve is normally plotted as accuracy vs. sample size. However, accuracy takes true negatives into account. When doing association rule mining, true negatives aren't very significant because there are usually a lot of…
user98500
4 votes, 1 answer

Evaluating Association Rules Using Kulczynski and Imbalance Ratio

I have a dataset containing information about movies and their genres. From the dataset I have generated association rules from the frequent itemsets that I have mined using the Apriori algorithm. From that I have found some interesting association…
Simon Langhoff
3 votes, 1 answer

Market Basket Analysis: comparing rules between two models

Given two independent MBA models 1 and 2 (each model is a set of rules with calculated support, confidence and lift metrics) that were generated on subsets of a large population of transactions, how can I effectively compare rules between the models? In…
topchef