|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
Object org.apache.spark.mllib.fpm.FPGrowth
public class FPGrowth
:: Experimental ::
A parallel FP-growth algorithm to mine frequent itemsets. The algorithm is described in
Li et al., PFP: Parallel FP-Growth for Query
Recommendation
. PFP distributes computation in such a way that each worker executes an
independent group of mining tasks. The FP-Growth algorithm is described in
Han et al., Mining frequent patterns without candidate
generation
.
param: minSupport the minimal support level of the frequent pattern, any pattern appears more than (minSupport * size-of-the-dataset) times will be output param: numPartitions number of partitions used by parallel FP-growth
http://en.wikipedia.org/wiki/Association_rule_learning Association rule learning
(Wikipedia)}
,
Serialized FormNested Class Summary | |
---|---|
static class |
FPGrowth.FreqItemset<Item>
Frequent itemset. |
Constructor Summary | |
---|---|
FPGrowth()
Constructs a default instance with default parameters {minSupport: 0.3 , numPartitions: same
as the input data}. |
Method Summary | ||
---|---|---|
|
run(JavaRDD<Basket> data)
|
|
|
run(RDD<Object> data,
scala.reflect.ClassTag<Item> evidence$2)
Computes an FP-Growth model that contains frequent itemsets. |
|
FPGrowth |
setMinSupport(double minSupport)
Sets the minimal support level (default: 0.3 ). |
|
FPGrowth |
setNumPartitions(int numPartitions)
Sets the number of partitions used by parallel FP-growth (default: same as input data). |
Methods inherited from class Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface org.apache.spark.Logging |
---|
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning |
Constructor Detail |
---|
public FPGrowth()
0.3
, numPartitions: same
as the input data}.
Method Detail |
---|
public FPGrowth setMinSupport(double minSupport)
0.3
).
minSupport
- (undocumented)
public FPGrowth setNumPartitions(int numPartitions)
numPartitions
- (undocumented)
public <Item> FPGrowthModel<Item> run(RDD<Object> data, scala.reflect.ClassTag<Item> evidence$2)
data
- input data set, each element contains a transactionevidence$2
- (undocumented)
FPGrowthModel
public <Item,Basket extends Iterable<Item>> FPGrowthModel<Item> run(JavaRDD<Basket> data)
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |