public class FPGrowth extends Object implements Logging, scala.Serializable
param: minSupport the minimal support level of the frequent pattern, any pattern that appears more than (minSupport * size-of-the-dataset) times will be output param: numPartitions number of partitions used by parallel FP-growth
| Modifier and Type | Class and Description |
|---|---|
static class |
FPGrowth.FreqItemset<Item>
Frequent itemset.
|
| Constructor and Description |
|---|
FPGrowth()
Constructs a default instance with default parameters {minSupport:
0.3, numPartitions: same
as the input data}. |
| Modifier and Type | Method and Description |
|---|---|
<Item,Basket extends Iterable<Item>> |
run(JavaRDD<Basket> data)
Java-friendly version of
run. |
<Item> FPGrowthModel<Item> |
run(RDD<Object> data,
scala.reflect.ClassTag<Item> evidence$4)
Computes an FP-Growth model that contains frequent itemsets.
|
FPGrowth |
setMinSupport(double minSupport)
Sets the minimal support level (default:
0.3). |
FPGrowth |
setNumPartitions(int numPartitions)
Sets the number of partitions used by parallel FP-growth (default: same as input data).
|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitinitializeLogging, initializeLogIfNecessary, initializeLogIfNecessary, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarningpublic FPGrowth()
0.3, numPartitions: same
as the input data}.
public FPGrowth setMinSupport(double minSupport)
0.3).
minSupport - (undocumented)public FPGrowth setNumPartitions(int numPartitions)
numPartitions - (undocumented)public <Item> FPGrowthModel<Item> run(RDD<Object> data, scala.reflect.ClassTag<Item> evidence$4)
data - input data set, each element contains a transactionevidence$4 - (undocumented)FPGrowthModel
public <Item,Basket extends Iterable<Item>> FPGrowthModel<Item> run(JavaRDD<Basket> data)
run.data - (undocumented)