| Interface | Description |
|---|---|
| LDAOptimizer |
:: DeveloperApi ::
|
| Class | Description |
|---|---|
| BisectingKMeans |
A bisecting k-means algorithm based on the paper "A comparison of document clustering techniques"
by Steinbach, Karypis, and Kumar, with modification to fit Spark.
|
| BisectingKMeansModel |
Clustering model produced by
BisectingKMeans. |
| BisectingKMeansModel.SaveLoadV1_0$ | |
| BisectingKMeansModel.SaveLoadV2_0$ | |
| DistributedLDAModel |
Distributed LDA model.
|
| EMLDAOptimizer |
:: DeveloperApi ::
|
| ExpectationSum | |
| GaussianMixture |
This class performs expectation maximization for multivariate Gaussian
Mixture Models (GMMs).
|
| GaussianMixtureModel |
Multivariate Gaussian Mixture Model (GMM) consisting of k Gaussians, where points
are drawn from each Gaussian i=1..k with probability w(i); mu(i) and sigma(i) are
the respective mean and covariance for each Gaussian distribution i=1..k.
|
| KMeans |
K-means clustering with a k-means++ like initialization mode
(the k-means|| algorithm by Bahmani et al).
|
| KMeansModel |
A clustering model for K-means.
|
| KMeansModel.SaveLoadV1_0$ | |
| KMeansModel.SaveLoadV2_0$ | |
| LDA |
Latent Dirichlet Allocation (LDA), a topic model designed for text documents.
|
| LDAModel |
Latent Dirichlet Allocation (LDA) model.
|
| LDAUtils |
Utility methods for LDA.
|
| LocalKMeans |
An utility object to run K-means locally.
|
| LocalLDAModel |
Local LDA model.
|
| OnlineLDAOptimizer |
:: DeveloperApi ::
|
| PowerIterationClustering |
Power Iteration Clustering (PIC), a scalable graph clustering algorithm developed by
Lin and Cohen.
|
| PowerIterationClustering.Assignment |
Cluster assignment.
|
| PowerIterationClustering.Assignment$ | |
| PowerIterationClusteringModel |
Model produced by
PowerIterationClustering. |
| PowerIterationClusteringModel.SaveLoadV1_0$ | |
| StreamingKMeans |
StreamingKMeans provides methods for configuring a
streaming k-means analysis, training the model on streaming,
and using the model to make predictions on streaming data.
|
| StreamingKMeansModel |
StreamingKMeansModel extends MLlib's KMeansModel for streaming
algorithms, so it can keep track of a continuously updated weight
associated with each cluster, and also update the model by
doing a single iteration of the standard k-means algorithm.
|