public class LogisticRegressionWithSGD extends GeneralizedLinearAlgorithm<LogisticRegressionModel> implements scala.Serializable
LogisticRegressionWithSGD.optimizer
.
NOTE: Labels used in Logistic Regression should be {0, 1, ..., k - 1}
for k classes multi-label classification problem.
Using LogisticRegressionWithLBFGS
is recommended over this.Constructor and Description |
---|
LogisticRegressionWithSGD()
Deprecated.
Use ml.classification.LogisticRegression or LogisticRegressionWithLBFGS. Since 2.0.0.
|
Modifier and Type | Method and Description |
---|---|
static int |
getNumFeatures() |
static boolean |
isAddIntercept() |
GradientDescent |
optimizer()
The optimizer to solve the problem.
|
static M |
run(RDD<LabeledPoint> input) |
static M |
run(RDD<LabeledPoint> input,
Vector initialWeights) |
static GeneralizedLinearAlgorithm<M> |
setIntercept(boolean addIntercept) |
static GeneralizedLinearAlgorithm<M> |
setValidateData(boolean validateData) |
static LogisticRegressionModel |
train(RDD<LabeledPoint> input,
int numIterations)
Train a logistic regression model given an RDD of (label, features) pairs.
|
static LogisticRegressionModel |
train(RDD<LabeledPoint> input,
int numIterations,
double stepSize)
Train a logistic regression model given an RDD of (label, features) pairs.
|
static LogisticRegressionModel |
train(RDD<LabeledPoint> input,
int numIterations,
double stepSize,
double miniBatchFraction)
Train a logistic regression model given an RDD of (label, features) pairs.
|
static LogisticRegressionModel |
train(RDD<LabeledPoint> input,
int numIterations,
double stepSize,
double miniBatchFraction,
Vector initialWeights)
Train a logistic regression model given an RDD of (label, features) pairs.
|
getNumFeatures, isAddIntercept, run, run, setIntercept, setValidateData
public LogisticRegressionWithSGD()
public static LogisticRegressionModel train(RDD<LabeledPoint> input, int numIterations, double stepSize, double miniBatchFraction, Vector initialWeights)
miniBatchFraction
fraction of the data to calculate the gradient. The weights used in
gradient descent are initialized using the initial weights provided.
NOTE: Labels used in Logistic Regression should be {0, 1}
input
- RDD of (label, array of features) pairs.numIterations
- Number of iterations of gradient descent to run.stepSize
- Step size to be used for each iteration of gradient descent.miniBatchFraction
- Fraction of data to be used per iteration.initialWeights
- Initial set of weights to be used. Array should be equal in size to
the number of features in the data.public static LogisticRegressionModel train(RDD<LabeledPoint> input, int numIterations, double stepSize, double miniBatchFraction)
miniBatchFraction
fraction of the data to calculate the gradient.
NOTE: Labels used in Logistic Regression should be {0, 1}
input
- RDD of (label, array of features) pairs.numIterations
- Number of iterations of gradient descent to run.stepSize
- Step size to be used for each iteration of gradient descent.
miniBatchFraction
- Fraction of data to be used per iteration.public static LogisticRegressionModel train(RDD<LabeledPoint> input, int numIterations, double stepSize)
input
- RDD of (label, array of features) pairs.stepSize
- Step size to be used for each iteration of Gradient Descent.
numIterations
- Number of iterations of gradient descent to run.public static LogisticRegressionModel train(RDD<LabeledPoint> input, int numIterations)
input
- RDD of (label, array of features) pairs.numIterations
- Number of iterations of gradient descent to run.public static int getNumFeatures()
public static boolean isAddIntercept()
public static GeneralizedLinearAlgorithm<M> setIntercept(boolean addIntercept)
public static GeneralizedLinearAlgorithm<M> setValidateData(boolean validateData)
public static M run(RDD<LabeledPoint> input)
public static M run(RDD<LabeledPoint> input, Vector initialWeights)
public GradientDescent optimizer()
GeneralizedLinearAlgorithm
optimizer
in class GeneralizedLinearAlgorithm<LogisticRegressionModel>