=================
MLlib (RDD-based)
=================

Classification
--------------

.. currentmodule:: pyspark.mllib.classification

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    LogisticRegressionModel
    LogisticRegressionWithSGD
    LogisticRegressionWithLBFGS
    SVMModel
    SVMWithSGD
    NaiveBayesModel
    NaiveBayes
    StreamingLogisticRegressionWithSGD


Clustering
----------

.. currentmodule:: pyspark.mllib.clustering

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/


    BisectingKMeansModel
    BisectingKMeans
    KMeansModel
    KMeans
    GaussianMixtureModel
    GaussianMixture
    PowerIterationClusteringModel
    PowerIterationClustering
    StreamingKMeans
    StreamingKMeansModel
    LDA
    LDAModel


Evaluation
----------

.. currentmodule:: pyspark.mllib.evaluation

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    BinaryClassificationMetrics
    RegressionMetrics
    MulticlassMetrics
    RankingMetrics


Feature
-------

.. currentmodule:: pyspark.mllib.feature

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    Normalizer
    StandardScalerModel
    StandardScaler
    HashingTF
    IDFModel
    IDF
    Word2Vec
    Word2VecModel
    ChiSqSelector
    ChiSqSelectorModel
    ElementwiseProduct


Frequency Pattern Mining
------------------------

.. currentmodule:: pyspark.mllib.fpm

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    FPGrowth
    FPGrowthModel
    PrefixSpan
    PrefixSpanModel


Vector and Matrix
-----------------

.. currentmodule:: pyspark.mllib.linalg

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    Vector
    DenseVector
    SparseVector
    Vectors
    Matrix
    DenseMatrix
    SparseMatrix
    Matrices
    QRDecomposition


Distributed Representation
--------------------------

.. currentmodule:: pyspark.mllib.linalg.distributed

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    BlockMatrix
    CoordinateMatrix
    DistributedMatrix
    IndexedRow
    IndexedRowMatrix
    MatrixEntry
    RowMatrix
    SingularValueDecomposition


Random
------

.. currentmodule:: pyspark.mllib.random

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    RandomRDDs


Recommendation
--------------

.. currentmodule:: pyspark.mllib.recommendation

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    MatrixFactorizationModel
    ALS
    Rating


Regression
----------

.. currentmodule:: pyspark.mllib.regression

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    LabeledPoint
    LinearModel
    LinearRegressionModel
    LinearRegressionWithSGD
    RidgeRegressionModel
    RidgeRegressionWithSGD
    LassoModel
    LassoWithSGD
    IsotonicRegressionModel
    IsotonicRegression
    StreamingLinearAlgorithm
    StreamingLinearRegressionWithSGD


Statistics
----------

.. currentmodule:: pyspark.mllib.stat

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    Statistics
    MultivariateStatisticalSummary
    ChiSqTestResult
    MultivariateGaussian
    KernelDensity
    KolmogorovSmirnovTestResult


Tree
----

.. currentmodule:: pyspark.mllib.tree

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    DecisionTreeModel
    DecisionTree
    RandomForestModel
    RandomForest
    GradientBoostedTreesModel
    GradientBoostedTrees


Utilities
---------

.. currentmodule:: pyspark.mllib.util

.. autosummary::
    :template: autosummary/class_with_docs.rst
    :toctree: api/

    JavaLoader
    JavaSaveable
    LinearDataGenerator
    Loader
    MLUtils
    Saveable