Spark.ML Module

From GM-RKB

Latest revision as of 17:07, 1 June 2024

A Spark.ML Module is an Apache Spark module that provides a DataFrame-based machine learning framework (exposed to Python as the pyspark.ml package).



References

2017

ML Pipeline APIs
       Transformer
       Estimator
       Model
       Pipeline
       PipelineModel
pyspark.ml.param module
       Param
       Params
       TypeConverters
pyspark.ml.feature module
       Binarizer
       BucketedRandomProjectionLSH
       BucketedRandomProjectionLSHModel
       Bucketizer
       ChiSqSelector
       ChiSqSelectorModel
       CountVectorizer
       CountVectorizerModel
       DCT
       ElementwiseProduct
       HashingTF
       IDF
       IDFModel
       Imputer
       ImputerModel
       IndexToString
       MaxAbsScaler
       MaxAbsScalerModel
       MinHashLSH
       MinHashLSHModel
       MinMaxScaler
       MinMaxScalerModel
       NGram
       Normalizer
       OneHotEncoder
       PCA
       PCAModel
       PolynomialExpansion
       QuantileDiscretizer
       RegexTokenizer
       RFormula
       RFormulaModel
       SQLTransformer
       StandardScaler
       StandardScalerModel
       StopWordsRemover
       StringIndexer
       StringIndexerModel
       Tokenizer
       VectorAssembler
       VectorIndexer
       VectorIndexerModel
       VectorSlicer
       Word2Vec
       Word2VecModel
pyspark.ml.classification module
       LinearSVC
       LinearSVCModel
       LogisticRegression
       LogisticRegressionModel
       LogisticRegressionSummary
       LogisticRegressionTrainingSummary
       BinaryLogisticRegressionSummary
       BinaryLogisticRegressionTrainingSummary
       DecisionTreeClassifier
       DecisionTreeClassificationModel
       GBTClassifier
       GBTClassificationModel
       RandomForestClassifier
       RandomForestClassificationModel
       NaiveBayes
       NaiveBayesModel
       MultilayerPerceptronClassifier
       MultilayerPerceptronClassificationModel
       OneVsRest
       OneVsRestModel
pyspark.ml.clustering module
       BisectingKMeans
       BisectingKMeansModel
       BisectingKMeansSummary
       KMeans
       KMeansModel
       GaussianMixture
       GaussianMixtureModel
       GaussianMixtureSummary
       LDA
       LDAModel
       LocalLDAModel
       DistributedLDAModel
pyspark.ml.linalg module
       Vector
       DenseVector
       SparseVector
       Vectors
       Matrix
       DenseMatrix
       SparseMatrix
       Matrices
pyspark.ml.recommendation module
       ALS
       ALSModel
pyspark.ml.regression module
       AFTSurvivalRegression
       AFTSurvivalRegressionModel
       DecisionTreeRegressor
       DecisionTreeRegressionModel
       GBTRegressor
       GBTRegressionModel
       GeneralizedLinearRegression
       GeneralizedLinearRegressionModel
       GeneralizedLinearRegressionSummary
       GeneralizedLinearRegressionTrainingSummary
       IsotonicRegression
       IsotonicRegressionModel
       LinearRegression
       LinearRegressionModel
       LinearRegressionSummary
       LinearRegressionTrainingSummary
       RandomForestRegressor
       RandomForestRegressionModel
pyspark.ml.stat module
       ChiSquareTest
       Correlation
pyspark.ml.tuning module
       ParamGridBuilder
       CrossValidator
       CrossValidatorModel
       TrainValidationSplit
       TrainValidationSplitModel
pyspark.ml.evaluation module
       Evaluator
       BinaryClassificationEvaluator
       RegressionEvaluator
       MulticlassClassificationEvaluator
pyspark.ml.fpm module
       FPGrowth
       FPGrowthModel
pyspark.ml.util module
       Identifiable
       JavaMLReadable
       JavaMLReader
       JavaMLWritable
       JavaMLWriter
       JavaPredictionModel
       MLReadable
       MLReader
       MLWritable
       MLWriter