GaussianMixtureSummary#
- class pyspark.ml.clustering.GaussianMixtureSummary(java_obj=None)[source]#
- Gaussian mixture clustering results for a given model. - New in version 2.1.0. - Attributes - DataFrame of predicted cluster centers for each training data point. - Size of (number of data points in) each cluster. - Name for column of features in predictions. - The number of clusters the model was trained with. - Total log-likelihood for this model on the given data. - Number of iterations. - Name for column of predicted clusters in predictions. - DataFrame produced by the model's transform method. - DataFrame of probabilities of each cluster for each training data point. - Name for column of predicted probability of each cluster in predictions. - Attributes Documentation - cluster#
- DataFrame of predicted cluster centers for each training data point. - New in version 2.1.0. 
 - clusterSizes#
- Size of (number of data points in) each cluster. - New in version 2.1.0. 
 - featuresCol#
- Name for column of features in predictions. - New in version 2.1.0. 
 - k#
- The number of clusters the model was trained with. - New in version 2.1.0. 
 - logLikelihood#
- Total log-likelihood for this model on the given data. - New in version 2.2.0. 
 - numIter#
- Number of iterations. - New in version 2.4.0. 
 - predictionCol#
- Name for column of predicted clusters in predictions. - New in version 2.1.0. 
 - predictions#
- DataFrame produced by the model’s transform method. - New in version 2.1.0. 
 - probability#
- DataFrame of probabilities of each cluster for each training data point. - New in version 2.1.0. 
 - probabilityCol#
- Name for column of predicted probability of each cluster in predictions. - New in version 2.1.0.