Packages

  • package root
    Definition Classes
    root
  • package org
    Definition Classes
    root
  • package apache
    Definition Classes
    org
  • package spark
    Definition Classes
    apache
  • package ml

    DataFrame-based machine learning APIs to let users quickly assemble and configure practical machine learning pipelines.

    Definition Classes
    spark
  • package mllib

    RDD-based machine learning APIs (in maintenance mode).

    The spark.mllib package is in maintenance mode as of the Spark 2.0.0 release to encourage migration to the DataFrame-based APIs under the org.apache.spark.ml package. While in maintenance mode,

    • no new features in the RDD-based spark.mllib package will be accepted, unless they block implementing new features in the DataFrame-based spark.ml package;
    • bug fixes in the RDD-based APIs will still be accepted.

    The developers will continue adding features to the DataFrame-based APIs in the 2.x series to reach feature parity with the RDD-based APIs. Once feature parity is reached, this package will be deprecated.

    Definition Classes
    spark
    See also

    SPARK-4591 to track the progress of feature parity

  • package classification
  • package clustering
  • package evaluation
  • package feature
  • package fpm
  • package linalg
  • package optimization
  • package pmml
  • package random
  • package rdd
  • package recommendation
  • package regression
  • package stat
  • package tree

    This package contains the default implementation of the decision tree algorithm, which supports:

    • binary classification,
    • regression,
    • information gain calculation with entropy and Gini impurity for classification and variance for regression,
    • both continuous and categorical features.
  • package util
  • JavaPackage

package mllib

RDD-based machine learning APIs (in maintenance mode).

The spark.mllib package is in maintenance mode as of the Spark 2.0.0 release to encourage migration to the DataFrame-based APIs under the org.apache.spark.ml package. While in maintenance mode,

  • no new features in the RDD-based spark.mllib package will be accepted, unless they block implementing new features in the DataFrame-based spark.ml package;
  • bug fixes in the RDD-based APIs will still be accepted.

The developers will continue adding features to the DataFrame-based APIs in the 2.x series to reach feature parity with the RDD-based APIs. Once feature parity is reached, this package will be deprecated.

See also

SPARK-4591 to track the progress of feature parity
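
As a small illustration of what migrating from spark.mllib to spark.ml can involve, the sketch below converts a vector between the two linalg packages. It assumes Spark 2.0 or later with the spark-mllib artifact on the classpath; the object name VectorMigrationSketch and the helpers toNew and toOld are hypothetical names introduced here, while asML and Vectors.fromML are the conversion methods on the RDD-based vector types. It is not a complete migration recipe.

```scala
import org.apache.spark.mllib.linalg.{Vector => OldVector, Vectors => OldVectors}
import org.apache.spark.ml.linalg.{Vector => NewVector}

// Hypothetical helper object, for illustration only.
object VectorMigrationSketch {
  // Convert an RDD-based (spark.mllib) vector to its DataFrame-based (spark.ml) counterpart.
  def toNew(v: OldVector): NewVector = v.asML

  // Convert a spark.ml vector back to the spark.mllib type.
  def toOld(v: NewVector): OldVector = OldVectors.fromML(v)

  def main(args: Array[String]): Unit = {
    val oldVec = OldVectors.dense(1.0, 0.0, 3.0)
    val newVec = toNew(oldVec)
    println(s"spark.ml vector: $newVec")
    println(s"round trip preserved values: ${toOld(newVec) == oldVec}")
  }
}
```

No SparkSession is needed for this conversion because both vector types are local data structures.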

Linear Supertypes
AnyRef, Any

Package Members

  1. package classification
  2. package clustering
  3. package evaluation
  4. package feature
  5. package fpm
  6. package linalg
  7. package optimization
  8. package pmml
  9. package random
  10. package rdd
  11. package recommendation
  12. package regression
  13. package stat
  14. package tree

    This package contains the default implementation of the decision tree algorithm, which supports:

    • binary classification,
    • regression,
    • information gain calculation with entropy and Gini impurity for classification and variance for regression,
    • both continuous and categorical features.
  15. package util
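
The measures named in the tree package description above (entropy and Gini for classification, variance for regression) can be sketched in plain Scala without any Spark dependency. This is a minimal illustration of the standard textbook definitions, not the package's own implementation: the object ImpuritySketch and all of its method names are hypothetical, and the base-2 logarithm for entropy is an assumption made for this example.

```scala
// Hypothetical standalone sketch; names are not part of the spark.mllib API.
object ImpuritySketch {
  // Gini impurity from per-class counts: 1 - sum over classes of p^2.
  def gini(counts: Seq[Double]): Double = {
    val total = counts.sum
    if (total == 0.0) 0.0
    else 1.0 - counts.map { c => val p = c / total; p * p }.sum
  }

  // Entropy from per-class counts, in bits (base-2 log is an assumption here).
  def entropy(counts: Seq[Double]): Double = {
    val total = counts.sum
    if (total == 0.0) 0.0
    else counts.filter(_ > 0.0).map { c =>
      val p = c / total
      -p * math.log(p) / math.log(2.0)
    }.sum
  }

  // Population variance of regression labels.
  def variance(labels: Seq[Double]): Double = {
    if (labels.isEmpty) 0.0
    else {
      val mean = labels.sum / labels.size
      labels.map(y => (y - mean) * (y - mean)).sum / labels.size
    }
  }

  // Gain of a binary split: parent impurity minus the weighted child impurities.
  def gain(parent: Double, left: Double, leftWeight: Double,
           right: Double, rightWeight: Double): Double = {
    val total = leftWeight + rightWeight
    parent - (leftWeight / total) * left - (rightWeight / total) * right
  }
}
```

For example, a node holding five instances of each of two classes has the maximum Gini impurity for the binary case, and a split that separates the classes perfectly recovers all of it as gain.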

Type Members

  1. class JavaPackage extends AnyRef

    A dummy class as a workaround to show the package doc of spark.mllib in generated Java API docs.

    Annotations
    @AlphaComponent()
    See also

    JDK-4492654
