pyspark.SparkContext.setCheckpointDir#
- SparkContext.setCheckpointDir(dirName)[source]#
- Set the directory under which RDDs are going to be checkpointed. The directory must be an HDFS path if running on a cluster. - New in version 0.7.0. - Parameters
- dirNamestr
- path to the directory where checkpoint files will be stored (must be HDFS path if running in cluster)