checkpoint {SparkR}    R Documentation

Checkpoint an RDD

Description

Mark this RDD for checkpointing. The RDD will be saved to a file inside the checkpoint directory set with setCheckpointDir(), and all references to its parent RDDs will be removed. This function must be called before any job has been executed on this RDD. It is strongly recommended that the RDD be persisted in memory first; otherwise, saving it to a file will require recomputing it.

Usage

checkpoint(rdd)

## S4 method for signature 'RDD'
checkpoint(rdd)

Arguments

rdd

The RDD to checkpoint

Examples

## Not run: 
sc <- sparkR.init()
# Directory where checkpointed RDDs are written
setCheckpointDir(sc, "checkpoints")
rdd <- parallelize(sc, 1:10, 2L)
# Mark the RDD for checkpointing before running any job on it
checkpoint(rdd)
## End(Not run)
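
The ordering recommended in the Description (persist first, then mark for checkpointing, then run a job) might look like the sketch below. It assumes that cache() and count() are available for RDDs in this version of SparkR, since neither is documented on this page. It also assumes that, as in Spark's core API, the checkpoint file is written when the first job on the RDD runs. It needs a working Spark installation and is not a definitive recipe.

```r
library(SparkR)

sc <- sparkR.init()
setCheckpointDir(sc, "checkpoints")

rdd <- parallelize(sc, 1:10, 2L)

# Assumption: cache() persists the RDD in memory, so that writing the
# checkpoint does not have to recompute it.
rdd <- cache(rdd)

# Mark for checkpointing before any job has run on this RDD.
checkpoint(rdd)

# Assumption: count() is an action, so this is the first job on the RDD
# and the point at which the checkpoint would be written.
n <- count(rdd)
```

Calling checkpoint() after count() would break the rule that the function must be called before any job has been executed on the RDD.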

[Package SparkR version 0.1 Index]