map(func)
: apply a function on all functionsfilter(func)
: filter based on functionflatMap(func)
: flatten returned lists into one giant listunion(rdd)
: create a union of multiple RDD0subtract(rdd)
: subtract RDDscartesian(rdd)
: cartesian product of rddparallelize(list)
: make an RDD from list
Special transformations for Pair RDDs
reduceByKey(func)
: key thingsgroupByKey(func)
: key thingssortByKey(func)
: key things
See also Database “Join”