map(func): apply a function on all functionsfilter(func): filter based on functionflatMap(func): flatten returned lists into one giant listunion(rdd): create a union of multiple RDD0subtract(rdd): subtract RDDscartesian(rdd): cartesian product of rddparallelize(list): make an RDD from list
Special transformations for Pair RDDs
reduceByKey(func): key thingsgroupByKey(func): key thingssortByKey(func): key things
See also Database “Join”
