Parquet to Spark Deprecated

Creates a Spark DataFrame/RDD from given parquet file.

ORC to Spark Deprecated

Creates a Spark DataFrame/RDD from given ORC file.

Spark to Parquet Deprecated

Converts an incoming Spark DataFrame/RDD into a parquet file

Spark to Text Deprecated

Writes a Spark DataFrame/RDD into a text file

PCA 

Principal component analysis

PCA Deprecated

Principal component analysis

Document vector Deprecated

Creates a document vector for each document.

Document Vector 

Creates a document vector for each document.