Parquet to Spark Deprecated

Creates a Spark DataFrame/RDD from a given Parquet file.

Spark to Avro Deprecated

Converts an incoming Spark DataFrame/RDD into an Avro file.

ORC to Spark Deprecated

Creates a Spark DataFrame/RDD from a given ORC file.

Spark to Text Deprecated

Writes a Spark DataFrame/RDD into a text file.

PCA Deprecated

Performs a principal component analysis (PCA) of the input data.

PCA 

Performs a principal component analysis (PCA) of the input data.
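
As a rough illustration of what a principal component analysis computes, the following Python sketch handles the two-dimensional case in closed form. It is not the node's implementation; the function name pca_2d and the sample data are assumptions made for this example only.

```python
import math

def pca_2d(points):
    """Return ((lam1, lam2), axis) for a list of (x, y) points.

    lam1 >= lam2 are the eigenvalues of the sample covariance matrix,
    and axis is a unit vector along the first principal component.
    Hypothetical helper for illustration only.
    """
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n

    # Entries of the sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum((p[0] - mean_x) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - mean_y) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points) / (n - 1)

    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    trace = sxx + syy
    spread = math.hypot(sxx - syy, 2 * sxy)
    lam1 = (trace + spread) / 2
    lam2 = (trace - spread) / 2

    # Angle of the eigenvector that belongs to the larger eigenvalue.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    axis = (math.cos(theta), math.sin(theta))
    return (lam1, lam2), axis

# Sample points lying exactly on the line y = 2x (made-up data).
eigenvalues, axis = pca_2d([(0, 0), (1, 2), (2, 4), (3, 6)])
```

Because the sample points lie on a single line, all variance falls on the first component, and the second eigenvalue is zero up to rounding.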

Document vector Deprecated

Creates a document vector for each document.

Document Vector 

Creates a document vector for each document.
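
To make the idea of a document vector concrete, here is a minimal Python sketch that builds one term-count vector per document over a shared vocabulary. It is an assumption about the general technique, not the node's actual behavior; the function name document_vectors, the whitespace tokenization, and the sample documents are all made up for illustration.

```python
from collections import Counter

def document_vectors(documents):
    """Return (vocabulary, vectors) for a list of document strings.

    Each vector has one entry per vocabulary term, holding how often
    that term occurs in the document. Hypothetical helper for
    illustration only.
    """
    # Naive tokenization: lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in documents]

    # Shared, sorted vocabulary so every vector uses the same columns.
    vocabulary = sorted({term for tokens in tokenized for term in tokens})

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append([counts[term] for term in vocabulary])
    return vocabulary, vectors

# Two made-up documents.
vocabulary, vectors = document_vectors(
    ["spark reads parquet", "spark writes text text"]
)
```

Replacing each count with 1 or 0 would give a bit vector instead of a frequency vector.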