public final class RDDDataSource extends DataSource implements Serializable
Nested classes/interfaces inherited from class org.apache.mrql.DataSource: org.apache.mrql.DataSource.DataSourceDirectory, org.apache.mrql.DataSource.ParserDirectory
Modifier and Type | Field and Description |
---|---|
org.apache.spark.api.java.JavaRDD<MRData> | rdd |
Fields inherited from class DataSource: dataSourceDirectory, inputFormat, parserDirectory, path, separator, source_num, to_be_merged, tuple_container
Constructor and Description |
---|
RDDDataSource(org.apache.spark.api.java.JavaRDD<MRData> rdd) |
Modifier and Type | Method and Description |
---|---|
MRData | reduce(MRData zero, Function acc): accumulate all dataset values |
long | size(org.apache.hadoop.conf.Configuration conf) |
List<MRData> | take(int num): return the first num values |
, get, getCached, loadParsers, read
public org.apache.spark.api.java.JavaRDD<MRData> rdd
RDDDataSource(org.apache.spark.api.java.JavaRDD<MRData> rdd)
public long size(org.apache.hadoop.conf.Configuration conf)
Overrides: size in class DataSource
public List<MRData> take(int num)
Overrides: take in class DataSource
public MRData reduce(MRData zero, Function acc)
Overrides: reduce in class DataSource
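The one-line descriptions of take ("return the first num values") and reduce ("accumulate all dataset values") suggest prefix and fold semantics. The sketch below illustrates that reading only; it does not use Spark or MRQL. The class `ListDataSourceSketch` is hypothetical, a plain `List<T>` stands in for `JavaRDD<MRData>`, and `java.util.function.BinaryOperator` stands in for MRQL's `Function`, whose exact calling convention is not stated on this page. The left-to-right order of accumulation is also an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BinaryOperator;

// Hypothetical stand-in for RDDDataSource, NOT the MRQL implementation:
// a List<T> replaces JavaRDD<MRData>, BinaryOperator<T> replaces MRQL's Function.
final class ListDataSourceSketch<T> {
    private final List<T> values;

    ListDataSourceSketch(List<T> values) {
        // Defensive copy so later changes to the caller's list are not visible here.
        this.values = new ArrayList<>(values);
    }

    // Number of values in the dataset (the real size() takes a Hadoop Configuration).
    long size() {
        return values.size();
    }

    // Return the first num values; num is clamped to the range [0, size].
    List<T> take(int num) {
        int n = Math.max(0, Math.min(num, values.size()));
        return new ArrayList<>(values.subList(0, n));
    }

    // Accumulate all values, starting from zero and applying acc left to right (assumed order).
    T reduce(T zero, BinaryOperator<T> acc) {
        T result = zero;
        for (T v : values) {
            result = acc.apply(result, v);
        }
        return result;
    }

    public static void main(String[] args) {
        ListDataSourceSketch<Integer> ds =
            new ListDataSourceSketch<>(Arrays.asList(3, 1, 4, 1, 5));
        System.out.println(ds.size());                  // prints 5
        System.out.println(ds.take(2));                 // prints [3, 1]
        System.out.println(ds.reduce(0, Integer::sum)); // prints 14
    }
}
```

On a distributed RDD the order in which values are combined is generally not fixed, so an accumulator passed to the real reduce is probably best kept associative, with zero acting as its identity; that is a general observation about distributed folds rather than something this page states.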
Copyright © 2013–2014 The Apache Software Foundation. All rights reserved.