org.apache.pig
Interface OrderedLoadFunc

All Known Implementing Classes:
BinStorage, FileInputLoadFunc, HiveColumnarLoader, PigStorage, PigStorageSchema, SequenceFileLoader, TableLoader

public interface OrderedLoadFunc

Implementing this interface indicates to Pig that a given loader can be used for MergeJoin. The position as represented by the WritableComparable object is stored in the index created by MergeJoin sampling MR job to get an ordered sequence of splits. This is necessary when the sort key spans multiple splits.

Since:
Pig 0.7

Method Summary
 org.apache.hadoop.io.WritableComparable<?> getSplitComparable(org.apache.hadoop.mapreduce.InputSplit split)
          The WritableComparable object returned will be used to compare the position of different splits in an ordered stream
 

Method Detail

getSplitComparable

org.apache.hadoop.io.WritableComparable<?> getSplitComparable(org.apache.hadoop.mapreduce.InputSplit split)
                                                              throws IOException
The WritableComparable object returned will be used to compare the position of different splits in an ordered stream

Parameters:
split -
Returns:
WritableComparable representing the position of the split in input
Throws:
IOException


Copyright © ${year} The Apache Software Foundation