org.apache.pig
Interface OrderedLoadFunc
- All Known Implementing Classes:
- BinStorage, FileInputLoadFunc, HiveColumnarLoader, PigStorage, PigStorageSchema, SequenceFileLoader, TableLoader
public interface OrderedLoadFunc
Implementing this interface indicates to Pig that a given loader
can be used for MergeJoin. The position as represented by the
WritableComparable object is stored in the index created by
MergeJoin sampling MR job to get an ordered sequence of splits.
This is necessary when the sort key spans multiple splits.
- Since:
- Pig 0.7
Method Summary |
org.apache.hadoop.io.WritableComparable<?> |
getSplitComparable(org.apache.hadoop.mapreduce.InputSplit split)
The WritableComparable object returned will be used to compare
the position of different splits in an ordered stream |
getSplitComparable
org.apache.hadoop.io.WritableComparable<?> getSplitComparable(org.apache.hadoop.mapreduce.InputSplit split)
throws IOException
- The WritableComparable object returned will be used to compare
the position of different splits in an ordered stream
- Parameters:
split
-
- Returns:
- WritableComparable representing the position of the split in input
- Throws:
IOException
Copyright © ${year} The Apache Software Foundation