org.apache.hadoop.hbase.mapreduce
Class TableOutputFormat<KEY>

java.lang.Object
  extended by org.apache.hadoop.mapreduce.OutputFormat<KEY,org.apache.hadoop.io.Writable>
      extended by org.apache.hadoop.hbase.mapreduce.TableOutputFormat<KEY>
Type Parameters:
KEY - The type of the key. Ignored in this class.
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable

public class TableOutputFormat<KEY>
extends org.apache.hadoop.mapreduce.OutputFormat<KEY,org.apache.hadoop.io.Writable>
implements org.apache.hadoop.conf.Configurable

Converts Map/Reduce output and writes it to an HBase table. The KEY is ignored; the output value must be either a Put or a Delete instance.
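
The value contract described above can be modelled in a few lines of plain Java. This is a minimal sketch, not the HBase implementation: ValueDispatchSketch and its nested Put and Delete classes are hypothetical stand-ins for the HBase client types of the same names, and rejecting other value types with an IOException is an assumption about how a writer would enforce the contract.

```java
// Stand-alone model of the value contract; no HBase classes are used.
final class ValueDispatchSketch {
    // Hypothetical stand-ins for the HBase client mutation types.
    static final class Put {}
    static final class Delete {}

    /** Returns the name of the operation a writer would perform for the value. */
    static String dispatch(Object key, Object value) throws java.io.IOException {
        // The key is deliberately unused, mirroring "The KEY is ignored".
        if (value instanceof Put) {
            return "put";
        }
        if (value instanceof Delete) {
            return "delete";
        }
        // Assumption: any other value type is rejected rather than skipped.
        throw new java.io.IOException("Value must be a Put or a Delete");
    }
}
```

In a real job the reducer (or a map-only mapper) would emit Put or Delete objects as its output value, and the key type can be chosen freely because it never reaches the table.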


Nested Class Summary
protected static class TableOutputFormat.TableRecordWriter<KEY>
          Writes the reducer output to an HBase table.
 
Field Summary
static String OUTPUT_TABLE
          Job parameter that specifies the output table.
static String QUORUM_ADDRESS
          Optional job parameter to specify a peer cluster.
static String REGION_SERVER_CLASS
          Optional specification of the region server class name of the peer cluster.
static String REGION_SERVER_IMPL
          Optional specification of the region server implementation name of the peer cluster.
 
Constructor Summary
TableOutputFormat()
           
 
Method Summary
 void checkOutputSpecs(org.apache.hadoop.mapreduce.JobContext context)
          Checks if the output target exists.
 org.apache.hadoop.conf.Configuration getConf()
           
 org.apache.hadoop.mapreduce.OutputCommitter getOutputCommitter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
          Returns the output committer.
 org.apache.hadoop.mapreduce.RecordWriter<KEY,org.apache.hadoop.io.Writable> getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
          Creates a new record writer.
 void setConf(org.apache.hadoop.conf.Configuration otherConf)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

OUTPUT_TABLE

public static final String OUTPUT_TABLE
Job parameter that specifies the output table.

See Also:
Constant Field Values
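
As an illustration of how the job parameters fit together, the sketch below collects them in a plain map. It is a stand-in only: a real job would set these on org.apache.hadoop.conf.Configuration using the constants of this class. OutputTableConfigSketch and settingsFor are hypothetical names, and the literal key strings are assumed values that should be confirmed against the Constant Field Values page.

```java
// Stand-in for job configuration; a plain map replaces Hadoop's Configuration.
final class OutputTableConfigSketch {
    // Assumed literal values of the constants; verify against Constant Field Values.
    static final String OUTPUT_TABLE = "hbase.mapred.outputtable";
    static final String QUORUM_ADDRESS = "hbase.mapred.output.quorum";

    /** Builds the settings a job would carry in order to write to the named table. */
    static java.util.Map<String, String> settingsFor(String tableName, String peerQuorum) {
        if (tableName == null || tableName.isEmpty()) {
            throw new IllegalArgumentException("An output table name is required");
        }
        java.util.Map<String, String> conf = new java.util.LinkedHashMap<String, String>();
        conf.put(OUTPUT_TABLE, tableName);
        // The peer cluster is optional; without it the cluster from
        // hbase-site.xml is the write target.
        if (peerQuorum != null) {
            conf.put(QUORUM_ADDRESS, peerQuorum);
        }
        return conf;
    }
}
```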

QUORUM_ADDRESS

public static final String QUORUM_ADDRESS
Optional job parameter to specify a peer cluster. Used to specify the remote cluster when copying between HBase clusters (the source cluster is picked up from hbase-site.xml).

See Also:
TableMapReduceUtil.initTableReducerJob(String, Class, org.apache.hadoop.mapreduce.Job, Class, String, String, String), Constant Field Values
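
The peer cluster value is conventionally a single string of the form quorum:clientPort:znodeParent, for example zk1,zk2,zk3:2181:/hbase. That format is not stated on this page; it is given here as an assumption based on the TableMapReduceUtil.initTableReducerJob documentation referenced above. The helper below is a hypothetical, stand-alone sketch (PeerClusterKeySketch is not an HBase class) that builds and checks such a string.

```java
// Builds and checks a peer cluster key of the assumed form
// "quorum:clientPort:znodeParent"; uses only java.lang.
final class PeerClusterKeySketch {
    /** Joins the three parts into a single key string. */
    static String format(String quorumHosts, int clientPort, String znodeParent) {
        if (quorumHosts == null || quorumHosts.isEmpty()) {
            throw new IllegalArgumentException("Quorum host list is required");
        }
        if (clientPort < 1 || clientPort > 65535) {
            throw new IllegalArgumentException("Client port out of range: " + clientPort);
        }
        if (znodeParent == null || !znodeParent.startsWith("/")) {
            throw new IllegalArgumentException("Znode parent must start with '/'");
        }
        return quorumHosts + ":" + clientPort + ":" + znodeParent;
    }

    /** Splits a key into its three parts, or throws if it is malformed. */
    static String[] parse(String key) {
        String[] parts = key.split(":");
        if (parts.length != 3) {
            throw new IllegalArgumentException("Expected quorum:port:znode, got: " + key);
        }
        // NumberFormatException is an IllegalArgumentException, so a
        // non-numeric port is reported the same way as a wrong part count.
        Integer.parseInt(parts[1]);
        return parts;
    }
}
```

Validating the string before job submission surfaces a malformed address on the client rather than in a failed task.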

REGION_SERVER_CLASS

public static final String REGION_SERVER_CLASS
Optional specification of the region server class name of the peer cluster.

See Also:
Constant Field Values

REGION_SERVER_IMPL

public static final String REGION_SERVER_IMPL
Optional specification of the region server implementation name of the peer cluster.

See Also:
Constant Field Values
Constructor Detail

TableOutputFormat

public TableOutputFormat()
Method Detail

getRecordWriter

public org.apache.hadoop.mapreduce.RecordWriter<KEY,org.apache.hadoop.io.Writable> getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
                                                                                            throws IOException,
                                                                                                   InterruptedException
Creates a new record writer.

Specified by:
getRecordWriter in class org.apache.hadoop.mapreduce.OutputFormat<KEY,org.apache.hadoop.io.Writable>
Parameters:
context - The current task context.
Returns:
The newly created writer instance.
Throws:
IOException - When creating the writer fails.
InterruptedException - When the job is cancelled.
See Also:
FileOutputFormat.getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext)

checkOutputSpecs

public void checkOutputSpecs(org.apache.hadoop.mapreduce.JobContext context)
                      throws IOException,
                             InterruptedException
Checks if the output target exists.

Specified by:
checkOutputSpecs in class org.apache.hadoop.mapreduce.OutputFormat<KEY,org.apache.hadoop.io.Writable>
Parameters:
context - The current context.
Throws:
IOException - When the check fails.
InterruptedException - When the job is aborted.
See Also:
OutputFormat.checkOutputSpecs(org.apache.hadoop.mapreduce.JobContext)

getOutputCommitter

public org.apache.hadoop.mapreduce.OutputCommitter getOutputCommitter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
                                                               throws IOException,
                                                                      InterruptedException
Returns the output committer.

Specified by:
getOutputCommitter in class org.apache.hadoop.mapreduce.OutputFormat<KEY,org.apache.hadoop.io.Writable>
Parameters:
context - The current context.
Returns:
The committer.
Throws:
IOException - When creating the committer fails.
InterruptedException - When the job is aborted.
See Also:
OutputFormat.getOutputCommitter(org.apache.hadoop.mapreduce.TaskAttemptContext)

getConf

public org.apache.hadoop.conf.Configuration getConf()
Specified by:
getConf in interface org.apache.hadoop.conf.Configurable

setConf

public void setConf(org.apache.hadoop.conf.Configuration otherConf)
Specified by:
setConf in interface org.apache.hadoop.conf.Configurable


Copyright © 2011 The Apache Software Foundation. All Rights Reserved.