org.apache.hadoop.hbase.mapred
Class TableOutputFormat

java.lang.Object
  extended by org.apache.hadoop.mapred.OutputFormatBase<Text,MapWritable>
      extended by org.apache.hadoop.hbase.mapred.TableOutputFormat
All Implemented Interfaces:
OutputFormat<Text,MapWritable>

public class TableOutputFormat
extends OutputFormatBase<Text,MapWritable>

Converts Map/Reduce output and writes it to an HBase table.


Nested Class Summary
protected  class TableOutputFormat.TableRecordWriter
          Convert Reduce output (key, value) to (HStoreKey, KeyedDataArrayWritable) and write to an HBase table
 
Field Summary
static String OUTPUT_TABLE
          JobConf parameter that specifies the output table
 
Constructor Summary
TableOutputFormat()
          Default constructor.
 
Method Summary
 void checkOutputSpecs(FileSystem ignored, JobConf job)
          Check for validity of the output-specification for the job.
 RecordWriter getRecordWriter(FileSystem ignored, JobConf job, String name, Progressable progress)
          Get the RecordWriter for the given job.
 
Methods inherited from class org.apache.hadoop.mapred.OutputFormatBase
getCompressOutput, getOutputCompressorClass, setCompressOutput, setOutputCompressorClass
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

OUTPUT_TABLE

public static final String OUTPUT_TABLE
JobConf parameter that specifies the output table

See Also:
Constant Field Values
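
The role of OUTPUT_TABLE can be illustrated with a minimal sketch. It does not use the Hadoop or HBase classes: a plain Map stands in for JobConf, and the key string "example.output.table" is a placeholder, since the actual constant value is listed under Constant Field Values and is not reproduced on this page. The class and method names are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in: a plain Map plays the role of JobConf.
final class OutputTableKeySketch {
    // Placeholder key. The real value of TableOutputFormat.OUTPUT_TABLE
    // is not shown on this page, so this string is an assumption.
    static final String OUTPUT_TABLE = "example.output.table";

    // Store the table name under the key, as a job driver would.
    static void setOutputTable(Map<String, String> conf, String tableName) {
        conf.put(OUTPUT_TABLE, tableName);
    }

    // Read the table name back, as an output format would; null if unset.
    static String getOutputTable(Map<String, String> conf) {
        return conf.get(OUTPUT_TABLE);
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        setOutputTable(conf, "webtable");
        System.out.println(getOutputTable(conf)); // prints "webtable"
    }
}
```

In a real job the same pattern would presumably go through JobConf with TableOutputFormat.OUTPUT_TABLE as the key, rather than a Map and a placeholder string.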
Constructor Detail

TableOutputFormat

public TableOutputFormat()
Default constructor.

Method Detail

getRecordWriter

public RecordWriter getRecordWriter(FileSystem ignored,
                                    JobConf job,
                                    String name,
                                    Progressable progress)
                             throws IOException
Get the RecordWriter for the given job.

Specified by:
getRecordWriter in interface OutputFormat<Text,MapWritable>
Specified by:
getRecordWriter in class OutputFormatBase<Text,MapWritable>
Parameters:
job - configuration for the job whose output is being written.
name - the unique name for this part of the output.
progress - mechanism for reporting progress while writing to file.
Returns:
a RecordWriter to write the output for the job.
Throws:
IOException
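
As a rough illustration of what a record writer for table output does, the sketch below accepts (row key, column map) pairs and merges them into rows. It is not the TableRecordWriter implementation: an in-memory map stands in for the HBase table, String and Map stand in for Text and MapWritable, and every name in it is hypothetical.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of a table-style record writer.
final class TableRecordWriterSketch {
    // In-memory stand-in for a table: row key -> (column name -> cell value).
    private final Map<String, Map<String, String>> table = new TreeMap<>();
    private boolean closed = false;

    // Write one output pair: the key names the row, the map holds its columns.
    // Columns written to an existing row are merged into that row.
    void write(String rowKey, Map<String, String> columns) {
        if (closed) {
            throw new IllegalStateException("writer is closed");
        }
        table.computeIfAbsent(rowKey, k -> new TreeMap<>()).putAll(columns);
    }

    // After close, further writes are rejected.
    void close() {
        closed = true;
    }

    Map<String, Map<String, String>> rows() {
        return table;
    }

    public static void main(String[] args) {
        TableRecordWriterSketch writer = new TableRecordWriterSketch();
        Map<String, String> columns = new HashMap<>();
        columns.put("info:name", "example");
        writer.write("row1", columns);
        writer.close();
        System.out.println(writer.rows().get("row1").get("info:name")); // prints "example"
    }
}
```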

checkOutputSpecs

public void checkOutputSpecs(FileSystem ignored,
                             JobConf job)
                      throws FileAlreadyExistsException,
                             InvalidJobConfException,
                             IOException
Check for validity of the output-specification for the job.

Validates the output specification when the job is submitted. Implementations typically check that the output does not already exist and throw an exception if it does, so that existing output is not overwritten.

Specified by:
checkOutputSpecs in interface OutputFormat<Text,MapWritable>
Overrides:
checkOutputSpecs in class OutputFormatBase<Text,MapWritable>
Parameters:
job - job configuration.
Throws:
IOException - when output should not be attempted
FileAlreadyExistsException
InvalidJobConfException
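
For an output format that writes to a table rather than to files, one plausible submit-time check is simply that an output table has been configured. The sketch below shows that idea only. It is an assumption about what such a check might look like, not the documented behavior of this method; a Map stands in for JobConf, and the key string and class name are hypothetical.

```java
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of a submit-time output check for a table-backed job.
final class CheckOutputSpecsSketch {
    // Placeholder key; the real OUTPUT_TABLE value is not shown on this page.
    static final String OUTPUT_TABLE = "example.output.table";

    // Assumed check: fail fast if no output table has been configured.
    static void checkOutputSpecs(Map<String, String> conf) throws IOException {
        String table = conf.get(OUTPUT_TABLE);
        if (table == null || table.isEmpty()) {
            throw new IOException("Must specify an output table name");
        }
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        try {
            checkOutputSpecs(conf); // no table configured yet
        } catch (IOException e) {
            System.out.println("rejected: " + e.getMessage());
        }
        conf.put(OUTPUT_TABLE, "webtable");
        try {
            checkOutputSpecs(conf); // passes once a table is named
            System.out.println("accepted");
        } catch (IOException e) {
            System.out.println("unexpected: " + e.getMessage());
        }
    }
}
```

Failing at submission time, before any task runs, is the point of the design: a misconfigured job is rejected before it consumes cluster resources.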


Copyright © 2006 The Apache Software Foundation