chukwa 0.3.0 API

Hadoop MapReduce and HDFS are designed to support efficient batch processing of large datasets.

See:
          Description

Chukwa
org.apache.hadoop.chukwa  
org.apache.hadoop.chukwa.analysis.salsa.fsm  
org.apache.hadoop.chukwa.analysis.salsa.visualization  
org.apache.hadoop.chukwa.conf  
org.apache.hadoop.chukwa.database  
org.apache.hadoop.chukwa.datacollection  
org.apache.hadoop.chukwa.datacollection.adaptor  
org.apache.hadoop.chukwa.datacollection.adaptor.filetailer  
org.apache.hadoop.chukwa.datacollection.agent  
org.apache.hadoop.chukwa.datacollection.agent.metrics  
org.apache.hadoop.chukwa.datacollection.collector  
org.apache.hadoop.chukwa.datacollection.collector.servlet  
org.apache.hadoop.chukwa.datacollection.connector  
org.apache.hadoop.chukwa.datacollection.connector.http  
org.apache.hadoop.chukwa.datacollection.controller  
org.apache.hadoop.chukwa.datacollection.sender  
org.apache.hadoop.chukwa.datacollection.sender.metrics  
org.apache.hadoop.chukwa.datacollection.test  
org.apache.hadoop.chukwa.datacollection.writer  
org.apache.hadoop.chukwa.datacollection.writer.localfs  
org.apache.hadoop.chukwa.dataloader  
org.apache.hadoop.chukwa.extraction  
org.apache.hadoop.chukwa.extraction.archive  
org.apache.hadoop.chukwa.extraction.demux  
org.apache.hadoop.chukwa.extraction.demux.processor  
org.apache.hadoop.chukwa.extraction.demux.processor.mapper  
org.apache.hadoop.chukwa.extraction.demux.processor.reducer  
org.apache.hadoop.chukwa.extraction.engine  
org.apache.hadoop.chukwa.extraction.engine.datasource  
org.apache.hadoop.chukwa.extraction.engine.datasource.database  
org.apache.hadoop.chukwa.extraction.engine.datasource.record  
org.apache.hadoop.chukwa.hicc  
org.apache.hadoop.chukwa.inputtools  
org.apache.hadoop.chukwa.inputtools.hdfsusage  
org.apache.hadoop.chukwa.inputtools.jplugin  
org.apache.hadoop.chukwa.inputtools.log4j  
org.apache.hadoop.chukwa.inputtools.mdl  
org.apache.hadoop.chukwa.inputtools.plugin  
org.apache.hadoop.chukwa.inputtools.plugin.metrics  
org.apache.hadoop.chukwa.inputtools.plugin.nodeactivity  
org.apache.hadoop.chukwa.inputtools.plugin.pbsnode  
org.apache.hadoop.chukwa.rest.actions  
org.apache.hadoop.chukwa.rest.objects  
org.apache.hadoop.chukwa.rest.services  
org.apache.hadoop.chukwa.tools.backfilling  
org.apache.hadoop.chukwa.util  
org.apache.hadoop.metrics.spi  

 

Hadoop MapReduce and HDFS are designed to support efficient batch processing of large datasets. Many organizations accumulate huge volumes of log files and system metrics data, and it's tempting to use MapReduce to do the processing. However, this data has certain unfortunate characteristics. It's updated incrementally, and spread across many machines. This makes it a difficult to use MapReduce on this data. Chukwa is a Hadoop subproject aiming to bridge this gap, and to facilitate MapReduce processing of monitoring data. Chukwa is an open source data collection system for monitoring and analyzing large distributed systems. Chukwa is built on top of the Hadoop distributed filesystem (HDFS) and MapReduce framework and inherits Hadoop's scalability and robustness. Chukwa also includes a flexible and powerful toolkit for displaying monitoring and analyzing results, in order to make the best use of this collected data.



Copyright © ${year} The Apache Software Foundation