|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
Class Summary | |
---|---|
CalculateEntropyMapper | Calculates the entropy for the value with H(x) = x * log(x) |
CalculateEntropyReducer | Subtracts the partial entropy. |
CalculateSpecificConditionalEntropyMapper | Drops the key. |
ConditionalEntropy | A Hadoop job to compute the conditional entropy H(Value|Key) for a sequence file. |
DoubleSumReducer | Analog of org.apache.hadoop.mapreduce.lib.reduce.IntSumReducer which sums the double values. |
Entropy | A Hadoop job to compute the entropy of keys or values in a SequenceFile . |
GroupAndCountByKeyAndValueMapper | Groups the input by key and value. |
InformationGain | Calculates the information gain for a SequenceFile . |
InformationGainRatio | A job to calculate the normalized information gain. |
KeyCounterMapper | Emits the key and the count of 1 as VarIntWritable . |
SpecificConditionalEntropyMapper | Converts the key from StringTuple with values [key, value] to Text with value key. |
SpecificConditionalEntropyReducer | Does the weighted conditional entropy calculation with H(values|key) = p(key) * sum_i(p(values_i|key) * log_2(p(values_i|key))) = p(key) * (log(|key|) - sum_i(values_i * log_2(values_i)) / |key|) = (sum * log_2(sum) - sum_i(values_i * log_2(values_i))/n WITH sum = sum_i(values_i) = (sum * log(sum) - sum_i(values_i * log(values_i)) / (n * log(2)) |
ValueCounterMapper | Emits the value and the count of 1 as VarIntWritable . |
VarIntSumReducer | The analog of org.apache.hadoop.mapreduce.lib.reduce.IntSumReducer which uses VarIntWritable . |
|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |