org.apache.accumulo.examples.wikisearch.normalizer
Class LcNoDiacriticsNormalizer

java.lang.Object
  extended by org.apache.accumulo.examples.wikisearch.normalizer.LcNoDiacriticsNormalizer
All Implemented Interfaces:
Normalizer

public class LcNoDiacriticsNormalizer
extends Object
implements Normalizer

An Normalizer which performs the following steps:

  1. Unicode canonical decomposition (Normalizer.Form.NFD)
  2. Removal of diacritical marks
  3. Unicode canonical composition (Normalizer.Form.NFC)
  4. lower casing in the English local


Constructor Summary
LcNoDiacriticsNormalizer()
           
 
Method Summary
 String normalizeFieldValue(String fieldName, Object fieldValue)
          Creates normalized content for ingest based upon implemented logic.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

LcNoDiacriticsNormalizer

public LcNoDiacriticsNormalizer()
Method Detail

normalizeFieldValue

public String normalizeFieldValue(String fieldName,
                                  Object fieldValue)
Description copied from interface: Normalizer
Creates normalized content for ingest based upon implemented logic.

Specified by:
normalizeFieldValue in interface Normalizer
Parameters:
fieldName - The field being normalized
fieldValue - The value to normalize
Returns:
a normalized value


Copyright © 2012 The Apache Software Foundation. All Rights Reserved.