Chinese Morphs Identification and Normalization
Internet language is a casual informal language. Entity morph is an important feature of Internet Language. In some situation, Internet users are keen on creating kinds of morphs, special kinds of fake alternative names to achieve some goals, express strong sentiment or humor, and avoid censorship. Entity morphs and their target entities respectively appear on informal and formal text. And in some situation, entity morphs even appear on formal text. Although using en-tity morphs has some advantages, but morphs are big barriers for natural language processing (NLP). So it is very impor-tant to research on morph identification and normalization. First, we will introduce the definition of morphs and the fea-tures of morphs; second, we will show the rules of generating morphs; third, the current progress of morph identification and normalization will be demonstrated. Finally, it is the prospect of this field.