Automatic Extraction of Template Patterns Using n-gram with Tokens

Hajime UCHINO*    Satoshi SHIRAI*    Satoru IKEHARA+    Midori SHINTAMI++

*NTT Communication Science Laboratories
+Tottori University
++Hokkaido University

uchino@nttkb,ntt.jp


In machine translation a useful method for translating repetitive text is template translation, that is using fixed templates with only a few variables. However, it is hard to compile these templates by hand. This paper describes a method of automatic extraction of template patterns from Japanese corpora using n-gram after replacing potential variable elements with tokens.

automatic extraction, template pattern, corpus, n-gram



[ Technical Report of IEICE, NLC96-18, pp.63-68 (July, 1996). ]