µ¡³£ËÝÌõ¼­½ñ¹½ÃÛ»Ù±ç¥Ä¡¼¥ë

Design Tools for Japanese-to-English Machine Translation Dictionaries


ÆâÌî °ì*  ½ÕÌî ²íɧ**  ¹â¶¶ ÂçÏÂ*  Çò°æ Í¡*

Hajime UCHINO Masahiko HARUNO Yamato TAKAHASHI Satoshi SHIRAI

¤¢¤é¤Þ¤·

µ¡³£ËÝÌõ¤ÎÀ­Ç½¤ò¸þ¾å¤µ¤»¤ë¤Ë¤ÏÂÐÌõ¥Ç¡¼¥¿¤¬ÂçÎ̤ËɬÍפȤʤ뤬, ÂÐÌõ¥³¡¼¥Ñ¥¹¤È¤·¤Æ¤Ï¾®µ¬ÌϤΤâ¤Î¤·¤«Â¸ºß¤·¤Ê¤¤. ËÜÏÀʸ¤Ç¤Ï, ¿·Ê¹µ­»ö¤òÂоݤˤ¹¤ì¤Ð·Ñ³Ū¤Ë¥Ç¡¼¥¿¤¬ÆÀ¤é¤ì¤ëÅÀ¤ËÃåÌܤ·¤Æ, ÆüËܸì¤È±Ñ¸ì¤Î¿·Ê¹µ­»ö¤È¤½¤ì¤Ë´Þ¤Þ¤ì¤ëʸ¤òÂбþ¤Å¤±¤ë¤³¤È¤Ë¤è¤ê Â絬ÌϤÊÂÐÌõ¥³¡¼¥Ñ¥¹¤ò¹½ÃÛ¤¹¤ëÊýË¡¤ò¼¨¤¹. ¼¡¤Ë, n-gram Åý·×½èÍý¤ò±þÍѤ¹¤ë¤³¤È¤Ë¤è¤ê, Â絬ÌϤʥ³¡¼¥Ñ¥¹¤ò¸úΨ¤è¤¯ÂηÏΩ¤Æ¤ÆÊ¬ÀϤ¹¤ë¼êË¡¤ò¼¨¤¹. ¤Þ¤¿, ÂÐÌõ¥Ç¡¼¥¿¤äʬÀϤˤè¤Ã¤ÆÆÀ¤é¤ì¤¿¾ðÊó¤Ë´ð¤Å¤¤¤Æ, µ¡³£ËÝÌõÍѤμ­½ñ¤òºîÀ®¤¹¤ë¤¿¤á¤Î»Ù±ç½èÍý¤Ë¤Ä¤¤¤Æ½Ò¤Ù¤ë.



Abstract

Large bilingual corpora are useful for improving the quality of machine translations, but only small bilingual corpora exist. This paper describes a method of building a large bilingual corpus by aligning Japanese sentences with English sentences from newspaper articles which can be easily and continually collected. A corpus analysis method by using n-gram statistics and computer-aided design tools of machine translation dictionaries are also described.



[ NTT R&D, Vol.46, No.12, pp.1425-1432 (1997.12). ]
[ NTT R&D, Vol.46, No.12, pp.1425-1432 (December, 1997). ]



* NTT¥³¥ß¥å¥Ë¥±¡¼¥·¥ç¥ó²Ê³Ø¸¦µæ½ê NTT Communication Science Laboratories
** ATR¿Í´Ö¾ðÊóÄÌ¿®¸¦µæ½ê ATR Human Information Processing Research Laboratories
(C)ÆüËÜÅÅ¿®ÅÅÏóô¼°²ñ¼Ò 1997



INDEX

     1 ¤Þ¤¨¤¬¤­
2 Æü±ÑÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛ
  2.1 Æü±Ñ¿·Ê¹µ­»ö¤Î¼«Æ°Âбþ¤Å¤±
    2.1.1 ¼«Æ°Âбþ¤Å¤±¤ÎÊýË¡
    2.1.2 µ­»öÂбþ¤Îɾ²Á
  2.2 Åý·×¾ðÊó¤È¼­½ñ¾ðÊó¤òÍѤ¤¤¿¼«Æ°¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à
    2.2.1 ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Î³µÍ×
    2.2.2 ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Îɾ²Á
  2.3 ÂÐÌõ¥³¡¼¥Ñ¥¹¹½Ã۴Ķ­: BACCS
    2.3.1 ¥¢¥é¥¤¥á¥ó¥È¹½Ã۴Ķ­¤ÎɬÍ×À­
    2.3.2 BACCS¤Î³µÍ×
3 n-gramÅý·×½èÍý¥Ä¡¼¥ë
  3.1 ÌÜŪ
  3.2 n-gramÅý·×½èÍý¤Ë¤è¤ëÄ귿Ūɽ¸½¤ÎÃê½Ð
  3.3 Ëܥġ¼¥ë¤Î±þÍÑ
4 ËÝÌõ¼­½ñ¤ÎºîÀ®»Ù±ç
  4.1 ̾»ì¤Î°Ọ̃°À­¤Î¼«Æ°¿äÄê
    4.1.1 °Ọ̃°À­¿äÄê½èÍý¤ÎɬÍ×À­¤ÈÊýË¡
    4.1.2 °Ọ̃°À­¿äÄê¤Î¸ú²Ì
  4.2 ·ë¹ç²Á¥Ñ¥¿¥óÂФÎȾ¼«Æ°ºîÀ®
    4.2.1 ÇØ·Ê
    4.2.2 ºîÀ®»Ù±ç¤ÎÊýË¡¤È¸ú²Ì
5 ¤¢¤È¤¬¤­
  Ê¸¸¥



1 ¤Þ¤¨¤¬¤­

²òÀÏ¡¦ÊÑ´¹¡¦À¸À®¤Î½èÍý¥¹¥Æ¥Ã¥×¤«¤é¹½À®¤µ¤ì¤ëµ¡³£ËÝÌõ¥·¥¹¥Æ¥à¤ÎÀ­Ç½¤Î¸þ¾å¤ò¿Þ¤ë¤Ë¤Ï, Â絬ÌϤ«¤Ä¾ÜºÙ¤Ê¼­½ñ¤ä¥ë¡¼¥ë½¸¤òÀ°È÷¤¹¤ëɬÍפ¬¤¢¤ë. ¼­½ñ¤ä¥ë¡¼¥ë¤òÀ°È÷¤¹¤ë¤Ë¤Ï¼ÂºÝ¤ÎÂÐÌõ¥Ç¡¼¥¿¤òʬÀϤ¹¤ëɬÍפ¬¤¢¤ê, ¥Ç¡¼¥¿Ê¬ÀÏ·ë²Ì¤Î¿®ÍêÀ­¤ò¹â¤á¤ë¤Ë¤ÏÂçÎ̤ÎÂÐÌõ¥Ç¡¼¥¿¤¬É¬ÍפȤʤë. ¤Þ¤¿, ÂçÎ̤Υǡ¼¥¿¤ËÂФ·¤Æ, ¤¢¤ëÆÃÄê¤Î¸½¾Ý¤ËÃåÌܤ·¤ÆÊ¬ÀϤ¹¤ë¾ì¹ç¤ÏÌäÂê¤Ë¤Ï¤Ê¤é¤Ê¤¤¤¬, ¼­½ñ¤ä¥ë¡¼¥ë¤ÎÀ°È÷¤È¤¤¤¦´ÑÅÀ¤«¤é¤Ï ¥Ç¡¼¥¿Á´È̤˶¦Ä̤¹¤ëÆÃÀ·¤ò¸úΨ¤è¤¯Ê¬ÀϤ¹¤ë¼êË¡¤Î³ÎΩ¤¬É¬ÍפȤʤ뤬, ¤³¤ì¤Þ¤Ç¤Ï¤¢¤Þ¤êÂηÏΩ¤Æ¤ÆÏÀ¤¸¤é¤ì¤ë¤³¤È¤Ï¤Ê¤«¤Ã¤¿.

ËÜÏÀʸ¤Ç¤Ï, ÂÐÌõ¥Ç¡¼¥¿¤ÎºîÀ®, ʬÀÏ, ¤ª¤è¤Ó, ʬÀϤµ¤ì¤¿¥Ç¡¼¥¿¤«¤é¤Î¼­½ñºîÀ®»Ù±ç¤È¤¤¤Ã¤¿°ìÏ¢¤Î¼êË¡¤Ë¤Ä¤¤¤Æ½Ò¤Ù¤ë.

2¾Ï¤Ç¤Ï, ¿·Ê¹µ­»ö¤òÂоݤˤ¹¤ì¤Ð·Ñ³Ū¤Ë¥Ç¡¼¥¿¤¬ÆÀ¤é¤ì¤ëÅÀ¤ËÃåÌܤ·¤Æ, Â絬ÌϤÊÂÐÌõ¥³¡¼¥Ñ¥¹¤ò¹½ÃÛ¤¹¤ëÊýË¡¤ò¼¨¤¹. ¤³¤³¤Ç¤Ï, ÆüËܸì¤È±Ñ¸ì¤Î¿·Ê¹µ­»ö¤ò¤È¤ê½Ð¤·, ÆâÍÆÅª¤Ë´ØÏ¢¤Î¤¢¤ëµ­»ö¤ò¼«Æ°Åª¤ËÂбþ¤Å¤±, ¤½¤Î¸å, ʸñ°Ì¤ÇÂбþ´Ø·¸¤òÄ´¤Ù¤ë¤³¤È¤Ë¤è¤ê, ÂÐÌõÎãʸ½¸¤òºîÀ®¤¹¤ë.

¼¡¤Ë, 3¾Ï¤Ç¤Ï, n-gramÅý·×½èÍý¤ò±þÍѤ¹¤ë¤³¤È¤Ë¤è¤ê, Â絬ÌϤʥ³¡¼¥Ñ¥¹¤ò¸úΨ¤è¤¯ÂηÏΩ¤Æ¤ÆÊ¬ÀϤ¹¤ë¼êË¡¤ò¼¨¤¹. ¤³¤Î¼êË¡¤Ë¤è¤ì¤Ð, ¥³¡¼¥Ñ¥¹¤Ë´Þ¤Þ¤ì¤ëǤ°Õ¤ÎŤµ¤Îʸ»úÎó¤¬, Ťµ¤äÅÙ¿ô¤Ê¤É¤Î½ç¤Ë´Êñ¤Ë¥ê¥¹¥È¥¢¥Ã¥×¤µ¤ì¤ë¤¿¤á, ¼­½ñ¤ËÅÐÎФ¹¤Ù¤­É½¸½¤äËÝÂô¥ë¡¼¥ë¤Î¸µ¤È¤Ê¤ë¥Ç¡¼¥¿¤¬¸úΨ¤è¤¯Ãê½Ð¤µ¤ì¤ë.

4¾Ï¤Ç¤Ï, ÂÐÌõ¥Ç¡¼¥¿¤äʬÀϤˤè¤Ã¤ÆÆÀ¤é¤ì¤¿¾ðÊó¤Ë´ð¤Å¤¤¤Æ, µ¡³£ËÝÌõÍѤμ­½ñ¤òºîÀ®¤¹¤ë¤¿¤á¤Î»Ù±ç½èÍý¤Ë¤Ä¤¤¤Æ½Ò¤Ù¤ë. ñ¸ì¤ÎÂÐÌõ½¸¤ËÂФ·¤Æ¤Ï, ´û¸¤Îñ¸ì°ÕÌ£¼­½ñ¤È¾È¹ç¤¹¤ë¤³¤È¤Ë¤è¤ê¾ÜºÙ¤Êñ¸ì°À­¤¬Ä󼨤µ¤ì¤ë. Æü±ÑÂÐÌõʸ¤ËÂФ·¤Æ¤Ï, ALT-J/E¤ÎÆüËܸì²òÀϤȴÊñ¤Ê±Ñ¸ì²òÀϤòÁȤ߹ç¤ï¤»¤ë¤³¤È¤Ë¤è¤ê, ·ë¹ç²Á¥Ñ¥¿¥óÂФθ¶·Á¤È¤Ê¤ë¾ðÊ󤬯À¤é¤ì¤ë.




2 Æü±ÑÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛ

ÂÐÌõ¥³¡¼¥Ñ¥¹¤Ï, µ¡³£ËÝÌõ¥·¥¹¥Æ¥à¹½Ãۤˤª¤¤¤Æ ɬÍפÊÍÍ¡¹¤Ê¸À¸ìÃ챤ò¤È¤ê½Ð¤¹¤¦¤¨¤Ç½ÅÍפʾðÊ󸻤Ǥ¢¤ë. ¼ÂºÝ, ÂÐÌõ¥³¡¼¥Ñ¥¹¤«¤éÍÍ¡¹¤Ê¸À¸ì¾ðÊó¤òÃê½Ð¤¹¤ë»î¤ß¤¬ ¤Ê¤µ¤ì¤Æ¤¤¤ë(1)(2)¤¬, ÀºÅ٤Τ褤Ã챤òÃê½Ð¤¹¤ë¤¿¤á¤Ë¤Ï, ¤Ê¤ë¤Ù¤¯ÂçÎ̤ÎʸÂбþ(¥¢¥é¥¤¥á¥ó¥È)¤Î¤È¤ì¤¿ÂÐÌõ¥³¡¼¥Ñ¥¹¤¬¤¢¤ë¤³¤È¤«Ë¾¤Þ¤·¤¤.

ÂÐÌõ¥³¡¼¥Ñ¥¹¤«¤éʸ¥ì¥Ù¥ë¤ÎÂбþ´Ø·¸¤òÆÃÄꤹ¤ë, ¤¤¤ï¤æ¤ëʸÂбþ¤Î¸¦µæ¤Ë´Ø¤·¤Æ¤â, ÍÍ¡¹¤Ê»î¤ß¤¬¤Ê¤µ¤ì¤Æ¤¤¤ë(3)¡Á(6)¤¬, ¹½Â¤¤Î°Û¤Ê¤ë¸À¸ì²´Ö(Î㤨¤Ð, ÆüËܸì¤È±Ñ¸ì)¤ÇÀµ¤·¤¯Ê¸Âбþ¤¬¤È¤ì¤¿ÂÐÌõ¥³¡¼¥Ñ¥¹¤ò ¼ý½¸¤¹¤ë¤¿¤á¤ÎÊýË¡ÏÀ¤ä¥Ä¡¼¥ë¤Ë¤Ä¤¤¤Æ, ¤¢¤Þ¤ê¸¦µæ¤¬¤Ê¤µ¤ì¤Æ¤¤¤Ê¤¤.

ËܾϤǤÏ, Æü±Ñ¤Î¿·Ê¹µ­»ö¤¬·Ñ³Ū¤Ë¼ý½¸¤Ç¤­¤ë¤³¤È¤ËÃåÌܤ·, ¤³¤ì¤òÆü±ÑÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¥½¡¼¥¹¥Ç¡¼¥¿¤È¤·¤ÆÍøÍѤ¹¤ë¤³¤È¤ò¹Í¤¨¤ë. ¼¡¤Ë, ʸÂбþ¤Î¤È¤ì¤¿ÂÐÌõ¥³¡¼¥Ñ¥¹¼ý½¸¤òÍÆ°×¤Ë¤¹¤ë¤¿¤á¤ÎÂÐÌõ¥³¡¼¥Ñ¥¹¹½Ã۴Ķ­ BACCS(Bilingual Aligned Corpus Construction System)¤Ë¤Ä¤¤¤Æ½Ò¤Ù¤ë. BACCS¤Ï½¾Íè¼êË¡¤È¤Ï°Û¤Ê¤ë¥í¥Ð¥¹¥È¤Ê¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤òÍ­¤·, ¥×¥í¥°¥é¥à¤Î½ÐÎϤ¹¤ë¥¢¥é¥¤¥á¥ó¥È·ë²Ì¤ò¥°¥é¥Õ¥£¥«¥ë¤Ê¥¤¥ó¥¿¥Õ¥§¡¼¥¹¤òÄ̤¸¤Æ ´Êñ¤Ë³Îǧ/½¤Àµ¤¬²Äǽ¤ÊÂÐÌõ¥³¡¼¥Ñ¥¹¹½Ã۴Ķ­¤Ç¤¢¤ë. °Ê²¼, Æü±Ñ¿·Ê¹µ­»ö¤òÂбþ¤Å¤±¤ë¼êË¡, BACCS¤¬Í­¤¹¤ë¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Î¾ÜºÙ, ¤ª¤è¤Ó BACCS¤Î»ý¤Äµ¡Ç½¤Ë¤Ä¤¤¤Æ½Ò¤Ù¤ë.




2.1 Æü±Ñ¿·Ê¹µ­»ö¤Î¼«Æ°Âбþ¤Å¤±




2.1.1 ¼«Æ°Âбþ¤Å¤±¤ÎÊýË¡

¿·Ê¹µ­»ö¤òÂоݤˤ¹¤ì¤Ð·Ñ³Ū¤Ë¥Ç¡¼¥¿¤¬ÆÀ¤é¤ì¤ëÅÀ¤ËÃåÌܤ·¤Æ, ÆüËܸì¤È±Ñ¸ì¤Î¿·Ê¹µ­»ö¤òÂбþ¤Å¤±¤ë¤³¤È¤Ë¤è¤ê, ÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¥½¡¼¥¹¥Ç¡¼¥¿¤òÆÀ¤ë¤³¤È¤¬¹Í¤¨¤é¤ì¤ë(7). ¤³¤³¤Ç¤Ï, ÆüËܸì¤È±Ñ¸ì¤ÎξÊý¤Î¿·Ê¹¤òȯ¹Ô¤·¤Æ¤¤¤ë ÆüËܷкѿ·Ê¹¼Ò¤Î¥Æ¥ì¥³¥ó¥Ç¡¼¥¿¥Ù¡¼¥¹¤ËÃåÌܤ·¤¿. ¥Æ¥ì¥³¥ó¥Ç¡¼¥¿¥Ù¡¼¥¹¤Ë¤ÏÆüËܸì¤Îµ­»ö¤È±Ñ¸ì¤Îµ­»ö¤ÏÊÌ¡¹¤Ë¼ýÏ¿¤µ¤ì, ÆüËÜʸµ­»ö¤¬1ÆüÊ¿¶ÑÌó700µ­»ö¤Ç¤¢¤ë¤Î¤ËÂФ·¤Æ, ±Ñʸµ­»ö¤Ï1ÆüÊ¿¶ÑÌó100µ­»ö¤Ç¤¢¤ê, ¤Þ¤¿1µ­»öÅö¤ê¤Îµ­½ÒÎ̤⤫¤Ê¤ê°Û¤Ê¤Ã¤Æ¤¤¤ë. ¤µ¤é¤Ë, ÆâÍÆ¤ò¾È¹ç¤·¤¿·ë²Ì, µ­»ö¤Îȯ¿®ÆüÉդˤº¤ì¤¬¤¢¤ë¤³¤È¤¬Ê¬¤«¤Ã¤¿. ¤½¤³¤Ç, ±Ñʸµ­»ö1Æüʬ(Ìó100µ­»ö)¤ËÂФ·¤Æ, ÆüËÜʸµ­»ö3Æüʬ(Ìó2,000)µ­»ö¤òÂоݤ˵­»ö¤ÎÂбþ¤Å¤±¤ò¹Ô¤¦¤³¤È¤Ë¤¹¤ë.

¥Ç¡¼¥¿¥Ù¡¼¥¹¤ò¸¡º÷¤¹¤ëºÝ¤Ï, ¸Çͭ̾»ì¤ä°ìÈÌ̾»ì¤ò¥­¡¼¥ï¡¼¥É¤È¤·¤Æ»ÈÍѤ¹¤ë¤Î¤¬°ìÈÌŪ¤Ç¤¢¤ê, ¿ôÃ;ðÊó¤Ï»ÈÍѤµ¤ì¤Ê¤¤. ¤·¤«¤·, ¿ôÃ;ðÊó¤Ïʸ»úÎó²òÀϤˤè¤êÍÆ°×¤Ë¤È¤ê½Ð¤»¤ë¤³¤È, ¿ôÃͤϵ­»ö¸ÇÍ­¤Î¾ðÊó¤Î1¤Ä¤È¹Í¤¨¤é¤ì¤ë¤³¤È¤«¤é, ¤Þ¤º¿ôÃͤò¥­¡¼¥ï¡¼¥É¤È¤·¤ÆÍøÍѤ·, µ­»ö¤ÎÂбþ¤Å¤±¤ËŬÍѤ¹¤ë¤³¤È¤ò¹Í¤¨¤¿.

8Æüʬ¤Î±Ñʸµ­»ö¤ËÂбþ¤¹¤ëÆüËÜʸµ­»ö¤Î¸¡½Ð¼Â¸³¤ò¹Ô¤Ã¤¿¤È¤³¤í, ¿ôÃ;ðÊó¤Î¤ß¤Ç51.6%¤Î±Ñʸµ­»ö¤Ç, ¤µ¤é¤Ë¸Çͭ̾»ì¾ðÊó¤òÊ»ÍѤ¹¤ë¤³¤È¤Ë¤è¤ê80.4%¤Î±Ñʸµ­»ö¤Ç, ¤½¤ì¤¾¤ìÂбþ¤¹¤ëÆüËÜʸµ­»ö¤¬ÆÀ¤é¤ì¤ë¤³¤È¤¬ ʬ¤«¤Ã¤¿(8)(9). ¤Þ¤¿, Âбþ¤¹¤ëÆü±Ñ¤Îµ­»öÂФòÈæ³Ó¤¹¤ë¤È, ±Ñʸµ­»ö¤Î¤¦¤Á56.9%¤ÎÉôʬ¤Ë¤Ï·Á¼°Åª¤Ë¤â¤è¤¯Âбþ¤¹¤ëÆüËÜʸ¤¬Â¸ºß¤·, 40.3%¤ËÂФ·¤Æ¤ÏÆâÍÆÅª¤ËÂбþ¤¹¤ëÆüËÜʸ¤¬Â¸ºß¤¹¤ë¤³¤È¤¬ ʬ¤«¤Ã¤¿(10). ¤³¤ì¤Ë¤è¤ê, ¿·Ê¹µ­»ö¤òÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¥½¡¼¥¹¥Ç¡¼¥¿¤È¤·¤ÆÍøÍѤ¹¤ë¤á¤É¤¬ÆÀ¤é¤ì¤¿.




2.1.2 µ­»öÂбþ¤Îɾ²Á

Æü±Ñµ­»ö¤ÎÂбþ¤Å¤±¤ò¹Ô¤Ã¤¿ºÝ, Ê£¿ô¤Îµ­»öÂбþ¸õÊ䤬¤¢¤ë¾ì¹ç, Àµ¤·¤¤Âбþµ­»ö¤òÆÃÄꤹ¤ë¤¿¤á¤Î¼«Æ°Åª¤Êɾ²ÁË¡¤¬É¬ÍפȤʤë. ¿Í¼ê¤Ë¤è¤ê³Îǧ¤¹¤ë¤Î¤Ç¤¢¤ì¤Ð, 1Æüʬ¤Îµ­»ö¤Ë¤ª¤±¤ë¿ôÃͤäñ¸ì¤Î½Ð¸½¿ôÅù¤ò»²¹Í¤Ë¤·¤Ê¤¬¤é, ¤Ç¤­¤ë¤À¤±Â¿¤¯¤Îµ­»ö¤òÀµ³Î¤ËÂбþ¤Å¤±¤ë¤³¤È¤¬¤Ç¤­¤ë. ¤·¤«¤·, ¤½¤ÎºÝ¤Ë»ÈÍѤµ¤ì¤ë¥Ò¥å¡¼¥ê¥¹¥Æ¥£¥Ã¥¯¥ë¡¼¥ë¤òÄêÎ̲½¤¹¤ë¤³¤È¤Ïº¤Æñ¤Ç, ¼«Æ°Åª¤ÊÂбþ¤Å¤±¤Î¤¿¤á¤Îï礤ÃÍÀßÄê¤Ë¤ÏÍøÍѤǤ­¤Ê¤¤. ¤½¤³¤Ç, ±Ñʸµ­»ö¤Ë½Ð¸½¤¹¤ë¿ôÃͤä¸Çͭ̾»ì¤¬Âбþ¸õÊäµ­»ö¤Ë¸½¤ì¤¿·ï¿ô¤Ë¤è¤ë Í¥Àè½ç°Ì¤Å¤±ÊýË¡¤òÄ󰯤·, ¤½¤ÎÍ­¸úÀ­¤ò³Îǧ¤·¤¿(9)(11). Æü±Ñ¤Î¿·Ê¹µ­»ö8Æüʬ¤òÂоݤȤ·¤Æ·×²Á¤·¤¿·ë²Ì, Âбþµ­»ö¸õÊ䤬1¤Ä¤Î¤È¤­¤ÏÂбþ¹àÌܤ¬3¸Ä°Ê¾å, Âбþµ­»ö¸õÊ䤬ʣ¿ô¤¢¤ë¤È¤­¤ÏÂè1¸õÊä¤ÈÂè2¸õÊä¤ÎÂбþ¹àÌܤκ¹¤¬2¸Ä°Ê¾å, ¤È¤¤¤¦¾ò·ï¤òÀßÄꤹ¤ë¤³¤È¤Ë¤è¤ê, 1ÆüÊ¿¶ÑÌó37µ­»ö¤òŬ¹çΨ100%¤ÇÂбþ¤Å¤±¤é¤ì¤ë¤³¤È¤¬Ê¬¤«¤Ã¤¿. º£¸å¤Ï, Ŭ¹çΨ¤òÊݤÁ¤Ê¤¬¤éºÆ¸½Î¨¤ò¹â¤á¤ë¤¿¤á¤Î¾ò·ïÀßÄê¤Ë¤Ä¤¤¤Æ¸¡Æ¤¤ò¿Ê¤á¤ë.




2.2 Åý·×¾ðÊó¤È¼­½ñ¾ðÊó¤òÍѤ¤¤¿¼«Æ°¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à




2.2.1 ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Î³µÍ×

BACCS¤¬Í­¤¹¤ë¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Ï, Åý·×¾ðÊó¤È¼­½ñ¾ðÊó¤òÍѤ¤¤ÆÆÀ¤é¤ì¤ëÌõ¸ìÂФò´ð¤Ë, ¥¢¥ó¥«¡¼(Âбþ´Ø·¸¤¬³ÎÄꤷ¤¿Ê¸Âбþ¥Ú¥¢)¤ò¸«¤Ä¤±¤ëÁàºî¤ò·«¤êÊÖ¤¹¤³¤È¤Ë¤è¤Ã¤Æ, ʸ¤ÎÂбþ¤Å¤±¤ò¹Ô¤¦. ¤³¤³¤Ç, ½é´üŪ¤Ê¥¢¥ó¥«¡¼¤È¤·¤Æ, 2.1Àá¤Ë¤ª¤¤¤Æ, Âбþ¤Å¤±¤é¤ì¤¿Æü±Ñ¿·Ê¹µ­»ö¥Ç¡¼¥¿¥Ù¡¼¥¹¤«¤é, ¤½¤ì¤¾¤ì¤Îµ­»ö¤ÎÁ°¸å¤ò»ØÄꤹ¤ë¤³¤È¤¬²Äǽ¤Ç¤¢¤ë. Åý·×¾ðÊó¤ÎÍøÍѤÏ, ʸ̮¤Ë±þ¤¸¤¿¾ðÊó¤¬ÍøÍѤǤ­¤ëÅÀ¤ä, ·ÁÂÖÁDzòÀϤθí¤ê¤ËÂнè²Äǽ¤Ç¤¢¤ë¤È¤¤¤¦¥í¥Ð¥¹¥ÈÀ­¤¬¤¢¤ëÈ¿ÌÌ, Ê£¿ô²ó½Ð¸½¤¹¤ëñ¸ì¤·¤«ÍøÍѤǤ­¤Ê¤¤¤È¤¤¤¦·çÅÀ¤¬¤¢¤ë. °ìÊý, ¼­½ñ¾ðÊó¤ÎÍøÍѤÏ, °ìÅÙ¤·¤«½Ð¸½¤·¤Ê¤¤Ã±¸ì¤ËÂФ·¤Æ¤âÂнè²Äǽ¤Ç¤¢¤ëÈ¿ÌÌ, Ìõ¸ìÁªÂò¤Î¿ÍÍjÀ¸, ·ÁÂÖÁDzòÀϤθí¤ê¤Ë¤ÏÂнèÉÔ²Äǽ¤È¤¤¤¦·çÅÀ¤òÍ­¤¹¤ë. ËÜ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Ï, ¤³¤Îξ¼Ô¤ÎĹ½ê¤ò·ó¤ÍÈ÷¤¨¤¿¤â¤Î¤Ç¤¢¤ë.

¥¢¥ë¥´¥ê¥º¥à¤Îή¤ì¤Ï, ʸ¸¥(4)¤Î¥¢¥ë¥´¥ê¥º¥à¤ò´ðËܤȤ·¤Æ¤¤¤ë. ¤Þ¤º, ¤¹¤Ç¤Ë·èÄꤷ¤¿¥¢¥ó¥«¡¼¤ò´ð¤ËÂбþ²ÄǽÈϰϤòÀ¸À®¤¹¤ë. ³¤¤¤Æ, Åý·×¾ðÊó¤È¼­½ñ¾ðÊó¤Ë¤è¤Ã¤ÆÆÀ¤é¤ì¤ëÌõ¸ìÂФ«¤é, ¿·¤¿¤Ê¥¢¥ó¥«¡¼¤ò¤ß¤Ä¤±½Ð¤¹. ¤³¤ÎÁàºî¤ò·«¤êÊÖ¤¹¤³¤È¤Ë¤è¤Ã¤Æ, ½ç¼¡Ê¸Âбþ¥Ú¥¢¤ò¤ß¤Ä¤±¤ë(12)(13).

Åý·×¾ðÊó¤Ë¤è¤ëÌõ¸ìÂФÎÀ¸À®¼ê½ç¤Ï, ¤Þ¤ºÂбþ²Äǽ¥Ú¥¢¤Ë½Ð¸½¤¹¤ëÆüËܸìñ¸ì, ±Ñ¸ìñ¸ì¤ÎÁê¸ß¾ðÊóÎ̤Èt-score(3)¤ò·×»»¤·, ¼¡¤Ë¤½¤ì¤é¤ÎÃͤ¬¤¢¤ë·è¤á¤é¤ì¤¿ï礤ÃͰʾå¤È¤Ê¤ë¤â¤Î¤òÌõ¸ìÂФȤ¹¤ë. ¤³¤Îï礤ÃͤÏ, ½é´üŪ¤Ë¤Ï¾®¤µ¤á¤ËÀßÄꤷ, ºÇ½ª¾õÂ֤˶á¤Å¤¯¤Ë½¾¤Ã¤ÆÂ礭¤¯¤¹¤ë¤È¤¤¤¦ ¥¢¥Ë¡¼¥ê¥ó¥°¼êË¡¤Ë¤è¤ê·èÄꤹ¤ë(14).




2.2.2 ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Îɾ²Á

¡ÈScientific American¡É¤È¤½¤ÎÆüËܸìÈǤǤ¢¤ëÆü·Ð¥µ¥¤¥¨¥ó¥¹¤«¤é¤ÎÂÐÌõ¥Æ¥­¥¹¥È¤È, WWW¤òÄ̤¸¤ÆÆÉÇ俷ʹ¥Û¡¼¥à¥Ú¡¼¥¸¤«¤éÆÀ¤é¤ì¤ë¼ÒÀâ¤ÎÂÐÌõ¥Æ¥­¥¹¥È¤òÍѤ¤¤Æ, ËÜ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Îɾ²Á¤ò¹Ô¤Ã¤¿. Ŭ¹çΨ/ÌÖÍåΨ¤Ë¤è¤ëɾ²Á¤Ç, 91¡Á96%¤ÎÀºÅÙ¤òÆÀ¤Æ¤¤¤ë. ËÜ¥×¥í¥°¥é¥à¤Î¾ÜºÙ, ¼Â¸³É¾²Á¤Î¾ÜºÙ¤Ï, ʸ¸¥(12)¡Á(14)¤ò»²¾È¤µ¤ì¤¿¤¤.




2.3 ÂÐÌõ¥³¡¼¥Ñ¥¹¹½Ã۴Ķ­: BACCS




2.3.1 ¥¢¥é¥¤¥á¥ó¥È¹½Ã۴Ķ­¤ÎɬÍ×À­

ËÜ¥¢¥é¥¤¥á¥ó¥È¼êË¡¤òÍѤ¤¤ë¤³¤È¤Ç, ¤¢¤ëÄøÅÙ¤ÎÀºÅÙ¤ÎʸÂбþ¤¬²Äǽ¤Ç¤¢¤ë¤¬, ¾ï¤Ë´°Á´¤ÊÂбþ´Ø·¸¤¬ÆÀ¤é¤ì¤ë¤È¤Ï¸Â¤é¤Ê¤¤. ¤è¤Ã¤Æ, ¹âÉʼÁ¤ÊÂçÎ̤ÎÂбþ¤Å¤±¤µ¤ì¤¿ÂÐÌõ¥³¡¼¥Ñ¥¹¤ÎÃßÀѤÏ, ¿Í´Ö¤Ë¤è¤ë³Îǧ/½¤Àµºî¶È¤¬É¬ÍפǤ¢¤ë. ¤Þ¤¿, ¤½¤Îºî¶È¤Ë¤è¤ë¥æ¡¼¥¶¤«¤é¤Î¥Õ¥£¡¼¥É¥Ð¥Ã¥¯¾ðÊó¤òÄ̤·¤Æ ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Î²þÎɤؤμ꤬¤«¤ê¤òÆÀ¤ë¤³¤È¤¬¤Ç¤­¤ë. ¤µ¤é¤Ë, ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤ÎÉû»ºÊª¤È¤·¤ÆÆÀ¤é¤ì¤ëÅý·×¾ðÊó¤Ë¤è¤ëÌõ¸ìÂФΤ¦¤Á, ¥æ¡¼¥¶¤¬³Îǧ¤·¤¿ºÆÍøÍѲÄǽ¤ÊÌõ¸ìÂФòÊݸ¤¹¤ë¤³¤È¤Ë¤è¤ê, ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤ÎÀºÅ٤θþ¾å¤â´üÂԤǤ­¤ë.




2.3.2 BACCS¤Î³µÍ×

ÂÐÌõ¥³¡¼¥Ñ¥¹¹½Ã۴Ķ­BACCS¤Ï, Xwindow¾å¤ÇTcl/Tk¤òÍѤ¤¤Æ¼Â¸½¤µ¤ì¤Æ¤ª¤ê, Á°½Ò¤·¤¿¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤Î¥¢¥é¥¤¥á¥ó¥È·ë²Ì¤ò¥Ý¥¤¥ó¥Æ¥£¥ó¥°¥Ç¥Ð¥¤¥¹¤Ç ÍÆ°×¤Ë½¤Àµ/³Îǧ¤¬¤Ç¤­¤ë¥¤¥ó¥¿¥Õ¥§¡¼¥¹¤òÄ󶡤¹¤ë. ¤Þ¤¿, ¥æ¡¼¥¶¤Ï, BACCS¤òÄ̤¸¤Æ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤ÇÆÀ¤é¤ì¤ëÌõ¸ìÂФÎÃæ¤«¤é ¥æ¡¼¥¶¼­½ñ¤ËÅÐÏ¿¤¹¤ëÌõ¸ìÂФòÁªÂò¤¹¤ë¤³¤È¤â²Äǽ¤Ç¤¢¤ë.

BACCS¤òÍѤ¤¤¿¥³¡¼¥Ñ¥¹ºîÀ®¼ê½ç¤Ï¼¡¤Î¤È¤ª¤ê¤Ç¤¢¤ë. ¤Þ¤º, ÂоݤȤʤëÂÐÌõ¥Æ¥­¥¹¥È¤òÁªÂò¤·, ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤òµ¯Æ°, ¤½¤Î·ë²Ì¤ò¥Ç¥£¥¹¥×¥ì¥¤¤Ëɽ¼¨¤¹¤ë. ¼¡¤Ë¥Þ¥¦¥¹Áàºî¤Ë¤è¤ê, ʸ¤ÎÂбþ¤Î³Îǧ/½¤Àµºî¶È¤ò¹Ô¤¦. ºÇ¸å¤Ë, ¥¢¥é¥¤¥á¥ó¥È¥×¥í¥°¥é¥à¤¬½ÐÎϤ¹¤ëÌõ¸ìÂФÎÃæ¤«¤éÀµ¤·¤¤Ìõ¸ìÂФòÁªÂò¤·, ¥æ¡¼¥¶¼­½ñ¤ËÅÐÏ¿¤¹¤ë. BACCS¤Îµ¯Æ°Îã¤ò¿Þ1¤Ë¼¨¤¹.

¿Þ1 BACCS¤Îµ¯Æ°²èÌÌ

¤³¤Î¤è¤¦¤Ê³Îǧ/½¤Àµºî¶È¤¬Íưפʥ¤¥ó¥¿¥Õ¥§¡¼¥¹¤Ë¤è¤Ã¤Æ, ¸úΨŪ¤ÊÂçÎÌÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛ¤¬²Äǽ¤È¤Ê¤Ã¤¿.




3 n-gramÅý·×½èÍý¥Ä¡¼¥ë




3.1 ÌÜŪ

Á°½Ò¤Î¼êË¡¤Ë¤è¤ê, Æü±Ñ¤Îʸ¤¬Âбþ¤Å¤±¤é¤ì¤¿ÂÐÌõ¥³¡¼¥Ñ¥¹¤ÏÂçÎ̤˺îÀ®¤Ç¤­¤ë¤è¤¦¤Ë¤Ê¤ë¤¬, ¥Ç¡¼¥¿¤¬ÂçÎ̤Ǥ¢¤ì¤Ð¤¢¤ë¤Û¤É, ¿Í´Ö¤¬, ¤½¤Î¤Þ¤Þ¥Ç¡¼¥¿¤ò»ÈÍѤ¹¤ë¤³¤È¤Ïº¤Æñ¤Ç¤¢¤ë. ¼«Á³¸À¸ì½èÍý¤ò¹âÀºÅ٤˹Ԥ¦¤¿¤á¤Ë¤Ï, ÂоÝʸ¤òʬÀϤ·, ʬÌîÆÃÍ­¤Îɽ¸½¤ò¥ë¡¼¥ë¤ä¥Ç¡¼¥¿¤È¤·¤ÆÃßÀѤ·¤Æ¤¤¤¯ºî¶È¤¬É¬ÍפȤʤë. µ¡³£ËÝÌõ¤Ç¤Ï, ¸Ä¿Í¤¬WWW¤òÆÉ¤à¤Ê¤É¤ÎÌÜŪ¤Ç¥·¥¹¥Æ¥à¤ò»ÈÍѤ¹¤ë¾ì¹ç, Ìõʸ¤Î°ÕÌ£¤¬Àµ¤·¤±¤ì¤Ð, ¼ÂÍѾåÂ礭¤ÊÌäÂê¤Ï¤Ê¤¤. ¤·¤«¤·¤Ê¤¬¤é, ËÝÌõ¤ò¶È̳¤È¤·¤Æ¹Ô¤Ã¤Æ¤¤¤ëÉô½ð¤Ë¤ª¤¤¤Æ¤Ï, ʸ½ñ¤âÀìÌçŪ¤ÊÆâÍÆ¤Ç, Ìõ¤¹¤Ù¤­¥¹¥¿¥¤¥ë¤ä¤¤¤¤²ó¤·¤¬Äê¤á¤é¤ì¤Æ¤¤¤ë¤³¤È¤¬Â¿¤¯, ñ¤ËËÝÌõ¤¬Àµ¤·¤¤¤À¤±¤Ç¤Ê¤¯, ¤½¤ÎÌõʸ¤¬¥¹¥¿¥¤¥ë¤Ë±è¤Ã¤Æ¤¤¤ë¤«, ½½Ê¬¤Ë¤³¤Ê¤ì¤¿Ê¸¾Ï¤Ç¤¢¤ë¤«¤É¤¦¤«¤¬ÌäÂê¤È¤µ¤ì¤ë. ¤¿¤È¤¨µ¡³£ËÝÌõ¤¬Àµ¤·¤¤Ìõ¤ò½Ð¤·¤¿¤È¤·¤Æ¤â, ¤½¤Îɽ¸½¤¬Ä̾ï¤Î¥¹¥¿¥¤¥ë¤ÈÂ礭¤¯¤«¤±Î¥¤ì¤Æ¤¤¤ì¤Ð, ·ë¶É¤½¤ÎÌõʸ¤Ï»ÈÍѤµ¤ì¤Ê¤¤·ë²Ì¤È¤Ê¤ë. ¤³¤Î¤è¤¦¤Ê»öÂÖ¤òÈò¤±¤ë¤¿¤á, ¤½¤ÎʬÌî¤Ë¤ª¤±¤ëÄ귿Ū¤Êɽ¸½¥Ñ¥¿¥ó¤òÁ°¤â¤Ã¤ÆÊ¬ÀϤ·, Âн褹¤ëɬÍפ¬¤¢¤ë. ¤·¤«¤·, ÂçÎ̤Υǡ¼¥¿¤«¤é¤³¤Î¤è¤¦¤ÊÄ귿Ū¥Ñ¥¿¥ó¤ò¿Í¼ê¤Ë¤è¤Ã¤ÆÃê½Ð¤¹¤ë¤Ë¤Ï ¿¤¯¤Î¥³¥¹¥È¤¬¤«¤«¤ë¤¿¤á, µ¡³£Åª¤ËÃê½Ð¤¹¤ë¼êÃʤȤ·¤Æ°Ê²¼¤Î¤è¤¦¤Ê¥Ä¡¼¥ë¤òºîÀ®¤·¤¿.




3.2 n-gramÅý·×½èÍý¤Ë¤è¤ëÄ귿Ūɽ¸½¤ÎÃê½Ð

¸À¸ì¥Ç¡¼¥¿¤ÎÃæ¤«¤é, »ÈÍÑÉÑÅ٤ι⤤¥Ñ¥¿¥ó¤òÃê½Ð¤¹¤ë¼êË¡¤È¤·¤Æ, ¥Æ¥­¥¹¥È¤ËÂФ·¤Æn-gramÅý·×½èÍý¤ò¹Ô¤¤, ¥Æ¥­¥¹¥È¥Ç¡¼¥¿Æâ¤Îʸ»úÎó¤ò¤½¤Îʸ»úĹ¤Î½ç¤ª¤è¤Ó½Ð¸½½çÅ٤νç¤ËÃê½Ð¤¹¤ë ¼êË¡¤¬¤¢¤ë(15). ¤·¤«¤·¤Ê¤¬¤é, Ãê½Ð¤¹¤ëʸ»úÎó´Ö¤ÎÁê¸ß´Ø·¸¤¬Ìµ»ë¤µ¤ì¤Æ¤¤¤ë¤¿¤á, ÃÇÊÒŪ¤Êʸ»úÎ󤬤«¤Ê¤ê¤Î³ä¹ç¤Çº®ºß¤¹¤ë¤È¤¤¤¦ÌäÂ꤬¤¢¤Ã¤¿. Ëܥġ¼¥ë¤Ç¤Ï¤³¤ÎÌäÂê¤ò²ò·è¤¹¤ë¤¿¤á, Ãê½Ð¤¹¤ëʸ»úÎó¤Î°ÌÃÖ¾ðÊó¤òµ­Ï¿¤·, Áê¸ß¤Ë½ÅÊ£¤¹¤ëʸ»úÎó¤ò½üµî¤·¤Æ¤¤¤ë. ¾ÜºÙ¤Ë¤Ä¤¤¤Æ¤Ïʸ¸¥(16)¤ò»²¾È¤µ¤ì¤¿¤¤. ½Åʣʸ»úÎó¤òºï½ü¤¹¤ë¤³¤È¤Ë¤è¤ê, ÃÇÊÒŪ¤Êʸ»úÎó¤ÎÃê½Ð¤¬ÍÞÀ©¤µ¤ì, Æü·Ð»º¶È¿·Ê¹3¥«·îʬ(892Ëü»ú)¤òÂоݤ˹Ԥ俼¸³¤Ç¤Ï, ½¾ÍèË¡¤ËÈæ³Ó¤·¤ÆÃê½Ð¤µ¤ì¤ëʸ»úÎó¤Î¼ïÎà¤Ï20¡Á1%¤Ë, ¤½¤ì¤é¤Î½Ò¤Ù½Ð¸½²ó¿ô¤Ï14¡Á1%ÄøÅ٤˺︺¤µ¤ì¤¿. Ï¢º¿¶¦µ¯É½¸½(Ϣ³¤·¤¿Ê¸»úÎó)¤ÎËܼêË¡¤È½¾ÍèË¡¤Ç¤ÎÃê½ÐÎã¤ò¿Þ2¤Ë¼¨¤¹.

¿Þ2 Ï¢º¿¶¦µ¯É½¸½¤ÎÃê½ÐÎã

¤Þ¤¿, ÃÇÊÒŪ¤Êʸ»úÎó¤ÎÃê½Ð¤òÍÞÀ©¤·, °ÌÃÖ¾ðÊó¤òµ­Ï¿¤·¤Æ¤¤¤ë¤³¤È¤Ë¤è¤ê, ¡Ö¡Á¤È¤·¤Ê¤¬¤é¤â¡Á¤È½Ò¤Ù¤¿¡×¤È¤¤¤Ã¤¿Î¥»¶¶¦µ¯É½¸½ (Î¥¤ì¤¿°ÌÃ֤˶¦µ¯¤¹¤ëÊ£¿ô¤Îʸ»úÎó)¤òÃê½Ð¤¹¤ë¤³¤È¤âƱ»þ¤Ë²Äǽ¤È¤Ê¤Ã¤¿. ÃåÌܤ¹¤ëÊ£¿ô¤Îɽ¸½¤¬1Ê¸Ãæ¤ËÎ¥¤ì¤Æ½Ð¸½¤¹¤ëÁȤΤ¹¤Ù¤Æ¤ò¥Õ¥¡¥¤¥ë¤ËÅǤ­½Ð¤·, ¥½¡¼¥È¤·¤¿¸å, Ʊ°ì¤ÎÁȤÎÉÑÅÙ¤òµá¤á¤ë¤â¤Î¤Ç¤¢¤ë. ¤³¤ÎÊýË¡¤Ç¤Ï, Ʊ°ìÊ¸Ãæ¤ËƱ°ì¤Îɽ¸½¤¬Ê£¿ô²ó½Ð¸½¤¹¤ë¾ì¹ç, ʸ·¿¤È¤ß¤Ê¤»¤Ê¤¤É½¸½¤ÎÁȤ¬Â¿¿ôÃê½Ð¤µ¤ì¤ë¤È¤¤¤¦ÌäÂ꤬¤¢¤Ã¤¿¤¬, °Ê²¼¤Î¤è¤¦¤Ê¾ò·ï¤òÀßÄꤷ, [¾ò·ï1]¤Ë½¾¤¦Ãê½ÐË¡(¼åÍÞÀ©·¿Î¥»¶¶¦µ¯É½¸½Ãê½ÐË¡), ¤È[¾ò·ï1, 2]¤ò¤È¤â¤ËËþ­¤¹¤ëÃê½ÐË¡(¶¯ÍÞÀ©·¿Î¥»¶¶¦µ¯É½¸½Ãê½ÐË¡)¤ò¹Ô¤¦¤³¤È¤Ë¤è¤Ã¤Æ ²ò·è¤·¤Æ¤¤¤ë(17).

[¾ò·ï1]
  ¶¦µ¯É½¸½¤ò¹½À®¤¹¤ëÍ×ÁǤϤª¸ß¤¤¤Ë¤¹¤Ù¤Æ°Û¤Ê¤ë.
[¾ò·ï2]
¸¶Ê¸Ãæ, ¶¦µ¯É½¸½¤ò¹½À®¤¹¤ëÀèÆ¬¤ÈËöÈø¤ÎÍ×ÁÇ´Ö¤Ë, ÃåÌÜÍ×ÁǤ¬2²ó°Ê¾å¸½¤ì¤Ê¤¤.

¤Þ¤¿, Ï¢º¿¶¦µ¯É½¸½¤ÎÃê½Ð¤Ë¤¢¤¿¤Ã¤Æ¤âÍÞÀ©¤Î¶¯¤µ¤ò²Ã¸º¤¹¤ë¤³¤È¤Ë¤è¤Ã¤Æ, ¤µ¤é¤ËÎ¥»¶¶¦µ¯É½¸½¤ÎÃê½Ð¤Ë¤ª¤¤¤Æ¸ú²ÌŪ¤Ç¤¢¤ë¤³¤È¤¬ ʬ¤«¤Ã¤¿(18)¤¿¤á, ÍÞÀ©¤ò¥Ñ¥é¥á¡¼¥¿¤È¤·¤ÆÍ¿¤¨¤ë¤³¤È¤¬¤Ç¤­¤ë¤è¤¦¤Ë¥Ä¡¼¥ë²½¤· ³Æ¼ï¤ÎÌÜŪ¤Ë¤¢¤Ã¤¿Ãê½Ð¤ò¹Ô¤¨¤ë¤è¤¦¤Ë¤·¤¿.




3.3 Ëܥġ¼¥ë¤Î±þÍÑ

Á°½Ò¤Î¼êË¡¤ò´ð¤Ë¤·¤Æ, ÆüËܷкѿ·Ê¹¼Ò¤Î¥ª¥ó¥é¥¤¥ó¥µ¡¼¥Ó¥¹¤ÇÄ󶡤µ¤ì¤Æ¤¤¤ë »Ô¶·Â®Êóµ­»ö¤ËÂФ·¤Æ, Äê·¿¥Ñ¥¿¥ó¤òÃê½Ð¤¹¤ëÊý¼°¤Ë¤Ä¤¤¤Æ¤Î¸¡Æ¤¤ò¹Ô¤Ã¤¿. ¤³¤Î»Ô¶·Â®Êó¥Ç¡¼¥¿¤Ë¤Ï»Ô¶·¾ðÊóÆÃÍ­¤Î¤¤¤¤²ó¤·¤ä, ·«¤êÊÖ¤·¸½¤ì¤ëÄ귿Ū¥Ñ¥¿¥ó¤¬ÉѽФ¹¤ë¤¿¤á, Ä귿Ū¥Ñ¥¿¥ó¤òÍøÍѤ·¤Æ¤Îµ¡³£ËÝÌõ¤ò¹Ô¤¦¤³¤È¤¬Å¬¤·¤Æ¤¤¤ë¤È¹Í¤¨¤é¤ì¤ë.

n-gramÅý·×½èÍý¤òÂоݵ­»ö¥Ç¡¼¥¿¤ËÂФ·¤Æ¤½¤Î¤Þ¤ÞŬÍѤ·¤¿¤È¤³¤í, °Ê²¼¤Î¤è¤¦¤ÊÌäÂêÅÀ¤¬¤¢¤ë¤³¤È¤¬Ê¬¤«¤Ã¤¿.

   ¡¦ ¿ô»ú¤Î¾ì¹ç, 1¤Ä¤Î¤Þ¤È¤Þ¤Ã¤¿¿ôÃͤ¬¤½¤ÎÉôʬʸ»úÎó¤ÇÃê½Ð¤µ¤ì¤ë.
Î㤨¤Ð, ¡È100¡É¤È¡È300¡É¤Î¤è¤¦¤Ê¿ô»ú¤Ç¤Ï¡È00¡É¤ÎÉôʬʸ»úÎó¤¬Ãê½Ð¤µ¤ì, ¤Þ¤¿, ¿ô¤¬°ã¤¦¤À¤±¤Ç¤Û¤ÜƱ¤¸·Á¼°¤ò»ý¤Ã¤¿É½¸½¤¬, Ê̤Îɽ¸½¤È¤·¤ÆÊ¬²ò¤µ¤ì¤Æ½¸·×¤µ¤ì¤ë¤¿¤áÉÑÅÙ¤¬¹â¤¯¤Ê¤é¤Ê¤¤.
¡¦ ¥Ç¡¼¥¿Æâ¤Î¸Çͭ̾»ì¤¬Ãê½Ð¤µ¤ì¤ë.
¤½¤Î¸Çͭ̾»ì¤¬Æ±Â°À­¤Ç¤¢¤Ã¤Æ¤â, ʸ»ú¤Î°ã¤¤¤Ë¤è¤êÊÌ¡¹¤ËÃê½Ð¤µ¤ì¤ë¤¿¤á, ÄãÉÑÅ٤榵¯É½¸½¤¬½Ð¤Æ¤¯¤ë.

¤³¤¦¤¤¤Ã¤¿É½µ­¥ì¥Ù¥ë¤Ç½èÍý¤ò¹Ô¤Ã¤Æ¤¤¤ë¤¿¤áȯÀ¸¤¹¤ëÌäÂê¤ò²ò·è¤¹¤ë¤¿¤á, ¤¢¤é¤«¤¸¤áÌäÂê¤È¤Ê¤ë¥Ç¡¼¥¿¤òÊ̤Îʸ»úÎó¤ÇÃÖ¤­´¹¤¨¤ë¤³¤È¤Ë¤è¤ê, Ãê½Ð¸ú²Ì¤ò¾å¤²¤ë¼êË¡(19)¤ò¹Í°Æ¤·, ¥Ä¡¼¥ë¥Ø¤ÎÁȹþ¤ß¤ò¹Ô¤Ã¤Æ¤¤¤ë. ¤Þ¤¿, ÃÖ¤­´¹¤¨¤ë¤Ù¤­¥Ç¡¼¥¿¤ò¼«Æ°Åª¤Ë¥°¥ë¡¼¥Ô¥ó¥°¤¹¤ë ¼êË¡(20)¤Ë¤Ä¤¤¤Æ¤â¸¡Æ¤¤ò¿Ê¤á¤Æ¤¤¤ë.

¤³¤ì¤é¤ÎÃÖ¤­´¹¤¨¤ò¹Ô¤¦¤³¤È¤Ë¤è¤Ã¤Æ, ¼ý½¸ÂоݤȤ¹¤ëÎ¥»¶¶¦µ¯É½¸½¤Î¼ý½¸ÈϰϤò³ÈÂ礹¤ë¤³¤È¤¬¤Ç¤­¤ë¤¿¤á, 1ʸ¤È¤¤¤¦ÈϰϤòͤ¨µ­»öÁ´ÂΤò¥Ñ¥¿¥ó²½¤¹¤ë»î¤ß¤â ²Äǽ¤È¤Ê¤Ã¤Æ¤¤¤ë(21).




4 ËÝÌõ¼­½ñ¤ÎºîÀ®»Ù±ç

n-gramÅý·×½èÍý¤Ë¤è¤Ã¤ÆÆÀ¤é¤ì¤¿Ï¢º¿¶¦µ¯É½¸½, ¤ª¤è¤Ó, Î¥»¶¶¦µ¯É½¸½¤ÏÍøÍѼԼ­½ñ¤ò¹½Ãۤˤª¤¤¤Æ, ¤½¤ì¤¾¤ì, ̾»ì¼­½ñ, ·ë¹ç²Á¥Ñ¥¿¥óÂм­½ñ¤òÅÐÏ¿¤¹¤ëºÝ¤Î´ðÁåǡ¼¥¿¤È¤·¤Æ»ÈÍѤ¹¤ë. Ãê½Ð¤µ¤ì¤¿Ã±¸ì¤ËÂФ·¤Æ, °Ê²¼¤Î¤è¤¦¤Ê¼êË¡¤Ë¤è¤Ã¤ÆÅÐÏ¿»Ù±ç¤ò¹Ô¤¦¤³¤È¤Ç, ¸úΨŪ¤Ë¼­½ñ¤ò¹½ÃÛ¤¹¤ë¤³¤È¤¬²Äǽ¤È¤Ê¤ë.




4.1 ̾»ì¤Î°Ọ̃°À­¤Î¼«Æ°¿äÄê




4.1.1 °Ọ̃°À­¿äÄê½èÍý¤ÎɬÍ×À­¤ÈÊýË¡

µ¡³£ËÝÂô¥·¥¹¥Æ¥à¤òÍøÍѤ·¤Æ¸½¼Â¤Îʸ½ñ¤òËÝÂô¤¹¤ë¾ì¹ç, ËÝÌõÂоݤ˹ç¤Ã¤¿ÍøÍѼԼ­½ñ¤¬É¬ÍפȤʤë. ÆÃ¤Ë, ¹âÉʼÁ¤ò¤Í¤é¤Ã¤¿µ¡³£ËÝÌõ¥·¥¹¥Æ¥à¤Ç¤Ï, ³ÆÃ±¸ì¤ËÂФ·¤Æ2,000¼ï°Ê¾å¤Îʬ²òÀºÅÙ¤ò»ý¤Äñ¸ì°Ọ̃°À­¤ÎÉÕÍ¿¤¬É¬ÍפǤ¢¤ë¤³¤È¤¬ ÃΤé¤ì¤Æ¤¤¤ë(22). °ìÈ̤ÎÍøÍѼԤ¬¤³¤Î¤è¤¦¤ÊÀºÌ©¤Ê¾ðÊó¤òÉÕÍ¿¤¹¤ë¤Î¤Ïº¤Æñ¤Ç¤¢¤ë. ¤½¤³¤Ç, ÍøÍѼԤ¬ÅÐÏ¿¤·¤¿¤¤ÆüËܸì̾»ì(Ê£¹ç¸ì¤ò´Þ¤à)¤È±Ñ¸ìÌõ¸ì¤òÍ¿¤¨¤ë¤À¤±¤Ç, ¥·¥¹¥Æ¥à¼­½ñ¤ÎÃ챤òÍøÍѤ·¤Æ, ̾»ì¼ïÊ̤ò¼«Æ°Åª¤ËȽÄꤷ, ¤½¤ì¤Ë±þ¤¸¤¿Ã±¸ì°Ọ̃°À­¤òÉÕÍ¿¤¹¤ëÊýË¡¤òÄ󰯤·, ½èÍý·Ï¤ò¼Â¸½¤·¤¿(23)(24).

ÍøÍѼÔÅÐÏ¿¸ì¤Îñ¸ì°Ọ̃°À­¤ò¿äÄꤹ¤ë¼ê½ç¤Ï, ¼ç̾»ì¤ÎȽÄê, ̾»ì¼ïÊÌ(¸Çͭ̾»ì, °ìÈÌ̾»ì¤ÎÊÌ)¤ÎȽÄê, ¸Çͭ̾»ì°Ọ̃°À­¤Î¿äÄê(¸Çͭ̾»ì¤Î¾ì¹ç), °ìÈÌ̾»ì°Ọ̃°À­¤Î¿äÄê(°ìÈÌ̾»ì, ¸Çͭ̾»ì¤ÎÁÐÊý)¤Î¼ê½ç¤«¤é¤Ê¤ë.

°ìÈÌ̾»ì°Ọ̃°À­¤Î¿äÄê¤Ï°Ê²¼¤Î¤è¤¦¤Ë¹Ô¤¦. Æü±ÑÂоݼ­½ñ¤ÎÌõ¸ì¤ÎÃæ¤Ë, ÍøÍѼÔÅÐÏ¿¸ì¤ÎÌõ¸ì¤È°ìÃפ¹¤ë¸ì¤¬¤¢¤ë¾ì¹ç, ¤½¤ÎÌõ¸ì¤ËÂбþ¤¹¤ë¸«½Ð¤·¸ì¤ÏÍøÍѼÔÅÐÏ¿¸ì¤Î¸«½Ð¤·¸ì¤ÈƱµÁ¸ì¤Ç¤¢¤ë¤È¹Í¤¨¤é¤ì¤ë¤Î¤Ç, ¤½¤Î°Ọ̃°À­¤ò¤½¤Î¤Þ¤ÞÍøÍѼÔÅÐÏ¿¸ì¤Î°Ọ̃°À­¤È¤¹¤ë. ÍøÍѼÔÅÐÏ¿¸ì¤ÎÌõ¸ì¤È°ìÃפ¹¤ëÌõ¸ì¤¬Æü±ÑÂоݼ­½ñ¤ÎÃæ¤Ë¤Ê¤¤¾ì¹ç¤Ï, ºÆÅÙ, Æü±ÑÂоݼ­½ñ¤ò¸¡º÷¤¹¤ë. ¤½¤ÎÃæ¤Ë, ÍøÍѼÔÅÐÏ¿¸ì¤Î¼ç̾»ì¤â¤·¤¯¤Ï¼ç̾»ì¤ò´Þ¤àÌõ¸ìÉôʬ¤¬¤¢¤ì¤Ð, ¤½¤Î°Ọ̃°À­¤òÍøÍѼÔÅÐÏ¿¸ì¤Î°Ọ̃°À­¤È¤¹¤ë. ¤¿¤À¤·, ¸ì·Á¤¬°Û¤Ê¤ë¾ì¹ç¤¬¤¢¤ë¤Î¤Ç, ¼ç̾»ì¤Ï²Äǽ¤Ê¸ì·ÁÊѲ½(ñ¿ôÊ£¿ô¤Ê¤É)¤ò¹Íθ¤·¤Ê¤¬¤é¾È¹ç¤¹¤ë.

Î㤨¤Ð, ɽ1¤Ë¤ª¤¤¤Æ, ÍøÍѼÔÅÐÏ¿¸ì¤Î¡Ö¼êÅö¤Æ¡×, ¡Ö°åΚפÏ, ¤½¤ÎÌõ¸ì(¤Þ¤¿¤Ï¼ç̾»ìÌõ¸ì)¡Ètreatment¡É¤¬¥·¥¹¥Æ¥à¼­½ñ¤Ë¤¢¤ë¤Î¤Ç, °Ọ̃°À­¤Ï¡Ô¼£ÎšդȿäÄꤵ¤ì¤ë.

ɽ1 °Ọ̃°À­¿äÄê¤ÎÎã
ÆüËܸ츫½Ð¤·±Ñ¸ìÌõñ¸ì°Ọ̃°À­
ÍøÍѼԼ­½ñ
¼£ÎÅcure¡Ô¼£ÎÅ¡Õ*
Êü¼ÍÀþ¼£ÎÅradiotherapy¡Ô¼£ÎÅ¡Õ*
¼êÅö¤Ætreatment¡Ô¼£ÎÅ¡Õ*
°åÎÅmedical treatment¡Ô¼£ÎÅ¡Õ*
¥·¥¹¥Æ¥à¼­½ñ(»²¾È¤·¤¿¹àÌÜ)
¼£ÎÅtreatment¡Ô¼£ÎÅ¡Õ
* ¿äÄꤵ¤ì¤¿Ã±¸ì°Ọ̃°À­.




4.1.2 °Ọ̃°À­¿äÄê¤Î¸ú²Ì

¿·Ê¹µ­»ö102ʸ¤È¥½¥Õ¥È¥¦¥§¥¢Àß·×½ñ105ʸ¤ÎËÝÌõ¤ËɬÍפÊÍøÍѼԼ­½ñ¤ò, ËÜÊý¼°¤òÍѤ¤¤Æ°Ọ̃°À­¤ò¿äÄꤹ¤ë¤³¤È¤Ë¤è¤êºîÀ®¤·¤¿. ¤½¤ÎÍøÍѼԼ­½ñ¤ò»ÈÍѤ·¤¿ËÝÌõ¼Â¸³¤Ç¤Ï, °Ọ̃°À­¤òÉÕÍ¿¤·¤Ê¤«¤Ã¤¿¾ì¹ç¤ËÈæ¤Ù¤ë¤ÈÌõʸ¹ç³ÊΨ¤Ï6¡Á13%¸þ¾å¤·, °ìÈÌÍøÍѼԤ¬ÉÕÍ¿¤·¤¿¾ì¹ç(¿Í¼ê¤Ë¤è¤ëÉÕÍ¿)¤ÈƱÅù¤ÎÉʼÁ¤¬ÆÀ¤é¤ì¤ë¤³¤È¤¬Ê¬¤«¤Ã¤¿. ¤³¤ÎÉʼÁ¤Ï, ¥¢¥Ê¥ê¥¹¥È¤¬¿ô²ó¼Â¸³¤ò¹Ô¤Ã¤ÆºÇŬ¤Ê°Ọ̃°À­¤òÉÕÍ¿¤·¤¿ ÍøÍѼԼ­½ñ¤ò»ÈÍѤ¹¤ë¾ì¹ç¤ËÈæ¤Ù¤Æ¤â2¡Á3%¤·¤«Äã²¼¤·¤Ê¤¤ÃͤǤ¢¤ê, ºÇŬ°Ọ̃°À­¤ò·èÄꤹ¤ë¤¿¤á¤ËÍפ¹¤ë¥³¥¹¥È¤ò¹Í¤¨¤ë¤È, ½½Ê¬¤ËËþ­¤Ç¤­¤ëÃͤǤ¢¤ë(ɽ2).

ɽ2 ÌõʬÉʼÁ¤ÎÈæ³Óɾ²Á
°Ọ̃°À­ÉÕÍ¿Êý¼°Ìõʸ¹ç³ÊΨ
¿·Ê¹µ­»öÀß·×½ñ
ÉÕÍ¿¤Ê¤·56.7%65.7%
¼«Æ°¿äÄêÊý¼°69.6%71.4%
¿Í¼ê¤Ë¤è¤ëÉÕÍ¿71.5%71.4%
ºÇŬ°Ọ̃°À­72.5%73.3%




4.2 ·ë¹ç²Á¥Ñ¥¿¥óÂФÎȾ¼«Æ°ºîÀ®




4.2.1 ÇØ·Ê

µ¡³£ËÝÌõ¤ÎÉʼÁ¸þ¾å¤Ë¤Ï, ư»ì¤È̾»ì¤Î°Ọ̃Ū¶¦µ¯¤ò·ë¹ç²Á¥Ñ¥¿¥ó¤È¤·¤Æµ­½Ò¤·, ¤½¤ì¤ò¸¶¸À¸ì¤ÈÌÜŪ¸À¸ì¤ÇÂФˤ·¤¿·ë¹ç²Á¥Ñ¥¿¥óÂФλÈÍѤ¬ Í­¸ú¤Ç¤¢¤ë(22). ½¾Íè¤Ï, ¿Í¼ê¤Ë¤è¤ê1·ï¤º¤Ä¥Ñ¥¿¥óÂФòµ­½Ò¤¹¤ë¤³¤È¤Ë¤è¤ê, Ìó13,000·ï¤Îµ¬ÌϤι½Ê¸¼­½ñ¤ò¹½ÃÛ¤·¤Æ¤­¤¿. ɾ²Á¼Â¸³¤Ë¤è¤ê, Æü±ÑËÝÌõ¤ËɬÍפʥѥ¿¥óÂФÏ25,000·ïÄøÅ٤Ǥ¢¤ë¤³¤È, ÆÃ¤Ëϸì·Ï¤Îư»ì¡¦·ÁÍÆ»ì¤Î¥Ñ¥¿¥óÂФ¬ÉÔ­¤·¤Æ¤¤¤ë¤³¤È¤Ê¤É¤¬ ʬ¤«¤Ã¤¿(25). 10,000·ï°Ê¾å¤Î¥Ñ¥¿¥óÂФò¼ý½¸¤¹¤ë¤¦¤¨¤Ç, ¥Ñ¥¿¥óÂкîÀ®¤Î¸úΨ²½¤¬ÉԲķç¤Ç¤¢¤ë.




4.2.2 ºîÀ®»Ù±ç¤ÎÊýË¡¤È¸ú²Ì

·ë¹ç²Á¥Ñ¥¿¥óÂФˤÏ¿ÍѤµ¤ì¤ë¹ü³Ê¹½Â¤¤¬¤¢¤ê, »ÈÍÑÉÑÅ٤ξå°Ì9°Ì¤Þ¤Ç¤Ç70%, ¾å°Ì17°Ì¤Þ¤Ç¤Ç80%¤Î¥Ñ¥¿¥óÂФ¬µ­½Ò¤Ç¤­¤ë¤³¤È¤¬ ÃΤé¤ì¤Æ¤¤¤ë(26). ¤³¤ÎÅý·×ŪÀ­¼Á¤Ë´ð¤Å¤¤¤Æ, Æü±ÑËÝÂô¥·¥¹¥Æ¥à¤ÎÆüËÜʸ²òÀϤÈ, ¥Ñ¥¿¥óÂФιü³Ê¹½Â¤¤Î¾å°Ì¤«¤é½ç¤Ë¾È¹ç¤ò¹Ô¤¦´Êñ¤Ê±Ñʸ²òÀϤòÁȤ߹ç¤ï¤»¤ë¤³¤È¤Ë¤è¤ê ¥Ñ¥¿¥óÂкîÀ®»Ù±ç½èÍý¤ò¼Â¸½¤·¤¿(27).

Î㤨¤Ð, ¡ÖÈà½÷¤Ï¤µ¤Ã¤ÈÌîºÚ¤Î¿å¤òÀڤä¿. ¡×¤È ¡ÈShe shook the water off the vegetables.¡É¤ÎÂÐÌõÍÑÎãʸ¤ËÂФ·¤Æ¤Ï¼¡¤Î¤è¤¦¤Ëưºî¤¹¤ë.

   ­¡ ÆüËܸì²òÀϤˤè¤ê¡ÖÈà½÷+¤¬/¤µ¤Ã¤È/ÌîºÚ+¤Î/¿å+¤ò/ÀÚ¤ë+¤¿¡× ¤È¤¤¤¦·ÁÂÖÁÇ¡¦¹½Ê¸¾ðÊó¤òÆÀ¤ë.
­¢ ±Ñ¸ì²òÀϤȤ·¤Æ¤ÏÉÊ»ì¤òÉÕÍ¿¤·¤¿¸å, ¹ü³Ê¹½Â¤¤È¤Î¾È¹ç¤Ë¤è¤ê¡ÈX shake Y off Z¡É¤È¤¤¤¦¥Ñ¥¿¥ó¸õÊä¤òÆÀ¤ë.
­£ ÂÐÌõ¼­½ñ¤ò»²¾È¤·¤ÆÎ¾¼Ô¤ò¾È¹ç¤·, X ¡á¡Èshe¡É¡á¡ÖÈà½÷¡×, Y ¡á¡Èthe water¡É¡á¡Ö¿å¡×, Z ¡á¡Èthe vegetable¡É¡á¡ÖÌîºÚ¡×¤ÎÂбþ¤òÃΤë.
­¤ °Ê¾å¤«¤é, ¡ÖN1¤¬/N2¤Î/N3¤ò/ÀÚ¤ë¡× ¡ÈN1 shake N3 off N2¡É¤È¤¤¤¦·ë¹ç²Á¥Ñ¥¿¥óÂиõÊ䤬ÆÀ¤é¤ì¤ë.

¤³¤Î¸å, ¥¢¥Ê¥ê¥¹¥È¤¬, µ­½ÒÍ×ÁǤβáÉÔ­¤ò¥Á¥§¥Ã¥¯¤·¤¿¤ê, N1¤Î¾ò·ï¤ò¡ÖÈà½÷¡×¤ÎÂå¤ê¤Ë°Ọ̃°À­¤Î¡ã¿Í¡ä, N2¤ò¡ÖÌîºÚ¡×¤ÎÂå¤ê¤Ë°Ọ̃°À­¤Î¡ã¶ñÂÎʪ¡ä¤Ê¤É¤È ¾ò·ï»ØÄê¤òÈÆÍѲ½¤·¤¿¤ê¤¹¤ë¤³¤È¤Ë¤è¤ê¥Ñ¥¿¥óÂФ¬´°À®¤¹¤ë.

¤³¤Î»Ù±ç½èÍý¤Ë¤è¤ê, ¿Í¼êºî¶È¤Î¾ì¹ç¤ËÈæ¤Ù¤Æ1·ïÅö¤ê¤ÎºîÀ®»þ´Ö¤¬ Ìó1/6¤Ëû½Ì¤µ¤ì¤¿¤Û¤«(27), »Ô¶·Â®Êóʸ¤ÎÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¤è¤¦¤ÊÀìÌçʬÌî¤ËÂФ·¤Æ¤â, ¤½¤ÎʬÌî¤Ë¤¢¤Þ¤ê½¬½Ï¤·¤Æ¤¤¤Ê¤¤ºî¶È¼Ô¤¬, °ìÈÌŪ¤Ê¥Ñ¥¿¥óÂФκîÀ®»þ´Ö¤ÈÆ±ÄøÅ٤λþ´Ö¤Ç¥Ñ¥¿¥óÂФòºîÀ®¤¹¤ë¤³¤È¤¬¤Ç¤­, »Ù±ç½èÍý¤ÎÍ­¸úÀ­¤¬³Îǧ¤µ¤ì¤¿(28).




5 ¤¢¤È¤¬¤­

ËÜÏÀʸ¤Ç¤Ï, µ¡³£ËÝÌõ¥·¥¹¥Æ¥à¤Î¼­½ñ¤ä¥ë¡¼¥ë¤ò²þÎɤ¹¤ëºÝ¤ËɬÍפȤʤë Â絬ÌϤÊÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛÊýË¡¤È, n-gramÅý·×¤ò±þÍѤ·¤¿¥³¡¼¥Ñ¥¹¤Î¸úΨŪ¤ÊʬÀÏÊýË¡¤ò¼¨¤·¤¿. ¤Þ¤¿, ÆÀ¤é¤ì¤¿ÂÐÌõ¥Ç¡¼¥¿¤«¤é¼­½ñ¤òºîÀ®¤¹¤ë¤¿¤á¤Î»Ù±ç½èÍý¤ò¾Ò²ð¤·¤¿.

¤³¤ì¤é°ìÏ¢¤Î½èÍý¤òÂçÎ̤Υƥ­¥¹¥È¥Ç¡¼¥¿¤ËŬÍѤ¹¤ë¤³¤È¤Ë¤è¤ê, µ¡³£ËÝÌõ¥·¥¹¥Æ¥à¤Î´ðËÜŪ¤Ê¼­½ñ¤ä¥ë¡¼¥ë¤ò¸úΨ¤è¤¯¼ý½¸¤¹¤ë¤³¤È¤¬²Äǽ¤È¤Ê¤ë. ¤Þ¤¿, µ¡³£ËÝÌõ¥·¥¹¥Æ¥à¤ò¼ÂÍѲ½¤·¤Æ¤¤¤¯¤¦¤¨¤Ç¤Ï, ÂоݤȤʤëʬÌî¤ò¸ÂÄꤷ, ¤½¤ì¤¾¤ì¤ÎʬÌî¤Ë¸«¹ç¤Ã¤¿ÀìÌçŪ¼­½ñ, ¥ë¡¼¥ë¤ò¹½ÃÛ¤¹¤ë¤³¤È¤¬É¬ÍפȤʤ뤬, ¤³¤ì¤é¤Î»Ù±ç¥·¥¹¥Æ¥à¤Ë¤è¤ê, ÂоÝʬÌî¤òÀÚ¤êÂØ¤¨¤ëºÝ¤Î¥³¥¹¥È¤âºï¸º¤¹¤ë¤³¤È¤¬¤Ç¤­¤ë.

¸½ºß, ¤¤¤º¤ì¤Î½èÍý¤ËÂФ·¤Æ¤â, ½èÍýÀºÅÙ¤äÁàºîÀ­¤Î¸þ¾å¤òÌܻؤ·¤Æ²þÎɤò¿Ê¤á¤Æ¤¤¤ë.




ʸ¸¥

(1)
I. Dagan and K. Church: ¡ÈTermight identifying and translating technical terminology¡É, In Proc. of 4th Conference on Applied Natural Language Processing (ANLP), pp.34-40, 1994.

(2)
F. Smadja and K. McKeown: ¡ÈTranslating collocations for use in bilingual lexicons¡É, ARPA Human Language Technology Workshop 94, pp. 152-156, 1994.

(3)
P. Fung and K. Church: K-vec: ¡ÈA new approach for aligning parallel texts. In Proc. of 15th COLING¡É, pp.1096-1102, 1994.

(4)
M. Kay and M. Roscheisen: ¡ÈText-translation alignment¡É, Computational Linguistics, 19, No.1, pp.121-142, 1993.

(5)
J. C. Lai, P. F. Brown and R. L. Mercer: ¡ÈAligning sentences in parallel corpora¡É, In Proc. of 29th Annual Meeting of ACL, pp.169-176, 1991.

(6)
±§ÄÅϤ¡¦¾¾ËÜ: ¡ÈÂÐÌõ¼­½ñ¤ª¤è¤ÓÅý·×¾ðÊó¤òÍѤ¤¤¿Æó¸À¸ìÂÐÌõ¥Æ¥­¥¹¥È¾È¹ç, ¥³¥ó¥Ô¥å¡¼¥¿¥½¥Õ¥È¥¦¥§¥¢¡É, 12, No.5, pp.12-21, 1995.

(7)
Çò°æ¡¦Æ£ÇÈ¡¦ÃÓ¸¶¡¦¾åÅÄ¡¦°æ¾å: ¡È¿·Ê¹µ­»öÆü±ÑÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛ(1) -´ðËܹ½Áۤȸ¡Æ¤²ÝÂê-¡É, Åŵ¤´Ø·¸³Ø²ñ¶å½£»ÙÉôÂè48²óÏ¢¹çÂç²ñ, 1373, p.855, 1995.

(8)
Çò°æ¡¦¾åÅÄ¡¦°¤Éô¡¦Æ£ÇÈ¡¦ÃÓ¸¶: ¡È¿·Ê¹µ­»öÆü±ÑÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛ(2) -ÊÂΩDB ¤«¤é¼èÆÀ¤·¤¿µ­»ö¤ÎÂбþ¤Å¤±-¡É, Åŵ¤´Ø·¸³Ø²ñ¶å½£»ÙÉôÂè48²óÏ¢¹çÂç²ñ, 1374, p.856, 1995.

(9)
¹â¶¶¡¦Çò°æ¡¦Æ£ÇÈ¡¦ÃÓ¸¶¡¦¾åÅÄ¡¦¾¾Åç: ¡ÈDB ¤«¤éÃê½Ð¤·¤¿Æü±Ñ¿·Ê¹µ­»ö¤Î¼«Æ°Âбþ¤Å¤±¡É, ¸À¸ì½èÍý³Ø²ñÂè2²óǯ¼¡Âç²ñ, B3-3, pp.201-204, 1996.

(10)
Çò°æ¡¦¾¾Èø¡¦À¥²¼¡¦Æ£ÇÈ¡¦ÃÓ¸¶: ¡È¿·Ê¹µ­»öÆü±ÑÂÐÌõ¥³¡¼¥Ñ¥¹¤Î¹½ÃÛ(3) -µ­»ö¤ÎÆÃħʬÀϤÈʸ¤ÎÂбþ´Ø·¸¤Î¸¡Æ¤-¡É, Åŵ¤´Ø·¸³Ø²ñ¶å½£»ÙÉôÂè48²óÏ¢¹çÂç²ñ, 1375, p.857, 1995.

(11)
¹â¶¶¡¦Çò°æ¡¦Æ£ÇÈ¡¦ÃÓ¸¶¡¦¾åÅÄ¡¦¾¾Åç: ¡ÈÆü±Ñ¿·Ê¹µ­»ö¤Î¼«Æ°µ­»öÂбþ¤Å¤±¡É, ¾ðÊó½èÍý³Ø²ñ¸¦µæÊó¹ð, 96-NL-114-9, pp.55-62, 1996.

(12)
½ÕÌî: ¡È¼­½ñ¤ÈÅý·×¤òÍѤ¤¤¿ÂÐÌõ¥¢¥é¥¤¥á¥ó¥È¡É, ¾ðÊó½èÍý³Ø²ñÏÀʸ»ï, 38, No.4, pp.719-726, 1996.

(13)
M. Haruno and T. Yamazaki: ¡ÈHigh-Peformance Bilingual Text Alignment Using Statistical and Dictionary Information¡É, In Proc. of 34th Annual Meeting of ACL, pp.131-138, 1990.

(14)
M. Haruno, S. Ikehara and T. Yamazaki: ¡ÈLearning Bilingual Collocations by Word-level Sorting¡É, In Proc. of 16th International Conference on Computational Linguistics (COLING96), 1, pp. 525-530, 1996.

(15)
M. Nagao and S. Mori: ¡ÈNew Method of N-gram Statistics for Large Number of n and Automatic Extraction of Words and Phrases from Large Text Data of Japanese¡É, In Proc. of 15th COLING, pp.611-615, 1994.

(16)
ÃÓ¸¶¡¦Çò°æ¡¦²Ï²¬: ¡ÈÂ絬ÌÏÆüËܸ쥳¡¼¥Ñ¥¹¤«¤é¤ÎÏ¢º¿·¿¤ª¤è¤ÓÎ¥»¶·¿¤Î¶¦µ¯É½¸½¤Î¼«Æ°Ãê½ÐË¡¡É, ¾ðÊó½èÍý³Ø²ñÏÀʸ»ï, 36, No.11, pp.2584-2596, 1995.

(17)
ÃÓ¸¶¡¦Çò°æ: ¡ÈÂ絬ÌÏÆüËܸì¥Ç¡¼¥¿¤«¤é¤ÎÎ¥»¶·¿¶¦µ¯É½¸½¤Î¼«Æ°Ãê½Ð¡É, Ê¿À®7ǯÅŵ¤´Ø·¸³Ø²ñ´ØÀ¾»ÙÉôÏ¢¹çÂç²ñ, G14-2, p.G369, 1995.

(18)
ÆâÌÃÓ¸¶¡¦Çò°æ: ¡È¼åÍÞÀ©¤Ë¤è¤ëÏ¢º¿¶¦µ¯É½¸½¤ÎÃê½Ð¤È¤½¤ì¤Ë´ð¤Å¤¯Î¥»¶¶¦µ¯É½¸½¤ÎÃê½Ð¡É, ¸À¸ì½èÍý³Ø²ñÂè2²óǯ¼¡Âç¹ç, B6-4, pp. 257-260, 1996.

(19)
ÆâÌÇò°æ¡¦ÃÓ¸¶¡¦¿·Åĸ«: ¡ÈÃÖ´¹¤¨¤òÍѤ¤¤¿n-gram¤Ë¤è¤ë¸À¸ìɽ¸½¤ÎÃê½Ð¡É, ¾ðÊó½èÍý³Ø²ñ¸¦µæÊó¹ð, 96-NL-114-10, pp.63-68, 1996.

(20)
ÆâÌÇò°æ¡¦ÃÓ¸¶: ¡ÈÎ¥»¶¶¦µ¯É½¸½¥Ç¡¼¥¿¤òÍѤ¤¤¿Ã±¸ì¤Î¥°¥ë¡¼¥Ô¥ó¥°¡É, ¸À¸ì½èÍý³Ø²ñÂè3²óǯ¼¡Âç²ñ, C1-7, pp.107-110, 1997.

(21)
ÆâÌÇò°æ¡¦ÃÓ¸¶: ¡ÈËÝÌõ¥Æ¥ó¥×¥ì¡¼¥È¤Î¼«Æ°Ãê½Ð¡É, ¾ðÊó½èÍý³Ø²ñÂè53²óÁ´¹ñÂç²ñ, 2, 4L-5, pp.61-62, 1996.

(22)
ÃÓ¸¶¡¦µÜºê¡¦²£Èø: ¡ÈÆü±Ñµ¡³£ËÝÂô¤Î¤¿¤á¤Î°ÕÌ£²òÀÏÍѤÎÃμ±¤È¤½¤Îʬ²òǽ¡É, ¾ðÊó½èÍý³Ø²ñÏÀʸ»ï, 34, No.8, pp.1692-1704, 1993.

(23)
S. Ikehara, S. Shirai, A. Yokoo, F. Bond and Y. Omi: ¡ÈAutomatic acquisition of semantic attributes for user defined words in Japanese-to-English machine translation¡É, In Proc. of ANLP'94, pp.184-185, 1994.

(24)
ÃÓ¸¶¡¦Çò°æ¡¦²£Èø¡¦F. Bond¡¦¾®¸«: ¡ÈÆü±Ñµ¡³£ËÝÌõ¤Ë¤ª¤±¤ëÍøÍѼÔÅÐÎиì¤Î°Ọ̃°À­¤Î¼«Æ°¿äÄê¡É, ¼«Á³¸À¸ì½èÍý, 2, No.1, pp.3-17, 1995.

(25)
S. Shirai, S. Ikehara. A. Yokoo and H. Tnoue: ¡ÈThe quantity of valency pattern pairs required for Japanese to English machine translation and their compilation¡É, In Proc. of Natural Language Processing Pacific Rim Symposium (NLPRS'95), pp.443-448, 1995.

(26)
²£Èø¡¦Ãæ´ä¡¦Çò°æ¡¦ÃÓ¸¶: ¡ÈÆü±Ñµ¡³£ËÝÌõÍÑ¥¹¥±¥ë¥È¥ó-¥Õ¥ì¥Ã¥·¥å·¿¹½Ê¸°ÕÌ£¼­½ñ¤Î¹½À®¡É, ¾ðÊó½èÍý³Ø²ñÂè48²óÁ´¹ñÂç²ñ, 6Q-8, 3, pp.139-140, 1994.

(27)
Çò°æ¡¦¾åÅÄ¡¦Ê¼Æ£¡¦²£Èø¡¦ÃÓ¸¶: ¡ÈÆü±Ñµ¡³£ËÝÂô¤Î¤¿¤á¤Î·ë¹ç²Á¥Ñ¥¿¥óÂФκîÀ®»Ù±ç½èÍý¡É, ÅŻҾðÊóÄÌ¿®³Ø²ñµ»½Ñ¸¦µæÊó¹ð, NLC96-34, pp.25-30, 1996.

(28)
Çò°æ¡¦°æ¾å¡¦°æÅÄÁÒ¡¦ÃÓ¸¶¡¦²£Èø: ¡ÈÀìÌçʬÌîÂбþ¤ÎÆü±Ñµ¡³£ËÝÌõÍѹ½Ê¸°ÕÌ£¼­½ñ¤Î¹½ÃÛ¡É, ¸À¸ì½èÍý³Ø²ñÂè2²óǯ¼¡Âç²ñ, A1-4, pp.13-16, 1996.



ÆâÌî °ì
¥³¥ß¥å¥Ë¥±¡¼¥·¥ç¥ó²Ê³Ø¸¦µæ½ê¸¦µæ¼çǤ
Ê¿À®¸µÇ¯Æþ¼Ò. ¼ç¤Ëµ¡³£ËÝÌõ½èÍý¤Î¸¦µæ¤Ë½¾»ö. ¸½ºß, Åý·×½èÍý¤Ë¤è¤ë¥Æ¥­¥¹¥È¤«¤é¤ÎÄê·¿¥Ñ¥¿¥óÃê½Ð¤Î¸¦µæ¤Ë½¾»ö.
¾¼ÏÂ62ǯ°ñ¾ëÂç³Ø¹©³ØÉô¾ðÊ󹩳زÊ´¶È. Ê¿À®¸µÇ¯Æ±Âç³Ø±¡¹©³Ø¸¦µæ²Ê½¤»Î²ÝÄø½¤Î».
¾ðÊó½èÍý³Ø²ñ¡¦¸À¸ì½èÍý³Ø²ñ²ñ°÷.
½ÕÌî ²íɧ ATR¿Í´Ö¾ðÊóÄÌ¿®¸¦µæ½ê¸¦µæ°÷(Á°¥³¥ß¥å¥Ë¥±¡¼¥·¥ç¥ó²Ê³Ø¸¦µæ½ê)
Ê¿À®5ǯÆþ¼Ò. ¼«Á³¸À¸ì½èÍý, µ¡³£³Ø½¬¤Î¸¦µæ¤Ë½¾»ö.
Ê¿À®3ǯµþÅÔÂç³Ø¹©³ØÉôÅŵ¤¹©³ØÂè2³Ø²Ê´¶È. 5ǯƱÂç³Ø±¡½¤»Î²ÝÄø½¤Î».
¾ðÊó½èÍý³Ø²ñ¡¦¸À¸ì½èÍý³Ø²ñ¡¦Society for Neuroscience ALL²ñ°÷.
¹â¶¶ ÂçÏÂ
¥³¥ß¥å¥Ë¥±¡¼¥·¥ç¥óÎÁ³Ø¸¦µæ½ê¼Ò°÷
Ê¿À®6ǯÆþ¼Ò. ¼ç¤Ëµ¡³£ËÝÌõ¤Î¸¦µæ¤Ë½¾»ö.
Ê¿À®4ǯ¿·³ãÂç³Ø¹©³ØÉô¾ðÊ󹩳زÊ´¶È. 6ǯƱÂç³Ø±¡¼«Á³²Ê³Ø¸¦µæ²Ê½¤»Î²ÝÄø½¤Î».
¾ðÊó½èÍý³Ø²ñ¡¦¸À¸ì½èÍý³Ø²ñ²ñ°÷.
Çò°æ Í¡
¥³¥ß¥å¥Ë¥±¡¼¥·¥ç¥ó²Ê³Ø¸¦µæ½ê¼ç´´¸¦µæ°÷
¾¼Íø55ǯÆþ¼Ò. ¼ç¤Ë¼«Á³¸À¸ì½èÍý(ÆÃ¤ËÆüËܸì½èÍý)¤Î¸¦µæ¤Ë½¾»ö. ¸½ºß¡¢µ¡³£ËÝÌõµ»½Ñ¤Î¸¦µæ¤Ë½¾»ö.
¾¼ÏÂ53ǯÂçºåÂç³Ø¹©³ØÉôÄÌ¿®¹©³Ø²Ê´¶È. 55ǯƱÂç³Ø±¡¹©³Ø¸¦µæ²ÊÄÌ¿®¹©³ØÀì¹¶Çî»ÎÁ°´ü²ÝÄø½¤Î».
ÅŻҾðÊóÄÌ¿®³Ø²ñ¡¦¾ðÊó½èÍý³Ø²ñ¡¦¸À¸ì½èÍý³Ø²ñ²ñ°÷.
Ê¿À®7ǯÂè30²óÆüËܲʳص»½Ñ¾ðÊ󥻥󥿡¼¾Þ(³Ø½Ñ¾Þ), ¿Í¹©ÃÎǽ³Ø²ñ1994ǯÅÙÏÀʸ¾Þ¼õ¾Þ.