Direct Parse Tree Translation in cooperation with the Transfer Method

Yoshihiro MATSUO and Satoshi SHIRAI and Akio YOKOO and Satoru IKEHARA

NTT Communication Science Laboratories
1-2356 Take, Yokosuka-shi, Kanagawa, 238-03 JAPAN
{yosihiro,shirai,ayokoo,ikehara}@nttkb.ntt.jp


Abstract

This paper proposes a cooperative translation method that combines direct parse tree translation with the transfer method implemented in the Japanese-to-English machine translation system ALT-J/E. Direct parse tree translation uses pattern matching between a dependency analyzed Japanese input sentence and parse tree patterns in a parse tree dictionary. Ordinary transfer methods have a problem in that input sentences are split up too finely, utilizing this pattern matching prior to the split process helps eliminate unnecessary splits. It is shown that the nodes in the patterns should be treated as three different types in order to translate the variable parts in the pattern by the transfer method.



Keywords:

Direct parse tree translation, Hybrid approach.



[ In Proceedings of NeMLaP, pp.144-149 (September, 1994). ]



INDEX

     Introduction
Translation system overview
Matching and Division process
Combining the 'trunk' and the 'leaves'
How to treat free elements
Applicability
Conclusion
  Acknowledgments
  References



Introduction

Up till now, most machine translation systems have been based on the transfer method, where translation is conducted in several phases such as morphological analysis, syntactic analysis, semantic analysis, transfer, and generation. The semantic analysis and transfer phases mainly process words while the generation process combines the translations. Although phrase transfer rules using syntactic information have also been used, such rules, which are described without strictly defined semantic attributes, sometimes causes unexpected side effects. When the sentence is divided into the smaller units that are required by the semantic analyzer, if there is excessive division the meaning of the input sentences can be changed.

Recently, example-based (Nagao, 1992; Furuse and Iida, 1992) , knowledge-based (Tsujii and Ananiadou, 1993; Nirenburg and Levin, 1991) and pattern-based (Katoh and Uratani, 1992; Maruyama, 1993) translation systems have been proposed to replace the transfer method. Since these methods translate without excessive division, the meaning in the input sentence structure is conserved. However, there are problems with all these methods. The example-based method can not readily calculate the similarity between input sentences and the exarnpIes also translation fails Without sufficiently similar exampIls. It is not clear how to use and build a knowledge-base for knowledge-based translation. Pattern-based translation with morpheme patterns cannot be applted to free word order languages.

In order to conserve the meaning of an input sentence, translation units that cannot be divided prior to semantic analysis needed to be appropriately defined. The Japanese to EngliSh translation system ALT-J /E, based on the multi-level translation method (Ikehara, 1989), was developed to reduce unexpected division. This method employs three levels of abstraction for translating unit sentences1, and each level has its own transfer rules2. The least abstract rules are applied first. Other expressions such as noun phrases or adverbial phrases are translated based on a semantic analysis.

Since the three levels of rules for unit sentences require units which contain one verb and some case elements and the semantic analyzers for noun or adverbial phrases require appropriate structures, the muIti-level transIation method must divide input sentences into several unitsThis makes it difficult to handle Japanese verbs with wide ranging meanings such as naru or suru which are used frequently (similar to "do" or "make" ) and English special expressions such as subjunctives, because the information needed for translation is often scattered between the units, In order to handle such scattered information, we have introduced direct parse tree translation into ALT-J/E. The direct parse tree translation uses pattern matching between the parse tree of an input sentence and parse trees in a dictionary. Since this is processed prior to the division process, information from the entire parse tree can be utilized.

It is possible to build a translation system using only this method, but such a system would have the following disadvantages.

Semantic analysis-based translation does not have these disadvantages. In order to integrate the two methods, direct parse tree translation is mainly used to handle long range dependency on the parse tree, which is difficult to treat With ordinary semantic analysis. This paper proposes a method of cooperative translation using both direct parse tree translation and the transfer method implemented in the Japanese-to-English machine translation system, ALT-J/E.




Translation system overview

Direct parse tree translation should capture the Japanese structure corresponding to an English expression. It is difficult to find a pattern of morpheme sequences because Japanese has almost no word order dependency. Therefore, we introduce a pattern matching function on the dependency structure of the input sentences.

Figure 1 shows an overview of the translation system. The whole system is made up of a pattern matching part, some semantic analysis-based translators and an English synthesizer. A white circle represents a Japanese phrase and a tree made up of white circles represents a dependency analyzed Japanese sentence; A hatched square represents a rule that a Japanese phrase should satisfy and a tree made up of hatched squares represents a parse tree pattern. A white triangle represents a English word and a tree of these triangles reprcsents a structure of English sentence. A tree made up of hatched triangles represents a English template as described in the English template dictionary.

Figure 1: Translation system overview

The pattern matching part receives a dependency analyzed Japanese sentence and compares it with the Japanese parse tree patterns described in the Japanese parse tree dictionary. If the pattern matches the input tree, the tree is divided into a 'trunk part' and some 'leaf parts'. The trunk part is the part that is translated as a whole, and the leaf parts are the rest of the input tree which will be translated by the semantic analysis-based translation. For the trunk part, since the scattered information is already captured, no further deep analysis is required. The system merely uses the English template which corresponds to the pattern from the Japanese parse tree dictionary. For each leaf part, the system invokes the relevant semantic analyzer and transfer process needed to generate the English structures. The English synthesizer receives the English template read from the English template dictionary and the English structures generated by the transfer process, and combines them into a complete English sentence,




Matching and Division process

The pattern matching part compares the dependency analyzed Japanese sentence with parse tree patterns. The dependency structures are represented as trees which contain words and phrases on their nodes. Therefore the parse tree patterns are made of trees with attribute conditions on their nodes. The entire comparison examines all conditions in each node in the parse tree pattern and the attributes of each node in the input sentence.

The result of the pattern matching is that each node of each parse tree pattern has a part of an input tree as its value. The division process separates node values into leaf and trunk parts, but there may be more than one possible separation. Sentence (1) shows the exanlple of a subjunctive expression; which cannot be represented by a more simple Japanese form.

   (1-a)  kare-hamarudenan-demo shitte-iru-ka-no-younihanasu.
healmosteverythingknowtalk
(1-b)he talks as if he knew everything.

In this case, the key expressions that determine that this sentence is the subjunctive are the Japanese adverb marude and the compound made of an auxiliary verb youni and particles kano. The Japanese sentence is parsed into the dependency structure shown in Figure 2-(a), To capture this structure, the parse tree pattern of Figure 2-(b) can be prepared in the parse tree dictionary.

(a)(b)

Figure 2: Dependency structure of sentence (1) and example pattern

The pattern matching part supplies the result that X directly corresponds to kare, that Y directly corresponds to shitteiru and that Z directly corresponds to marude. To generate the phrase "know everything" , Y should be nandemo shitteiru. Thus, the additional element nandemo should be connected to variable Y. Z and W, however, are parts of the trunk and are not translated using the transfer rules.

Another example, sentence (2) , may be informative.

   (2-a)  kare-haisu-noashi-wo kumu
hechairlegcombine
(2-b) He assembles chair legs.
(2-c)* He crosses chair legs.

In this case, to correctly translate kumu as "assemble" requires that ashi does not refer to a part of an animal3. Thus the parse tree pattern in Figure 3-(b) should be prepared in the parse tree dictionary.

(a)(b)

Figure 3: Dependency structure of sentence (2) and example pattern

The result from the pattern matching part is that X directly corresponds to kan; that Y directly corresponds to ashi, that Z directly corresponds to isu and that W directly corresponds to kumu. Since the input to generate the phrase "chair leg" in node Y is that Y is isu-no ashi, Z needs to be connected to variable Y.

These examples show that the Z in the pattern for sentence (2) should be treated differently from the Z in the pattern for sentence (1).

To do this, we introduce three types of node in the parse tree pattern.

   a-type   This is the root of a leaf part. This node should be translated into English.
b-type   This is a part of the trunk part. This node should not be translated into English, but is translated directly as a part of the template.
c-type   This is a part of a leaf part, but not the root. This node should be connected to an upper node.

For a-type nodes, the system invokes the relevant semantic analyzer which is described in the parse tree pattern (Figure 4,5). B-type nodes are discarded. For all other elements in the input sentence (those that directly correspond to c-type nodes and any additional elements), the following sequence is performed.

Figure 4: Node categorized pattern for sentence (1) and the result of identification of the node's value

Figure 5: Node categorized pattern for sentence (2) and the result of identification of the node's value

In the case of sentence (1), the nodes should be categorized as follows: X and Y are a-type and Z and W are b-type. X and Y are translated by semantic analysis-based translator, and Z and W are discarded. The sequence shown above is performed for one additional node nan in the example sentence. Following the tree will stop at the node shitteiru, which directly corresponds to a-type node Y. Therefore, the node nan is connected to shitteiru and the system determines that X consists of kare and Y consists of nandemo-shitteiru (see Figure 4).

In the case of sentence (2), the nodes should be categorized as follows: X and Y are a-type, Z is c-type and W is b-type. The node isu which corresponds to c-type node Z is treated by the above sequence and connected to ashi. The system determines that X consists of kare and Y consists of isu-no-ashi (see Figure 5).




Combining the 'trunk' and the 'leaves'

The English template dictionary contains the templates of English structures corresponding to the trunk parts. The template has one variable for each a-type node. The role of each variable such as SUBJ in SENTENCE or MODIFIER in NP is also described in the template. The part determined as the value of an a-type node is analyzed semantically by the relevant analyzer and is transferred by the relevant transfer process. Each result of the transfer process is connected to the template according to the role of the variable in order to build a complete English sentence. Figure 6 shows the English template used with the example pattern in figure 5.

Figure 6: Example of English template




How to treat free elements

The pattern matching part sometimes produces free elements. They need to be correctly translated into English using the semantic analyzer and the transfer process, and the result of the transfer process should be properly connected to the English structure. Our current system can treat free elements if the following two conditions apply.

  1. the resulting English structure has a predicate in its top level
  2. the free element is an adverbial phrase or a case element

When both these conditions are satisfied, the free element is translated as an adverbial phrase or a prepositional phrase connected to the predicate. Otherwise, the direct parse tree translation fails and the whole sentence is translated normally using semantic analysis.

For example, sentence (3) has the free element isoide "in a hurry". Because isoide is an adverbial phrase and there is a predicate in the sentence, the translation of isoide can be connected to the predicate. This gives sentence (3-b).

   (3-a)  kare-haisu-noashi-wo isoidekunda
hechairleghurrycombined
(3-b)He assembled chair legs in a hurry.




Applicability

We examined the translation of naru in a functional test set (Ikehara, 1992) of about 3,700 sentences. Although ALT-J/E translates the verb naru with many valence patterns, it is difficult to translate. One of the reasons is that naru is sometimes translated into a phrase without a verb. The solution is to capture the structure around naru with an upper structure including another verb or to use a mechanism to convert a clause into a phrase. Both are impossible with simple valence patterns for unit sentences which cannot refer to external information. Another reason is that information distant from the verb is required to select the English structure. For example, to translate ’ˇÁsuru jidai-ni naru into "the age will come when ’ˇÁ", ’ˇÁsuru should be treated together with naru4. But the division process in the transfer method would send ’ˇÁsuru jidai to the noun phrase analyzer on its own.

The test set has 146 sentences which include naru, of which about 60 sentences can be translated into acceptable English by ALT-J/E without direct parse tree translation. We have obtained key expressions to capture the structure around naru for about 40 more sentences, rules using these are currently being refined and implemented.




Conclusion

We have combined direct parse tree translation with translation based on semantic analysis. This method has been implemented on the Japanese-to-English machine translation system ALT-J/E. This method is especially effective for translating Japanese native verbs such as naru or suru which are difficult to translate with valence patterns.




Acknowledgments

The authors wish to thank the other members of the MT research group for several valuable discussions. They also wish to thank Miki Nakamura, Tomiko Hyoudou and Hiroko Inoue for their suggestions and valuable discussions.




References

Furuse, O. and Iida, H. (1992).
Transfer-Driven Machine Translation. In Proceedings of In-tematinal Workshop on Fundamental Research for the Future Generation of Natural Language Processing(FGNLP) , Manchester.

Ikehara, S. (1989).
Multi-level Machine Translation Method. Future Computing Systems , 2(3).

Ikehara, S. (1992).
Criteria for Evaluating the Linguistic Quality of Japanese to English MT System. In Proceedings of MT Evaluation Workshop , San Diego.

Katoh, N. and Uratani, N. (1992).
Extraction and machine translation of news sentences with fixed patterns (in Japanese). In Proceedings of 44th Annual Convention IPS Japan .

Maruyama, H. (1993).
Pattern-Based Translation: Context-Free Transducer and Its Applications to Practical NLP. In Proceedings of Natural Language Processing Pacific Rim Symposium , Fukuoka.

Nagao, M. (1992).
Some Rationales and Methodologies for Example-based Approach. In Proceedings of Internatinal Workshop on Fundamental Research for the Future Generation of Natural Language Processing(FGNLP) , Manchester.

Nirenburg, S. and Levin, L. (1991).
Syntax-Driven and Ontology-Driven Lexical Semantics. In Proceedings of the 1st SIGLEX Workshop , Berkeley.

Tsujii, J. and Ananiadou, S. (1993).
Knowledge-based Processing in MT. In Proceedings of Japan-US Workshop on Machine Aided Translation , Washington.




Footnote
1 A unit sentence is made up of a verb and case elements. (Return)
2 The first level rules translate idiomatic expressions (about 3,000 rules). The second level rules are valence patterns (about 10,000 rules) which are described as relations between a verb and cases categorized. by about 3,000 semantic attributes. The third level rules are the default translation rules for verbs. (Return)
3 For translating the verb kumu, ALT-J/E has several valence-patterns. One of them is such that if the direct object has the semantic attribute [LEG] then the verb should be translated as "cross" , while another says that if the direct object has the semantic attribute [CONCRETE THING] then the verb should be translated as "assemble" . Since the semantic attribute of ashi is [LEG], the system without the direct parse tree translation produces the incorrect translation (2-c). (Return)
4 With simple valence patterns, it will be translated into "It becomes the age that ’ˇÁ". (Return)