Hiromi Nakaiwa, Satoshi Shirai, Satoru Ikehara & Tsukasa Kawaoka, AAAI '95 Spring Symposium, March 27-29, 1995

Extrasentential Resolution of Japanese Zero Pronouns using Semantic and Pragmatic Constraints

Hiromi Nakaiwa, Satoshi Shirai, Satoru Ikehara and Tsukasa Kawaoka

NTT Communication Science Laboratories
1-2356 Take, Yokosuka-shi, Kanagawa-ken, 238-03 JAPAN
{nakaiwa, shirai, ikehara}@nttkb.ntt.jp, kawaoka@cslab.kecl.ntt,jp

Abstract

This paper proposes a method to resolve Japanese zero pronouns with extrasentential reference which can be realized in a practical machine translation system. This method focuses on semantic and pragmatic constraints such as modal expressions, verbal semantic attributes and conjunctions to determine the extrasentential referents of Japanese zero pronouns. According to a window test, 100% of zero pronouns with extrasentential referents could be resolved correctly with the consistency,

INDEX

In all natural language, elements that can be easily deduced by the reader are frequently omitted from expressions in texts (Kuno 1978). This phenomenon causes big problems in natural processing systems. For example in a machine translation system, the system needs to recognize that elements not overtly indicated in the source language may become mandatory elements in the target language. Particularly in Japanese, the subject and object are often omitted; whereas they are often mandatory in English. Thus, in Japanese to English machine translation systems, it is necessary to identify case elements omitted from the original Japanese (these are referred to as "zero pronouns") for their translation into English expressions.

Several methods have been proposed with regard to this problem. Kameyama (1986) and Walker et al. (1990) proposed methods to resolve Japanese zero pronouns based on the centering algorithm using the types of postpositional particles and the existence of empathy loaded verbs. Yoshimoto (1988) suggested a method that uses topics extracted from a dialogue to identify the referents that were indicated in the dialogue. It also uses the types of honorific expressions and speech acts to identify the referents that were not overtly shown in the dialogue. Dousaka (1994) proposed a model to identify the referents of Japanese zero pronouns referring to persons by interpreting the pragmatic conditions of the appropriate usage of honorific expressions, donatory verbs and specific types of sentence-final grammatical forms. When considering application on a practical machine translation system for which the translation target area can not be limited, it is not possible to apply these methods directly because their precision of resolution is low as they only use limited information and the volume of knowledge that must be prepared beforehand is so large.

The zero pronouns that must be resolved by a machine translation system can be classified into 3 types; (a) zero pronouns with antecedents within the same sentence (intrasentential), (b) zero pronouns with antecedents elsewhere in the text (intersentential) and (c) zero pronouns with deictic reference (extrasentential). Regarding type, (b), Nakaiwa et al. (1992) proposed a method to determine the intersentential antecedents using verbal semantic attributes, focus is placed on the dynamic characteristics of verbs and the relationships between verbs. The rules used in this method are independent of the field of the source text. Therefore, anaphora resolution may be conducted with a relatively small volume of knowledge making the proposed method very suitable for machine translation systems.

In this papper, we propose a widely applicable method to determine the extrasentential referents of Japanese zero pronouns (type (c)) using semantic constraints such as verbal semantic attributes and pragmatic constraints such as modal expressions and types of conjunctions. Bccause of the use of these constraints, we can determine the referents that were not shown in the sentences in texts with simple rules that work without limiting the target sentence domain.

To grasp the distribution of zero pronouns and their antecedents that do not appear in the same sentence, in this chapter, we examine which zero pronouns must be resolved and where their antecedents appear in a test set designed to evaluate the performance of Japanese to English machine translation systems (Ikehara 1994). The 3718 test sentences have many examples of zero pronouns making intrasentential and extrasentential anaphoric references. The sentence set was created to test the coverage of expressions that can be translated by Japanese to English MT systems based on the varieties of Japanese expressions and the differences between Japanese and English. The sentence set has about 500 kinds of test items. Each sentence has a manual translation, and almost all of the sentences can be translated without contextual information (3704 sentences out of 3718 sentences). A MT system can be evaluated by comparing its output to the equivalent manual translation. Each sentence is expressed in natural Japanese and the sentence set covers many different expressions.

The results of the examination of zero pronouns and their referential elements in the functional test sentence set are shown in Table 1. There were a total of 515 zero pronouns.

乢	Location of Referential
	------------	乢	Elemts
Location
			------------	乢
of Zero Pronouns

Intrasentential

Extrasentential

Sub Total
[Cases]

Etc.

Passive voice

I or we

you

human

Etc.

in embedded sent.

Ga(subject)

109

151

448

in embedded sent.

Wo(direct object)

Ni(indirect object)

Etc.

Sub Total [Cases]

140

375

515

Table 1 Frequency of Appearance of Zero Pronouns and their Referential Elements in Test Sentences
(Source of Sample Sentences: Test Sentences to Evaluate Japanese to English MT System (3718 sentences).
Of the test sentences, 463 out of 3718 contained zero pronoun(s))

The location of referential elements can be divided into 2 kinds: those expressed in the same sentence, and those not expressed in the same sentence. The latter were farther classified into 6 kinds.

According to this study of the functional test sentence set, in 375 out of 515 instances (73%) the antecedent wasn't expressed in the sentence. Zero pronouns that could be left unexpressed by converting the translation to passive voice amounted to 174 instances (34%). The other zero pronouns, 201 instances (39%), referred to antecedents that did not appear in the sentence.

Zero pronouns that were the subjects and that refer to the writer or speaker I or a group we amounted to 69 instances (13%) out of the 201 instances. Further examination revealed that only in these 69 instances did the verb that governed them express some modality such as - shitai '-want to -' or -shiyou 'Let -' or the verbs were omou 'think' and other such words indicating THINKING ACTION. Furthermore, zero pronouns that were the subjects and that referred to the reader or hearer you, amounted to 28 instances (5%) out of the 201 instances. In these 28 instances, the verbs that governed these zero pronouns expressed the modalities of - subekida 'should' or -sitehanaranai 'must not'. Similarly, modalities and verb types can also be used to identify 'it' or the 'unknown human'. This type of zero pronoun can be resolved by deducing their referents using modality or categorized verbal semantic attributes.

Based on the results shown in previous chapter, we propose a method to resolve Japanese zero pronouns whose antecedents do not appear in the texts.

To resolve Japanese zero pronouns whose antecedents do not appear within the texts, it is possible to use case elements' semantic constraint conditions to estimate referents. Semantic information used to estimate supplementing elements is a constraint on cases. For example, these constraints to the cases are used for selecting the transfer pattern in a machine translation system. Figure 1 shows an example of a transfer pattern in a Japanese to English machine translation system for the Japanese verb ikimasu 'go'. Figure 1 shows how, if the Japanese verb is ikimasu 'go' and the noun phrase with a ga particle, which shows a subject, has the semantic attribute SUBJECTS, VEHICLES or ANIMALS, then the verb should be translated as 'go'. In this pattern, if subject N1 becomes a zero pronoun, the system tries to estimate the referent using semantic constraints. But, in this case, it is impossible to estimate the referent as one type, because there are three kinds of semantic constraints. In the transfer pattern, the semantic constraints were abstract if the selections of translation were not needed using the semantic constraints. So, this method frequently poses difficulties in pinpointing elements to be estimated.

According to the results that were examined in the previous chapter, this type of zero pronoun can be resolved by deducing their referents not only using semantic constraints to the cases but also using modality or categorized verbal semantic attributes. For example, in this case, it is effective to determine the referents corresponding to 'I' using the verbal semantic attributes of the pattern, N1's PHYSICAL TRANSFER and the polite expression -masu.

According to the analysis of the results shown in the previous section, we found that modal expressions and verbal semantic attributes are useful in determining the extrasentential referents of Japanese zero pronouns. Also, we can estimate that the types of conjunctions that are effective to determine the referents in a complex sentence. In this section, we examine three kinds of semantic and pragmatic constraints, modal expressions, verbal semantic attributes and conjunctions.

Constraints based on modal expressions Modal expressions in Japanese are expected to be the most powerful constraints to estimate extrasentential referents. For example, in the case of zero pronouns in ga-cases 'subject', the referent becomes the writer or speaker 'I' or a group 'we' if the sentence has the modal expressions, -sitai '冇 want to -' HOPE or -sitehosii '冇 want 冇 to -' CAUSATIVE HOPE, and the referent becomes the reader or hearer 'you' if the sentence has the modal expressions, -siteha-ikenai '冇 must not -' PROHIBIT or -subekida '冇 should -' OBLIGATION. If inere are no referent candidates within the elements in the texts using contexts, dIe referents can be determined using previous constrains based on modal expressions. To write these constraints based on modal expressions effectively, we used the modal attributes propposed by Kawai (1987).

Constraints based on verbal semantic attributes Constraints based on verbal semantic attributes can be divided into the following two types:

'Give and take' expressions such as the verbs morau 'get' and yaru 'give' and transfer expressions such as the verbs iku 'go' and kuru 'come' can determine the referents of zero pronouns without modal expressions, if they have zero pronouns. For example, if the ga-case(subject) of the sentence whose verb is morau 'get' becomes a zero pronoun, the referent becomes 'I'. In the case of verb kuru 'come', the referent becomes an element other than 'I', for example 'you'. These kinds of verbs implicitly indicate the relationship between the writer/speaker and the elements that becomes ga-cases (for example, the empathy (Kuno, 1978) or side of the territory of infomation (Kamio, 1985)). Based on these properties, the extrasentential referents of Japanese zero pronouns can be estimated.

Even if the referents of zero pronouns can not be determined using modal expressions or the types of verbs, sometimes, the referents can be determined using a combination of modal expressions and the types of verbs. For example, in the following Japanese expression,

the ga-case becomes a zero pronoun. This sentence said the experience of writer/speaker and 'I' is suitable for the referents of the zero pronouns. As shown in sentence (1), if the ga-case in the expression with verb whose semantic attribute is ACTION and modal expression -ta PAST becomes a zero pronoun, it is translated by a human translator as 'I'. In a similar way, the ga-case in the expression with a verb whose semantic attribute is ACTION and the modal expression -darou 'will' ESTIMATION becomes a zero pronouns the referent is 'you'. Such constraints using both verbal semantic attributes and modal expressions can he used to estimate the extrasentential referents ofJapanese zero pronouns.

To write these constraints based on types of verbs effectively, we used the 97 verbal semantic attributes (VSA) proposed by Nakaiwa (1994).

Constraints based on conjunctions Sometimes the extrasentential referents of Japanese zero pronouns can be determined depending on the types of conjunctions. The constraints based on the Japanese conjunctions can be divided to the following two types.

(1) The constraints of the case sharing depending on the types of conjunctions

Minami (1974) and Takubo (1987) proposed the characteristic of sharing coverage of cases depending on the types of Japanese conjunctions. For example Minami divided Japanese conjunctions into three kinds, A, B and C. The complex sentence including A type Japanese conjunctions, such as tsutsu 'while' and nagara 'while', shares one ha-case (Topic) and one ga-case (Subject). In the case of B type Japanese conjunctions, such as node 'because' tara 'if', one ha-case is shared but not the ga-case. In the case of C type Japanese conjunctions, such as keredo 'but' kedo 'but', neither the ha-case nor the ga-case are necessarily shared. According to this classification, if two ga-cases in a complex sentence including A type Japanese conjunctions were to become zero pronouns and the referent of one of the two zero pronouns was determined by the constraints proposed previously, the referent of the other zero pronoun is the same referent. These characteristics of Japanese conjunctions can be used to determine the referents of zero pronouns.

(2) Constraints based on conjunctions, modal expressions and verbal semantic attributes

Cooccurrence of conjunctions, verbal semantic attributes and modal expressions in a complex sentence sometimes determines the meaning of the sentence, and sometimes determines the extrasentential referents of zero pronouns in the sentence. For example, in the following Japanese expression,

(2)

冇-ga

tokoya-ni

ika-nai

to,

kami-ga

boubou-ni-naru.

冇-SUBJ

barber-IND-OBJ

go-not

hair

begin to look untidy.

If you don't go to the barber, your hair will began to look untidy.

the subject of the verb ika-nai 'go-not' becomes a zero pronoun but the referent can be determined as the writer or speaker, 'you'. This sentence has the meaning that the writer or speaker advises that if you don't do something, a situation will arise. The meaning type of a complex sentence can be determined using the rules that the conjunction is to 'if' and in the sub clause ga-case becomes a zero pronoun and the meaning of the verb is ACTION with negation and in the main clause the meaning of the verb is ATTRIBUTE with modal expression ni-naru 'become' ATTRIBUTE TRANSFER. Using these kinds of rules, the meaning types of complex sentences can be determined, and the referents of zero pronouns can be determined.

In this section, we propose the algorithm of extrasentential resolution of Japanese zero pronouns using the constraints that were proposed in this chapter. The algorithm that is presented here is realized in a Japanese to English machine translation system. So, the only zero pronouns that must be resolved are those that become mandatory elements in English. Anaphora resolution of zero pronouns is conducted as follows.

(Step	1) --Detection of zero pronouns.
	If they exist, examine whether there are antecedents within the text. (For example, anaphora resolutions are conducted using Nakaiwa's method (1992).)
	If their antecedents can be found, finish the resolution process.
(Step	2)--When the antecedent cannot be detected in step 1, extrasentential resolution of Japanese zero pronouns using verbal semantic attributes, modal expressions and the types of conjunctions are conducted. The conditions to determine the referents are summarized in Table 2.
	When the referents are determined, the system takes in to account not only these conditions shown in the Table 2, but also the semantic constraints governed by verbs as shown in Figure 1.
	If their referents can be found, finish the resolution process.
(Step	3)--If their referents can not be found and the translation in passive voice can be done, translate in the passive voice; else based on the semantic restrictions imposed on the zero pronoun by the verbs, deductively generate anaphora elements.
	Finish the resolution process.

Appearance Location of Zero Pronouns	Condition	Referents	Comment
ga-case (subject)	modal:HOPE(-sitai)	I or we	speaker/writer hopes
	modal:CAUSAL HOPE (-sitehosii)		speaker/writer hopes to hearer/reader
	modal:INVITE(-simashou)		speaker/writer invites
	VSA:under ACTION +modal:POLITE(-simasu)		depending on the social relonshop betwen speaker/writer and hearer/reader
	----		----
	modal:PROHIBIT(-siteha-ikenai)	you	speaker/writer prohibits hearer/reader's action
	VSA:under ACTION + modal:OBLIGATION(beki)		speaker/writer make hearer/reader's action oblibation.
	----		----
	VSA:BODILY ACTION THINKING ACTION EMOTIVE ACTION EMOTIVE STATE	human (I, we, you...)	when the verb that show the action or emotion that only human can do appears in te sentence and when there are no other referent candicates., the referents of zero pronouns is human.
	----	human (I, we, you...)	----
	VSA:copula sentence and the meaning is ABSTRACT	it	pronoun of abstract noun should be it
	VSA:ATTRIBUTE & PERCEPTUAL STATE		verbs that indicates weather such as atsui 'hot', -samui 'cold'
	---		----
ni-case (indirect object)	modal:CAUSAL HOPE (-sitehosii)	you	speaker/writer hopes to hearer/reader
ni-case (indirect object)	----	you	----

In this chapter, we show the results of evaluation of the method that was proposed in the previous chapter. The target of resolution is the zero pronouns whose referents are the extrasentential elements shown in the Table 1 and that can be resolved in the constraints in the previous chapter (ga-case<-"I" or "we", "you", "human" "it" and ni-case(indirect object)<-"you"; 5 kinds, 175 entries). The rules to resolve these zero pronouns were created by examining these zero pronouns but they are universal. So, the evaluation was conducted in the window test. We conducted the following two types of tests to evaluate the accuracy of the rules, and the simplicity of the creation of the rules.

To evaluate the former factor, we examine the dependency between the types of conditions in anaphora resolution such as semantic constraints to cases, modal expression, verbal semantic attributes and conjunctive expression, and the accuracy of the resolutions. In this examination, we evaluate the accuracy depending on the types of constraints used. To evaluate the latter factor, we examine the dependency between the complexities of the rules that were used in the resolution and the accuracy of the resolutions. In this examination, we evaluate the accuracy using simple rules that are easy to create and that can be created as universal rules.

To examine the resolution accuracy for conditions of resolution, we examined the accuracy of the method proposed in this papper with 4 kinds of conditions: using conditions of semantic constraints on cases only, using conditions of semantic constraints on cases and modal expression, using conditions of semantic constraints on cases, modal expression and verbal semantic atinbutes, using conditions of semantic constraints on cases, modal expression, verbal semantic attributes and conjunctions.

Table 3 shows the results of the resolution depending on the types of the rules. As shown in this table, all 175 zero pronouns can be resolved with the consistency of rules that were proposed in the previous chapter. By the introduction of verbal semantic attributes, the accuracy of resolution has improved at the same rate as the introduction of modal expressions (41 entries, 24%). From this result, we can say that the verbal semantic attributes are comparatively as effective as modal expressions. The results also show that the accuracy without using the constraints of conjunctions achieves 85%.

Appearance Location of Zero Pronouns	Referents		Resolution Condition
Appearance Location of Zero Pronouns	Referents		Semantic Constraints on cases	Modal + Expression	+ VSA	+ Conjunctions
Ga-case	I or we		16 (23%)	40 (+24) (58% (+35%))	64 (+24) (93% (+38%))	69 (+5) (100% (+7%))
	you		0 (0%)	12(+12) (43%(+43%))	17 (+5) (61% (+18%))	28 (+11) (100% (+39%))
	human	I	0 (0%)	0 (0%)	6 (+6) (67% (+67%))	9 (+3) (100% (+33%))
		you	0 (0%)	0 (0%)	6 (+6) (55% (+55%))	11 (+5) (100% (+45%))
		one	0 (0%)	0 (0%)	0 (0%)	3 (+3) (100% (+100%))
		Sum	0 (0%)	0 (0%)	12 (+12) (49% (+49%))	23 (+11) (100% (+51%))
	it		50 (100%)	50 (100%)	50 (100%)	50 (100%)
Ni-case (ind. object)	you		0 / 0 (0%)	5 (+5) (100% (+100%))	5 (100%)	5 (100%)
Sum			66 (38%)	107 (+41) (61% (+24%))	148 (+41) (85% (+24%))	175 (+27%) (100% (+15%))

To examine the resolution accuracy for the complexity of rules, we examine recall and precision of a method proposed in this paper depending on the complexities. The complexities C are evaluated using the following formula depend on the number of used constraints.

C =

num. of modal const.* 1 + num. of VSA comt.* 1 + num. of conjunctions const.* 2

In this formula, 1 in the modal and VSA and 2 in the conjunction indicate the weights. Because conjunction constraints force both sides of the unit sentence, we add the weight for the conjunctions constraints 2. According to this formula, the complexity of a rule that has a constraint for conjunctions and for VSA in main clause and for modal and VSA in sub clause, becomes 5(= 1 (modal)* 1 + 1(VSA)*2 + 2(conjunction)* 1).

Table 4 shows the accuracy of the resolution depending on the complexities of the rules. 46 kinds of rules were used in the extrasentential resolution of 175 zero pronouns as shown in table 4. The accuracy of resolution using rules with complexities of 3 or less is 90%, and the accuracy of resolution using rules with complexities of 4 or less is 95%. This result shows that the use of the constraints based on modal expressions, VSA and conjunctions can achieve high accuracy using relatively simple rules.

Resolution Condition			Complexities of Rules	Number of Rules	Accuracy
Modal Expression	VSA	Conjunctions	Complexities of Rules	Number of Rules	Accuracy
0	0	0	0 (Only semantic constraints to the cases)	0	38% (66)
1	0	0	1	11 (+11)	61% (+24%) (l07 (+41))
0	1	0	1	12 (+1)	62%(+1%) (108 (+1))
1	1	0	2	29 (+17)	85% (+24%) (148 (+40))
0	0	1	2	30 (+1)	85% (+1%) (149 (+1))
1	0	1	3	31 (+1)	86% (+1%) (151 (+2))
0	1	1	3	34 (+3)	90% (+3%) (157 (+6))
1	1	1	4	36 (+2)	93% (+3%) (163 (+6))
0	2	1	4	39 (+3)	95% (+2%) (167 (+4))
2	1	1	5	40 (+1)	96% (+1%) (168 (+1))
1	2	1	5	44 (+4)	99% (+3%) (173 (+5))
2	2	1	6	46 (+2)	100% (1%) (175 (+2))

This papper proposes a powerful method for extrasentential resolution of Japanese zero pronouns. It was found that the referential elements that were not in the sentence resolved 100% in the window test with the introduction of rules based on three kinds of constraints based on modal expressions, verbal semantic attributes and conjunctions with the consistency. In the future, we will examine the universality of the rules that were created in this consideration by applying to other texts and will examine the intrasentential anaphora resolution using these three kinds of constratnts.

	Introduction
	Appearance of Zero Pronouns in Japanese Texts
	Test Sentences
	Appearance Tendency
	Extrasentential resolution of Japanese Zero Pronouns
	Extrasentential resolution using semantic constraints to the cases
	Extrasentential resolution using semantic and pragmatic constraints
	Algorithm
	Evaluation
	Evaluation Methods
	Resolution Accuracy for Condifions of Resolution
	Resolution Accuracy against Rule Complexity
	Conclusion
	References

(1)	冇-ga	hon-wo	yon-da.
	冇-SUBJ	book-OBJ	read-PAST
	I read a book.

Abstract

INDEX

Introduction

Appearance of Zero Pronouns in Japanese Texts

Test Sentences

Appearance Tendency

Extrasentential resolution of Japanese Zero Pronouns

Extrasentential resolution using semantic constraints to the cases

Extrasentential resolution using semantic and pragmatic constraints

Algorithm

Evaluation

Evaluation Methods

Resolution Accuracy for Condifions of Resolution

Resolution Accuracy against Rule Complexity

Conclusion

References