Yasuhiro Akiba, Hiromi Nakaiwa, Yoshifumi Ooyama & Satoshi Shirai, International Journal on Artificial Intelligence Tools, December 2001

Interactive Generalization of a Translation Example Using Queries Based on a Semantic Hierarchy

Yasuhiro Akiba,^* Hiromi Nakaiwa, and Yoshifumi Ooyama
NTT Communication Science Laboratories
2-4, Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, JAPAN
{akiba,nakaiwa,ooyama}@cslab.kecl.ntt.co.jp

Satoshi Shirai
ATR Spoken Language Translation Research Laboratories
2-2-2, Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0288, JAPAN
satoshi.shirai@atr.co.jp

This article addresses the issue of acquiring translation rules for machine translation (MT) systems that adopt a transfer approach. These rules are semantic pattern pairs (SPPs) of the source and target languages. Practical MT systems must additionally contain a huge number of SPPs corresponding to rarely-used predicates and predicate usages. Such SPPs are difficult to automatically acquire with corpus-based methods. To solve this difficulty, this article proposes a method to acquire SPPs by using queries based on a semantic hierarchy. The proposed method asks a lexicographer for the necessary information in order to generalize the conditions of SPPs and then gradually generalizes these conditions. Experimental results show that the proposed method allows the acquisition of more plausible conditions within almost the same time spent for manual generalization.

Keywords:

Natural Language Processing, Machine Translation, Knowledge Acquisition, Translation Knowledge, Interactive System.

[ International Journal on Artificial Intelligence Tools, pp.315-320 (December, 2001). ]

^* He moved to ATR Spoken Language Translation Research Laboratories.
Address: 2-2-2, Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, JAPAN.
E-mail: yasuhiro.akiba@atr.co.jp

INDEX

	1. Introduction
	2. Acquisition Task
	3. Proposed Method
	3.1. Generating sentences for queries
	3.2. Presenting queries and getting answers
	3.3. Approaches for searching for an appropriate category
	4. Experimental Work
	5. Conclusion

	Acknowledgments
	References

1. Introduction

The World Wide Web (WWW) has become very popular throughout the world. Due to this popularity, more and more people have come to realize the growing opportunities for them to read documents written in unfamiliar languages. Accordingly, there are higher hopes for more efficient machine translation (MT) systems.

One attempt to achieve MT systems is the use of a semantic-based transfer approach.^[1,2,3] This approach focuses on the collocation between verbs and nouns. Semantic-based transfer MT systems provide if-then rules (called semantic pattern pairs or SPPs), whose if-parts and then-parts are collocation patterns in the source language (SL) and the target language (TL), respectively.

Figure 1 shows a simplified example^[4] of an SPP in a Japanese-English MT system, ALT-J/E.^[1] The if-part is a Japanese pattern, and the then-part is an English equivalent. This SPP indicates that if the Japanese verb is yaku , its subjective noun is categorized as People and its objective noun is categorized as Bread or Confectionery; accordingly, the corresponding English verb is bake and its subject and object are English translations of the Japanese subject and object, respectively. Each slot such as J-SUBJ or J-OBJ contains the meanings of nouns (semantic categories) like People, Bread, or Confectionery.¹

IF J-VERB = yaku

J-SUBJ = N₁:[People]

J-OBJ = N₂:[Bread or Confectionery]

THEN E-SUBJ = N₁

E-VERB = bake

E-OBJ = N₂

Fig. 1. SPP for Japanese verb yaku.

ALT-J/E has about 2,800 semantic categories,^[4] which constitute a hierarchy with a maximum depth of 12 (Figure 2). Each of the approximate 400,000 nouns in the ALT-J/E lexicon has one or more semantic categories as its meaning.

Fig. 2. Upper levels of the semantic hierarchy ALT-J/E.

Practical MT systems based on a semantic-based transfer approach must additionally contain a huge number of SPPs corresponding to rarely-used predicates and predicate usages. In the case of ALT-J/E, Shirai et al.^[5] reported that about 10,000 SPPs, corresponding to rarely-used predicates and predicate usages, have to be generated for ALT-J/E to cover nearly all of the predicates in Japanese. The existing approaches for acquiring SPPs include corpus-based approaches.^[6,7,8,9]

As Church^[10] indicates, however, many possible co-occurrences cannot be observed even in a very large corpus. That is, a sufficient number of illustrative sentences cannot be prepared for the above kinds of SPPs. These approaches are, therefore, of limited applicability.

To date, the above kinds of SPPs have been acquired by manual approaches.^[5] The task of generating the SPPs has involved inputting the appropriate (not over-general and not over-specific) semantic categories into each slot. However, the following reasons have made it difficult for lexicographers to manually generate SPPs.

Large number of candidates:: Searching for appropriate categories requires a lot of effort on the part of lexicographers. This is because the number of candidates for appropriate categories ranges from several thousands^[4] to several tens of thousands,^[11] depending on the semantic hierarchy used.
Necessity of experience:: To specify appropriate semantic categories, lexicographers must be very familiar with all of the semantic categories and the lexicon on an MT system.
Different quality of generalization:: Lexicographers treat minor translation examples² differently according to their own generalization standards. Some lexicographers specify more general semantic categories so as to be met by minor examples. Other lexicographers specify more specific semantic categories so as not to be met by minor examples.

To overcome the difficulty of generating SPPs and the sheer quantity of SPPs, methods to support lexicographers are required.

This article proposes a new method that searches for appropriate semantic categories to be inputted into each slot by using queries based on a semantic hierarchy. The proposed method adopts three different approaches to searching for appropriate semantic categories. The proposed method generates translation examples at each search point and asks a lexicographer whether a noun corresponding to a target slot can be semantically collocated with the verb on the SL side in the acquired SPP and whether that noun can also give acontext such that the most plausible equivalent for the verb on the SL side is the verb on the TL side. The only task that the lexicographer has to do is to answer each query. Consequently, the resulting categories follow a specific generalization standard.

The authors experimentally evaluated the proposed method by acquiring the semantic categories for ten SPPs^[4] with ALT-J/E. Experimental results showed that the proposed method was able to acquire, in five cases, the same categories as, or in the other cases, more plausible categories than, those specified manually within almost the same time spent for the manual generalization.

The next section describes the acquisition task in this study. The proposed method is presented in Section 3. Experimental results are shown and a discussion is provided in Section 4. Finally, our conclusions are presented in Section 5.

2. Acquisition Task

This section characterizes the appropriate semantic categories that the proposed method should input into each slot in SPPs and describes the acquisition task that the proposed method handles.

Let us assume that lexicographers attempt to acquire the appropriate semantic categories for the slots of the SPP shown in Figure 1 while they derive (1) as a sample sentence that meets the conditions of the acquired SPP.

(1)	Taro-ga	appurupai-wo	yaku
	taro-SUBJ	apple pies-OBJ	bake-PRESENT
	`Taro bakes apple pies'

where the nouns Taro "a typical first name of Japanese males" for the J-SUBJ slot or appurupai "apple pies" for the J-OBJ slot are called the sample nouns, and the semantic categories of the sample nouns are called the sample categories. Let the sample categories of Taro and appurupai be Male and Confectionery, respectively.

In specifying different semantic categories from the sample categories into slots so that we caninvestigate what kinds of linguistic phenomena can be observed, let us replace one of the sample nouns with another noun as follows.

(2)	Meari-ga	appurupai-wo	yaku
	mary-SUBJ	apple pies-OBJ	bake-PRESENT
	`Mary bakes apple pies'

(3)	Mangetsu-ga	appurupai-wo	yaku
	a full moon-SUBJ	apple pies-OBJ	bake-PRESENT
	`A full moon bakes apple pies'

(4)	Taro-ga	sake-wo	yaku
	taro-SUBJ	a salmon-OBJ	grill-PRESENT
	`Taro grills a salmon'

In the case of (2), the sentence is natural as a Japanese sentence, so the substituted noun Meari "Mary" can be semantically collocated with Yaku "bake". In the case of (3), the description of the sentence Mangetsu-ga appurupai-wo yaku is not realistic and cannot happen normally. In this case, Mangetsu "a full moon" cannot be semantically collocated with yaku. In the case of (4), the sentence is natural as a Japanese sentence so the substituted noun sake "a salmon" can be semantically collocated with Yaku . On the other hand, the most plausible equivalent for Japanese verb yaku is grill rather than bake . This is because, although the source sentence does not describe definite clues in order to select the English equivalent, the most plausible situation of the source is Taro grills a salmon as long as one takes account of the culture of SL; in this case, the Japanese culture.

As seen above, the following linguistic phenomena can be observed through the replacement of nouns,

(P1): Some nouns (or semantic categories) can be semantically collocated with the verb on the SL side in the acquired SPP.
(P2): Out of all of the semantically collocated nouns (or semantic categories), some nouns (or semantic categories) can give a context such that the most plausible equivalent for the verb on the SL side is the target verb on the TL side. This phenomenon is a good indicator for finding the appropriate categories.

As a semantic category specified to a slot processed by lexicographers (the target slot) canchange from the sample category for the target slot to the root of the semantic hierarchy step by step, the number of nouns able to meet an acquired SPP can increase gradually. Let C_i , hereafter, denote the i-th semantic category on the path from the root of the semantic hierarchy to the sample category for the target slot (Figure 3). When C_i is specified to the target slot instead of C_i+1, the ratio of the collocated nouns in (P1), to the additionally covered nouns (Figure 4), is called the acceptable rate of C_i , which is related to the occurrence probabilities of the SL sentences generated by the replacement. The ratio of the nouns that can give the context in (P2), to all collocated nouns in (P1), is called the translatable rate of C_i (Figure 4), which is related to the probabilities that such substituted nouns can give the context in (P2). Because both rates of the appropriate semantic categories specified to the target slot should be sufficiently large, let us focus on the lower thresholds of both rates and call them the minimal acceptable rate and the minimal translatable rate, respectively. A semantic category whose acceptable rate is the minimal acceptable rate or greater and whose translatable rate is the minimal translatable rate or greater is called an ok-category. On the other hand, a category that is not an ok-category is called an ng-category.

Fig. 3. The path between the root in the semantic hierarchy C₁ and the sample category C_M for the target slot: M denotes the depth of the sample category for the target slot.

Fig. 4. Acceptable rate and translatable rate of C_i (|

| denotes the number of nouns in the set.)

This article characterizes each appropriate semantic category as an ok-category that is located at the highest level on the semantic hierarchy when the minimal acceptable rate and the minimal translatable rate are given.

Consequently, given the minimal acceptable rate and the minimal translatable rate, the acquisition task that the proposed method handles is to search for the highest ok-category for the target slot of the SPP acquired by the method (Figure 5).

Input: (A) the skeleton of asentence that meets the acquired SPP,

(B) a combination of nouns such that, for each slot in the if-part of the acquired SPP, one of the nouns should meet the condition of the slot (the sample nouns),

(C) the semantic category for each sample noun (the sample categories).

Output: the highest ok-category in the semantic categories on the path between the root in the semantic hierarchy and the sample category for the target slot.

Fig. 5. Acquisition task.

For example, to make the proposed method acquire the appropriate semantic category for the J-Subj slot of the SPP shown in Figure 1, the inputted skeleton of the sentence that meets the acquired SPP is N₁-ga N₂-wo yaku "N₁ bake N₂", where N_i is a variable. Taro and Male are inputted as the sample noun for the J-SUBJ slot and the semantic category, respectively. At the same time, appurupai and Confectionery are inputted as the sample noun for the J-OBJ slot and the semantic category, respectively. Lexicographers can easily input this information. The output is the highest ok-category in the eight semantic categories on the path between the root in Figure 2, Anything (C₁), and the leaf, Male (C₈).

3. Proposed Method

This section describes the proposed method, which adopts three approaches to search for the highest ok-category described in Section 2. Figure 6 shows the overview of the proposed method. The proposed method basically generalizes the sample category for the target slot through interaction with a lexicographer. The proposed method, at first, requires the initial information explained at the end of the previous section: skeleton, sample nouns, and sample categories (Step 1). After the initial information is inputted by the lexicographer (Step 2), the proposed method updates the current search point C_i (Step 3). Next, the proposed method generates sentences for C_i and asks yes-no queries by using the generated sentences (Step 4). After receiving the answers to the queries from the lexicographer (Step 5), the proposed method estimates the two rates of C_i : the acceptable rate of C_i and the translatable rate of C_i (Step 6). The proposed method then seeks the highest ok-category, i.e., the highest category whose acceptable rate and translatable rate are, respectively, the minimal acceptable rate or more and the minimal translatable rate or more (Step 7). Until it finds the highest ok-category, the proposed method repeats (Step 3) to (Step 7). When finding it, the proposed method outputs the highest ok-category (Step 8).

Fig. 6. Overview of the proposed method: interaction between the proposed method and the lexicographer.

In the following, Subsection 3.1 describes the sentence generation of Step 4 and the estimation of Step 6. Subsection 3.2 illustrates the queries of Step 4 and their answers of Step 5. Subsection 3.3 finally outlines the three approaches for the searching of Step 3 and 7.

3.1. Generating sentences for queries

By adopting one of the three approaches for the searching, the proposed method uses the same strategy to generate sentences for queries and to present the queries to lexicographers in order to estimate the acceptable rate and the translatable rate of the current search point. When the current search point is C_i , the proposed method generates sentences in the following way: (i) initially generate a sentence by filling each variable N_i in the skeleton with the corresponding sample noun; (ii) then, generate some sentences by replacing the sample noun for the target slot with other nouns in Cluster_i, which hereafter denotes the set of nouns categorized as C_i or descendants of C_i but not categorized as C_i+1 or descendants of C_i+1. For example, assume that, in order to acquire the SPP shown in Figure 1, the input to the proposed method is the same as presented at the end of Section 2: N₁-ga N₂-wo yaku as the skeleton, Taro and appurupai as the sample nouns, and Male and Confectionery as the semantic categories; the J-SUBJ is the target slot; and C₂ (Concrete in Figure 2) is the current search point. Then, the substituted nouns are categorized as either Places or Objects or descendants of them, as shown in Figure 7.

The number of substituted nouns for C2 is recursively distributed
to the descendants of C2 (excluding C3) according to the number
of their descendent leaves.

Fig. 7. Cluster_i and the separation of Cluster_i into subsets for stratified sampling.^[12]

The proposed method uses the generated sentences in order to estimate the acceptable rate and the translatable rate of the current search point C_i . The main issues are that, in order to estimate both rates within significantly small errors by using only the limited number of generated sentences, how the proposed method selects the nouns substituted for the sample noun and how the proposed method estimates both rates.

To resolve the two issues, the proposed method employs stratified sampling,^[12] a sampling survey technique in statistics, as follows. Cluster_i is separated into subsets of nouns, from which some substituted nouns are collected, where the number of substituted nouns from each subset is decided according to the total number of nouns in the subset as will be seen later. Then, the acceptable rate of C_i is estimated as the weighted average of the acceptable rates for the subsets, where the weight for each subset is the number of substituted nouns. For example, let us assume that C_i is separated into two subsets, that the ratio of the substituted nouns for the subsets is 3:1, and that the acceptable rates for the subsets are 100% and 50%, respectively. Then, the acceptable rate of C_i is estimated as (100*3+50*1)=(3+1). The translatable rate of C_i is also estimated in the same way.

Stratified sampling does not provide a way to separate Cluster_i. the proposed method, therefore, adopts an original technique to separate Cluster_i into subsets, which isdecided step by step as follows. For example, when the total number of substituted nouns in Cluster₂ is 40 under the same input as seen at the beginning of this section, at first, allocate the number of substituted nouns (the sample size) in Cluster₂ among all of the siblings of C₃ (Agents in Figure 7) according to the ratio of the number³ of leaves that are descendants of each sibling of C₃, for example, 3:1. In this example, the sample sizes for Places and Objects become 30 and 10, respectively. For each sibling, allocate the sample size of the sibling among all of the children in the same way, until the sample size is equal to or less than a threshold, for example, 15. If the ratio of the number of leaf-level descendants of Nature, Regions, and Facilities is 2:1:3, then the sample sizes⁴ for Nature, Regions, and Facilities become 10, 5, and 15. Since the sample size of Objects is less than 15, the sample size of Objects is not allocated among the children: Animate and Inanimate. After this, for each child, allocate the sample size of the child among all of the children of the child recursively, until the sample size is equal to or less than the threshold. Let S (i ,j ); (1 <= j <= L_i ), hereafter, denote the semantic categories whose sample sizes are not allocated to their children. In the example, L₂ = 4 and S (2,j )(1 <= j <= L₂) correspond to Nature, Regions, Facilities, and Objects, as shown in Figure 7.

S (i ,j ), (1 <= j <= L_i ) are used as the subsets used by the stratified sampling. The substituted nouns are selected from S (i ,j ) or descendants of S (i ,j ) in the order of frequency of use. The number of selected nouns is the sample size for S (i ,j ). Ten nouns are, in the example, selected from Objects or descendants of Objects; consequently, tensentences are generated for Objects (S (2,4)) through the substitution of the sample noun for the target slot with each of the ten selected nouns (Figure 8). For each of the others: Nature (S (2,1)), Regions (S (2,2)), and Facilities (S (2,3)), ten sentences, five sentences, and fifteen sentences are, respectively, generated in the same way.

Fig. 8. Generation of sentences for queries.

3.2. Presenting queries and getting answers

For each S (i ,j ) ; (1 <= j <= L_i ), the proposed method simultaneously presents the generated sentences to lexicographers, as shown in Figure 9. This simultaneous presentation prevents the lexicographers from misunderstanding the meanings of the substituted nouns. Since all of the substituted nouns in the presented sentences are categorized in a certain semantic category, the lexicographers can easily guess the correct meaning of a substituted noun.

(Q1) (Q2)

Generated sentence 1 Yes Yes

Generated sentence 2 No

Generated sentence 3 Yes Yes

Generated sentence 4 Yes No

Generated sentence 5 Yes Yes

Fig. 9. Image of the interface on which the proposed method presents queries for a substituted noun set S (i ,j ) and gets answers to the queries from a lexicographer.

The lexicographers judge whether each substituted noun can be semantically collocated with the verb on the SL side in the acquired SPP (Q1). If and only if this answer is positive, they also judge whether the substituted noun can give a context such that the most plausible equivalent for the verb on the SL side is the verb on the TL side (Q2). The lexicographers make theirdeterminations by answering queries: (Q1) and (Q2). For example, as explained in detail in Section 2, they answer positive to both (Q1) and (Q2) for (2) and negative to (Q1) for (3). Moreover, they answer positive to (Q1) and negative to (Q2) for (4).

3.3. Approaches for searching for an appropriate category

The three approaches for searching differ in the order in which they search for an appropriate semantic category (Figure 10). The first two approaches, the Bottom-up approach and the Top-down approach, are the same as a linear search. The last approach, the Dichotomy approach, is the same as a binary search. For convenience in the following explanation, let us define that L_M = 1 and S (M ,1) = C_M .

Fig. 10. Three approaches for searching: Bottom-Up, Top-down, and Dichotomy.

In the case that the proposed method adopts the Bottom-up approach, the proposed method applies the above query strategy to each semantic category in reverse order of depth, C_M , C_M-1, . When an ng-category is found, the proposed method stops searching and outputs the latest ok-category.

In the case that the proposed method adopts the Top-down approach, the proposed method applyies the above query strategy to each semantic category in order of depth, C₁, C₂, . When an ok-category is found, the proposed method stops searching and outputs that ok-category.

In the case that the proposed method adops the Dichotomy approach, the proposed method initially applies the above query strategy to the leaf and the root in order. Next, the proposed method prepares a candidate list (C₁, C₂, , C_i , , C_M ) and applies the above query strategy to the semantic category in the middle of the candidate list or to the lower semantic category closest to the middle if a precisely central semantic category does not exist. According to whether the semantic category is an ok-category or not, the proposed method revises the candidate list in the same way as a binary search does. Then, the above procedure is repeated by using the updated candidate list. Consequently, the first and last elements of the candidate list are always an ng-category and an ok-category, respectively, after the root and leaf semantic categories are processed. When the length of the candidate list becomes 2, the proposed method stops searching and outputs the last element.

As mentioned above, the only task that lexicographers have todoistoanswer each query. Consequently, the resulting categories follow aspecific generalization standard. After repeatedly applying one of the approaches for searching to each corresponding input, the proposed method can specify all of the appropriate semantic categories for each slot intheSPP.

4. Experimental Work

The authors evaluated the proposed method on the three following points.

How different are the semantic categories acquired by theproposed method from those manually specified by lexicographers?
Are the semantic categories acquired by the proposed method better than those manually specified by lexicographers?
How is the proposed method able to contribute to the acquisition of SPPs?

In order to evaluate the above points, the authors attempted to acquire semantic categories for SPPs whose if-parts corresponded to the skeletons in the 2nd column of Table 1. The sample nouns inputted to the proposed method are shown under the skeletons. The target of the generalization is underlined. Each semantic category in the 3rd column indicates a semantic category specified manually for the SPP corresponding to the skeleton in the 2nd column. The semantic hierarchy used was that of ALT-J/E as shown in Figure 2. The minimal acceptable rate and the minimal translatable rate were fixed at 2% and 80%, respectively. The sample size for C_i and the threshold of the sample size were 50 and 10, respectively. Two lexicographers, who generated SPPs of ALT-J/E, participated in the experiments.

Table 1: Skeletons and sample sentences for interactive generalization.

No	Skeleton Sample Sentence	Category specified manually
1	N1-ga N2-wo yomu "N1 read N2" Chichi-ga hon-wo yomu "My father reads a book"	`Agents`
2	N1-ga N2-wo yomu "N1 read N2" Chichi-ga hon-wo yomu "My father reads a book"	`Abstract Thing` (`Idea`)
3	N1-ga N2-wo yomu "N1 read N2" Chichi-ga houkokusho-wo yomu "My father reads a report"	`Spirit/Soul/Mind`
4	N1-ga N2-wo N3-ni erabu "N1 elect N2 N3" Juhmin-ga kare-wo kaichou-ni erabu "Residents elect him their head"	`Chief/President/Manager`
5	N1-ga N2-de nyushou-suru "N1 win a prize in N2" Kare-ga Konkuhru-de nyushou-suru "He wins a prize in the contest"	`Abstract Thing` (`Behavior`)
6	N1-ga N2-wo tatamu "N1 close N2" Chichi-ga mise-wo tatamu "My father closes his shop"	`Facilities`
7	N1-ga N2-wo unten-suru "N1 run N2" Chichi-ga hatsudouki-wo unten-suru "My father runs an electric dynamo"	`Machinery`
8	N1-ga N2-wo nageru "N1 throw N2" Chichi-ga bohru-wo nageru "My father throws a ball"	`Objects`
9	N1-ga hanpatsu-suru "N1 rebound" Kabusiki-ga hanpatsu-suru "Shares rebound"	`Economic System`
10	N1-ga N2-ni tassuru "N1 rise to N2" Doru-ga saitakane-ni tassuru "The dollar rises to the highest level"	`Price/Cost`

Table 2 reports experimental results. The 2nd to 4th columns show the relative position of an acquired semantic category in comparison to a semantic category manually specified. For example, +1 or -1 indicates that the acquired semantic category is one semantic category above the semantic category manually specified or below it, respectively. Each number in the 5th or 7th columns shows the number of paired queries, i.e., (Q1) and (Q2) in Section 3.2, presented to the lexicographers. Each number in the 8th to 10th columns shows the time spent for generalization of the sample noun to the target slot. B, T, and D on the 2nd line indicate that the approach for the searching is, Bottom-up, Top-down, and Dichotomy, respectively. Through this experiment, the following things were found:

Although the proposed method acquired different semantic categories in five rows out of ten, the required semantic categories whose relative positions were one or two below were judged by the lexicographers to be better than those specified manually. This is because, in their minds, a manual specification to a translation example is very sensitive, even if the example belongs to a minority of translation examples; and accordingly, the semantic category specified manually tend to become too general.
The above consideration tells us that the Bottom-up approach is the best approach for searching. The average time spent was 17.9 minute. On the other hand, the lexicographers reported that the time spent forthe manual specification work was 15 to 20 minutes on average. The efficiency of the proposed method is, therefore, almost equal to that of manual specification as far as one compares the time spent. Note that, although the trials using each approach weredone in thefollowing order: Top-down, Bottom-up, and Dichotomy, the ratio of the number of paired queries between Top-down and Bottom-up was almost equal to that of the time spent between them. The lexicographers, therefore, did not remember the Q&A in the previous trial: the Top-down approach, at lease when they moved to the next trial: the Bottom-up approach.

Table 2. Experimental results: the relative position of the acquired semantic category to that specified manually, the number of paired queries: (Q1) and (Q2) (See Section 3.2), and the time spent.

No	Difference			# of paired Queries			Time (M.)
No	B	T	D	B	T	D	B	T	D
1	0	0	0	259	53	53	23	6	5
2	0	0	0	158	155	155	17	16	18
3	-1	+3	+3	119	51	51	24	10	5
4	0	0	0	55	262	158	9	34	15
5	-2	-2	-2	153	259	204	8	21	11
6	-1	-1	-1	147	208	157	10	13	12
7	-2	-1	-1	104	310	208	10	54	8
8	-2	-2	-2	208	207	156	55	75	13
9	0	0	0	158	212	160	7	12	8
10	0	0	0	50	316	155	16	4	10
Ave.	--	--	--	141.1	203.3	145.7	17.9	24.5	10.5

5. Conclusion

This article proposed a method to acquire appropriate semantic categories to be inputted into each slot of an SPP by using queries based on a semantic hierarchy. The queries ask whether the noun corresponding to the target slot in presented sentences can be semantically collocated with the verb in SL and ask whether the noun can also give acontext such that the most plausible equivalent fortheverb in SL is the verb on the TL side in the acquired SPP. The method allows lexicographers to acquire more plausible semantic categories for SPPs by simply answering the queries presented by the method.

Acknowledgments

The authors thank NTT Communication Science Labs for their support through the research grant. They also acknowledge the members of Natural Language Processing Systems Department atNTT Advanced Technology Corporation for their cooperation in the development of the proposed system. The first author thanks ATR Spoken Language Translation Research Laboratories for their support of this paper.

References

[1]: S. Ikehara, S. Shirai, A. Yokoo, and H. Nakaiwa, Toward an MT system without preediting-effects of new methods in ALT-J/E , Proc. 3rd Machine Translation Summit: MT Summit III, the Association for Machine Translation in the Americas, Washington (1991) 101-106.
[2]: M. Dorna and M. C. Emele, Efficient implementation of a semantic-based transfer approach , Proc. 12th European Conference on Artificial Intelligence: ECAI-96, ed. Wolfgang Wahlster, John Wiley and Sons, Chichester (1996) 567-571.
[3]: J. Yang, Towards the automatic acquisition of lexical selection rules , Proc. 7th Machine Translation Summit: MT Summit VII, Asia-Pacific Association for Machine Translation, Tokyo (1999) 397-403.
[4]: S. Ikehara, M. Miyazaki, S. Shirai, A. Yokoo, H. Nakaiwa, K. Ogura, Y. Ooyama, and Y. Hayashi, Goi-Taikei: A Japanese Lexicon (in Japanese) , Iwanami Shoten Publisher, Tokyo (1997).
[5]: S. Shirai, S. Ikehara, A. Yokoo, and H. Inoue, The quantity of valency pattern pairs required for Japanese to English MT and their compilation , Proc. 3rd Natural Language Processing Pacific Rim Symposium: NLPRS-95, Korea Advanced Institute of Science and Technology, Seoul (1995) 443-448.
[6]: Y. Akiba, M. Ishii, H. Almuallim, and S. Kaneda, Learning English verb selection rules from hand-made rules and translation examples , Proc. 6th International Conference on Theoretical and Methodological Issues in Machine Translation: TMI-95, the Center for Computational Linguistics, Katholieke Universiteit, Leuven, (1995) 206-220.
[7]: H. Almuallim, Y. Akiba, and S. Kaneda, On handing tree-structured attributes in decision tree learning , Proc. 12th International Conference on Machine Learning: ICML-96, ed. Lorenza Saitta, Morgan Kaufmann, San Francisco, California (1995) 12-20.
[8]: H. Tanaka, Decision tree learning algorithm with structured application to verbal case-frame acquisition , Proc. 16th International Conference on Computational Linguistics: COLING-96, Association for Computational Linguistics, New Brunswick, New Jersey (1996) 943-948.
[9]: T. Utsuro, Sense classification of verbal polysemy based on bilingual class/class association , Proc. 16th International Conference on Computational Linguistics: COLING-96, Association for Computational Linguistics, New Brunswick, New Jersey (1996) 968- 973.
[10]: K. W. Church and R. L. Mercer, Introduction to the special issue on computational linguistics using large corpora , Computational Linguistics 19 (1993) 1-24.
[11]: G. A. Miller, Wordnet: a lexical database for English , Communications of the ACM 38:11 (1995) 39-41.
[12]: R. L. Scheaffer, W. I. Mendenhall, and R. L. Ott, Elementary Survey Sampling (5th ed.) , Duxbury Press, California (1996).

Footnote

¹ Some nouns like Zeri "jelly" are categorized like Confectionery; consequently, semantically strange sentences like watasi-wa zeri-wo yaku "I bake jelly" can meet the SPP shown in Figure 1. It does not matter that sentences like this meet an SPP since they are not inputted into MT systems. (Return)

² Minor translation examples are counterexamples that should not meet a certain SPP but that should meet other SPPs. (Return)

³ These numbers are used as convenient indicators of the semantic diversity under each sibling. (Return)

⁴ When the sample size becomes a decimal like 13.3* (=20*2/3), the decimal is rounded out. (Return)

	(Q1)	(Q2)
Generated sentence 1	Yes	Yes
Generated sentence 2	No
Generated sentence 3	Yes	Yes
Generated sentence 4	Yes	No
Generated sentence 5	Yes	Yes