Mathematical Linguistics Vol. 20

Vol. 20 No. l (June, 1995)	Vol. 20 No. 2 (Sep., 1995)	Vol. 20 No. 3 (Dec., 1995)	Vol. 20 No. 4 (Mar., 1996)
Vol. 20 No. 5 (June, 1996)	Vol. 20 No. 6 (Sep., 1996)	Vol. 20 No. 7 (Dec., 1996)	Vol. 20 No. 8 (Mar., 1997)

Vol. 20 No. 1 (Jun. 20, 1995)

Classification:
paper
Author:
ITO Masamitsu
Title:
Automatic Segmentation Methods for Kanji Strings of Classical Japanese Texts
Pages:
1-17
Descriptors:
Wago-yoogen-bunkatsuhoo; Jii-bunkatsuhoo; automatic segmentation; segmentation of kanji strings; word recognition; `Konjaku-Monogatarishuu'; word units
Abstract:
I developed a program system to automatically segment sentences in classical Japanese. The automatic segmentation methods adopted as algorithms for the program are divided into two parts. One is for kanji strings, and the other is for kana strings. This paper describes some automatic segmentation methods for kanji strings of classical Japanese texts.
The methods are based upon categories of meaning of kanji strings and a common orthographic characteristics of verbs and adjectives.
My program uses only a small table containing 592 units. In my experiments I have obtained 92.28% correct results.
These methods are simple and of practical use.
Classification:
paper
Author:
OYAMADA Yuki
Title:
Conjugation words as Case Elements in Modern Japanese
Pages:
18-34
Descriptors:
conjugation word; case; Mizutani grammar; Restriction Predicate; continuous change of case relation ``ni''
Abstract:

Conjugation words were used regularly as case elements in Archaic Japanese. In modern Japanese, however, they are used in a restricted manner. The conjugation words are classified into two types; ones that require their own predicates, and ones that do not. This paper describes the grammatical analysis to these two types.
Especially, the second type is shown to consist of case particles ``ni'' and ``yori'', for which we discuss their case relations. As a result, the method of processing the conjugation words as case elements will be presented.

Vol. 20 No. 2 (Sep. 12, 1995)

Classification:
paper
Author:
YAMAOKA Miho
Title:
Analysis on Coordinate Structure and Its Experiment
Pages:
55-79
Descriptors:
coordinate Structure; symbol of coordination; degree of the similarity; Mizutani grammar; parser system; JUST
Abstract:

This paper describes an analysis on the coordinate structure from the semantic point of view as well as the syntactic point. And also we show the result by using the parser system, JUST, which we developed to the analysis.
In the paper, we classified the type of coordinate structure in accordance with the construction level of the sentences. The types are of morpheme level, terminal level, phrase level, and sentence level. In addition we have refined on them, whether they have a symbol of coordination or not. To eliminate the ambiguity on parsing coordinate structure, the sentences are evaluated to low, if they have problems in either syntactic or semantic process. The major part of the minus point in parsing coordination structure is the degree of morpheme similarity. Finally, we demonstrate our work by using the parser system, JUST, which we developed in respect to the coordinate structure.
Classification:
report
Authors:
YOSHIDA Masakazu, KOSHIBA Ryosuke, SHIINA Satoshi, and MATSUMOTO Syozi
Title:
A Study of the Signature
Pages:
80-89
Descriptors:
signature, author indefication, handwriting
Classification:
news
Author:
TOYOSHIMA Masayuki (Hokkaido University)
Title:
Introducing JIS X4061 ``Japanese Collation''
Pages: 90-95
Descriptors:
JIS code, JIS X4061, Japanese collation, sorting

Vol. 20 No. 3 (Dec. 12, 1995)

Classification:
paper
Author:
MIZUTANI Sizuo
Title:
Some Quantitative Considerations on Adverb ``marude''
Pages:
99-111
Descriptors:
``marude''; adverb modifying predication; short-story; hikyoo(alikeness); negation; contextual correspondence; competitive expression; ``marukiri''
Abstract:

People seem to feel that Japanese adverb ``marude'' is or to be used with hikyoo expression, especially with the word ``yoo'' (like or as if). Such senses do not match the actual situation. According to my survey on 145 short-stories(1926 - 1992) where this adverb occurred 83 times, the rate of such usage was merely some 65%. On the other hand the rate of usage with negation was observed over than 10%. Similar situation was found by re - examination of data on 90 magazines (1956) by the National Language Research Institute's word count. Based on this fact, the grammatical and semantic characters of ``marude'' are discussed in order to correct popular misbelief. Taking notice of the original meaning ``entirely'', various uses of this word can be explained inclusively.
Classification:
report
Author:
OTSUKA Misa
Title:
Polysemy in the Dictionary:As Analysis from the view-point of Semantic Categories
Pages:
112-122
Descriptors:
meaning of a word; degree of polysemy; semantic derivation; semantic category; ``Bunruigoihyou''; dictionary
Abstract:

This paper presents an analysis of polysemic words in the dictionary from the view-point of semantic category.
I use ``Bunruigoihyou'' ( ``Word List by Semantic Principle'') to classify into the semantic category of words. In this paper, I survey the distribution of polysemic words from the perspective of the categories assigned in the ``Bunruigoihyou''. Most words with a high degree of polysemy are verbs. Among nouns, words belonging to Category ``1.1 Abstract Relation'' and ``1.5 Nature and Natural Phenomena'' have a high degree of polysemy. On the other hand, words belonging to Category ``1.3 Mind and Acts'' have a low degree of polysemy. Words representing the body are highly polysemic from the view-point of more detailed categories.
Also I analyze the meaning structure of polysemic words from thc view- point of semantic categories, assuming the central meaning of a word to be related to the semantic categories assigned in the ``Bunruigoihyou''. Except for words in Category 1.5, derived meaning tends to remain within the original semantic category. On the other hand, polysemic meaning derived from words in Category 1.5 are generally assigned to other categories. (i.e. te, hone, hikari etc. ) The latter derivations are based on a word form, position, function, and essential properties. Category 1.5 derivations are extremely diverse.
Classification:
note
Author:
WADA Eiichi
Title:
Variation in Kanji Fonts
Pages:
123-127

Vol 20 No. 4 (Mar. 12, 1996)

Classification:
paper
Author:
ITO Masamitsu
Title:
Automatic Segmentation Methods for kana Strings of Japanese Classical Texts
Pages:
139-167
Descriptors:
automatic segmentation; segmentation of kana strings; word recognition; `Konjaku-Monogatarisyu'; word units
Abstract:

I developed a program system to automatically segment sentences in classical Japanese. The automatic segmentation methods adopted as algorithms for the program are divieded into two parts. One is for kanji strings, and the other is for kana strings. This paper describes some automatic segmentation methods for kana strings of classical Japanese texts.
The methods are based upon grammatical forms of conjugated words, suffixes and particles which are wirtten by kana.
My program uses only a small table containing 592 units. In my experiments I have obtained 99.81% correct results.
These methods are simple and of practical use.
Classification:
report
Author:
JIN Ming-zhe
Title:
Statistical Characteristics of Distribution of Phrase to Phrase Modification Distance in Japanese Novels
Pages:
168-179
Descriptors:
distribution of phrase to phrase modification distance; range; individuality of a writer; Akaike Information Criterion(AIC); chi-square measure
Abstract:

The aim of the description of this paper is a statistical analysis of phrase to phrase modification distance in the sentences of Japanese novels. The research reveals that the distributions of the phrase to phrase modification distance do not show the writer's individuality, and are subordinate to the L type distribution. But they are not subordinate to the same distribution, when the they are divided by range. As for the statistical method, the average value of the distribution distance (chi-square distance), the AIC (Akaike Information Criterion) and the chi-square measure are used in the research.
Classification:
essay
Author:
OKAMOTO Tetsuya
Title:
Egawaru and Other Verbs Ending in `-ru'
Pages:
180-182
Classification:
review
Author:
OGINO Tsunao
Title:
Gengo no Kagaku: Nihongo no Kigen o Tazuneru (The Science of Language: toward a Solution of the Origin of Japanese) by YASUMOTO Biten
Pages:
183-184

Vol. 20 No. 5 (Jun. 14, 1996)

Classification:
paper
Author:
SAITO Masahiko
Title:
On Extra-morae in Kokinsyuu
Pages:
187-196
Descriptors:
Kokinnsyu; extra-morae; vear-positioned vowels; structure of the last line; rules of contraction
Abstract:

We consider the phenomenon of extra-morae (ji-amari) in poems of Kokinshu. T. E. Huber proposed a set of rules of contraction of vowels and obtained 31 exceptions. We add several rules of contraction, in particular
1) a rule concerning the structure of the last line of poems and
2) phonological rules on two successive vowels. We can reduce the number of exceptions from 31 to 22.
Classification:
report
Author:
TAMURA Manami
Title:
Kanji for Writing Basic Japanese vocabulary
Pages:
197-204
Descriptors:
basic kanji; Jooyoo kanji; Kyooiku kanji; basic vocabulary; number of words; covering ratio; covering ratio; rate of use
Abstract:

In order to define basic kanji, it is indispensable not only to survey frequency ratio of kanji but also to consider number of words in which each kanji is used. In this paper, I report on the features of kanji used in writing basic Japanese vocabulary. As basic vocabulary I use ``Nihongo Kyoiku Kihongoi Nana-shu Taisyohyo'' (`` Contrastive Word Lists of Seven Kinds of Basic Japanese Vocabulary''). Obviously, the kanji that has high frequency is not always kanji that is used many words, and Kyoiku kanji (the characters which Japanese primary students must learn by their sixth year of schooling) is not entirely included in the kanji for writing basic vocabulary, but has different elements from set of kanji for writing basic vocabulary.
Classification:
essay
Author:
ISHIMOTO Arata
Title:
Logic and Linguistics
Descriptors:
logical grammar, Montague grammar, Lesniewsky ontology
Pages:
205-208
Classification:
review
Author:
MIYAJIMA Tatsuo
Title:
Die deutsche Sprache in Japan (Nihon ni okeru Doitsugo --- Jitsuyomen to Gakushumen)
Pages:
209-216

Vol 20 No. 6 (Sep. 13, 1996)

Classification:
paper
Author:
OGINO Tsunao
Title:
Some Characteristics of Speakers' Introspection Newspaper CD-ROMs,and Japanese Dictionaries as Language Data; Recognition of `-suru' verbs
Pages:
233-252
Descriptors:
speakers' introspection, newspaper CD-ROM, Japanese dictionary, `-suru' verbs
Abstract:

Japanese nouns can be used as verb by `-suru'. But appropriateness of such `-suru' verbs varies from word by word. This paper picked up 38 nouns and checked whether they are appropriate as a `-suru' verb using three kinds of data;
(1) native speakers' introspection (N=329),
(2) frequencies of appearance in newspaper CD-ROMs (1.9GB),
(3) descriptions of 42 Japanese dictionaries.
The results show the following characteristics of each material as language data.
(1) Speakers' introspection can reflect current tendencies of language change.
(2) Newspaper CD-ROMs, large electronic texts, include a variety of styles and often show interesting examples.
(3) Japanese dictionaries reflect language facts for the last several decades and include old-fashioned descriptions.
Classification:
note
Author:
ONO Masaki
Title:
Analysis on The Discourse-Topicalization System and Its Experiment
Pages:
253-256
Descriptors:
discourse topic, dative, object, sympathy hierarchy
Classification:
note
Authors:
MIYAJIMA Tatsuo, and NAKANO Hiroshi
Title:
On the Enlargement of ``bunrui-goihyo'' (Word List by Semantic Principles-in English)
Pages:
257-264
Descriptors:
thesaurus, ``Bunruigoihyou''
Classification:
program
Author:
NAKANO Hiroshi
Title:
Program Library MCL for Japanese Word Count
Pages:
265-279
Descriptors:
program library, Japanese word count

Vol. 20 No. 7 (Dec. 13, 1996)

Classification:
paper
Author:
MIZUTANI Sizuo
Title:
Essay on the Case in Modern Japanese, with Grammatico-semantic Description of Case-markers
Pages:
283-303
Descriptors:
case as relation; system of cases; case-marker; zero-presentation of case; grammatico-semantic description; modern Japanese
Abstract:
By means of set theory with variables on the level of strings, this paper proposes a definition of Japanese cases as ``relation'' of two elements combined by a case-marker or its zero-representation: a case-marker does not belong to its antecedent but functions as a binary combinator for a pair of elements. This paper also gives grammatico-semantic descriptions of all kakuzyosi (case-markers) i. e. ga, no, o, ni, to, [t]te, e, yori, kara, and de. Case-relations in (modern) Japanese are classified into seven classes, based on results of survey of actually used examples. One case-marker's form, however, does not always correspond to only one class of relation. Therefore, the Japanese system of cases can be said to be rather semantic than to be (purely) grammatical. Further, more or less optical zero-presentations for four cases are found from the functional side as relation.
Classification:
report
Author:
YOKOYAMA Shoichi and SAKUMA Kazuhiro
Title:
Analysis by Synthesis of Compound Nouns using Semantic Features
Pages:
304-314
Descriptors:
Compound Nouns; Semantic Features; Natural Language Processing; Analysis by Synthesis; Connection Rules; Combinatorial Explosion
Abstract:

In natural language processing, it is difficult to deal with compound nouns in Japanese. If the grammar rules authorize simple noun sequence, parsing trees run into a combinatorial explosion. Not all nouns form compound nouns. However, if the compound nouns themselves are entered as lexical items in a lexicon, the content of the lexicon will be explosive.
This paper describes the analysis and the classification of compound nouns using so-called ``analysis by synthesis'' method, which is on the analogy of speech processing. 34 semantic features are defined to nouns. About every 5 nouns are selected for a feature, and the combination of these nouns make compound nouns. Whether the resulting compound nouns are decided as acceptable or not is examined by hand, and connection rules are made. Abount 70% compound nouns are decided correctly based on these rules.
Classification:
essay
Author:
SIBATA Takesi
Title:
Correspondence Rate of ``Gensokuteki ni (in Practice)''
Pages:
330-331

Vol. 20 No. 8 (Mar. 7, 1997)

Classification:
paper
Author:
MIZUTANI Sizuo (Institute of Behavioral Sciences)
Title:
In Mathematical Literature
Pages:
335-356
Descriptors:
Valenz; case-cobmining pattern; mathematical literature; apparent pattern; abstract pattern; sampling; substitute for case-marker
Abstract:

In Japanese, adjectives, nouns and part of adverbs can be predicates as well as verbs. The notion ``Valenz'' of verbs would be to be enlarged to such classes, as case-combining patterns of predicates. In addition, there are instances that some case-markers of different forms have, in real contexts, almost the same function in the sense that they are exchangeable. Assuming one function for such cases regardless of their apparent word-forms, ``abstract patterns'' can be set up. I examine this notion on a book of mathematics titled ``Suugaku 100 no Teiri (One Hundred Famous Theorems in Mathematics).'' The number of observed patterns by a 1/20 sample was 121 on abstract patterns, and the total number was estimated as 147.275 (+-1.061: standard deviation of the estimator) by twice applications of Deming's least-square adjustment based on Mizutani's n-k relation formula. As for apparent patterns the total number was estimated as 484.5 (+-36.4). This paper, also, gives an outline of results and discusses some problems on the word order of elements (on apparent patterns) and concerning a way of abstraction of patterns such as no-subject elements, subjects properly referring to classes and expressions which substitute for case-markers.
Classification:
paper
Author:
JIN Mingzhe (Sapporo Gakuin University)
Title:
Determining the Writer of Diary Based on Distribution of Particles
Pages:
357-367
Descriptors:
Determinig the writer; distribution of particles; statistical analysis
Abstract:

Identifying the elements that make up a writer's characteristic writing style is one of the keys to determining the authorship or authenticity of a literary composition. When attempting to determine or recognize authorship, the choices of which data shall be used in analysis depend on the language used.
In Japanese, for example, the frequency of Chinese characters as well as placement of commas are indicators of a writer's characteristic style.
In the present study, the statistical analysis is based on the distribution of particles from 60 diary entries of 6 writers. It is contended that the distribution of particles differ from writer to writer and may therefore be considered one of the features that make up an individual's characteristic writing style, thus providing valuable information for determining or recognizing the authorship of a particular text.
Classification:
paper
Author:
TANAKA Yukari (Aoyama Gakuin Junior College) and YOSHIDA Kenji (Shoin Women's Junior College)
Title:
Process of Variant-loss and Its Constraints: A Case of Japanese Velar Nasal
Pages:
368-383
Descriptors:
varinat loss; lexical diffusion; order of variable constraints; velar nasal; third quantification theory
Abstract:

The main goal of this paper is to investigate how the loss of linguistic variants proceeds. Distribution of nasal and non-nasal variants of Japanese velar consonant are taken as an example of change in progress seen in many regions of Japan. Speech samples were taken from about 500 students of 7 junior high school in the western part of Yamanashi Prefecture. We analyzed the data with the third quantification theory. The arrangement turned out to be continuous, suggesting that the loss of nasal variant is proceeding in the pattern of lexical diffusion. The phonetic factors (preceding nasals/following low vowels) are stronger constraints than the lexical ones (non-borrowings/subject particle). The only significant extralinguistic constraint is the geographic one. Schools nearer to Tokyo go ahead of the others concerning this variant loss. That means nasal velar variant ceased to be a marker and has become an indicator. We posit a model of loss of nasal variant in the last part.
Classification:
note
Authors:
TAJIMA Ikudoh and FUNAHASHI Shohko (Nagoya University)
Title:
An Vocabulary System
Pages:
384-390
Descriptors:
vocabulary; VocaSys

Back to Homepage