Lexicostatistics, Glottochronology


Swadesh M. Lexico-statistic dating of prehistoric ethnic contacts: With special reference to North American Indians and Eskimos. Proceedings of the American Philosophical Society.
Towards greater accuracy in lexicostatistic dating.

In fact, I would argue that lexicostatistics and glottochronology are reasonably free from the threat of total extinction as long as the following situation has not been explicitly demonstrated on uncontroversial data: One or more such demonstrations would surely put an end to all debate about rates of lexical change; however, I know of no such examples, despite having worked closely with nearly a thousand different Swadesh lists from various families, and I strongly suspect that none will be discovered in the near future.

From a purely theoretical point, there is nothing new about this approach: Networks are, however, generally harder to interpret than trees — especially if we share the belief that neither trees nor networks in historical linguistics should be a goal in itself i. The big advantage of a tree is that each tree has a unique historical interpretation, whereas each network conceals a variety of scenarios. In fact, networks seem almost to be an unavoidable necessity. Consider, for instance, the equivalents of two different Swadesh meanings in three Indo-European languages indexes A and B indicate formal cognacy, i. From a purely formal standpoint, we would have little choice but to superimpose these two trees onto each other, getting a network representation, similar to the usual way this is done in genetics.

But such a projection would leave us no closer to answering the most important question: Somebody with no knowledge whatsoever of Indo-European historical studies would list the following possibilities: This is highly unlikely, since it goes against the uniformitarian principle: Let us now look at the larger picture. To the best of my knowledge, no tree structures have been proposed for Indo-European on which Irish and other Celtic languages would come out closer to Tocharian than Hindi and other Indo-Aryan languages.

This would imply that the common ancestry of Irish cluas and Tocharian klots is, most likely, an archaism: But here is the catch: Unlike the Irish and Tocharian forms, this root is found in a much larger number of branches, and, most importantly, it is unmotivated, i. This puts us in a difficult situation close to the one suggested in c: No historical linguist, however, would take seriously the possibility of contacts between Irish and Tocharian: Exclusive Celtic-Tocharian lexical and semantic isoglosses are quite rare, to say the least. There is only one other solution: How high is the probability of that assumption? In fact, Old Irish still has au — clearly confirming the hypothesis.

The importance of this process, which we may call unilateral independent semantic development (UISD for short), should not be underestimated. For some reason, it seems to be ignored in most works on lexicostatistics or, at least, is never paid all the attention that it deserves. But what about cluas and klautso? Their phonemic structures also coincide (at least, as far as the root is concerned), their meanings are identical, but the Indo-European word that they go back to must have, by all accounts, had a different meaning. And that structure, in turn, is itself created on the basis of a Swadesh wordlist where all the cognacies have already been marked.

Instead of regarding the situation as a sort of vicious circle, I prefer to view it as a variety of bootstrapping, where lexicostatistical analysis alternates, over and over again, with standard comparative research. The comparative evidence is then checked once again for identifiable cases of UISD. If any one given meaning at any one given time may evolve into different adjacent meanings, the probability of UISD anywhere, at any time, is quite low. But the real situation is different: As an example, one could quote Starostin, a paper that tries to verify several long-range hypotheses for language families of Eurasia based on lexicostatistics.

The comparison operates on proto-roots, reconstructed for Indo-European, Uralic, Kartvelian, Altaic, Dravidian, Semitic, North Caucasian, Sino-Tibetan, and Yeniseian protolanguages with varying degrees of reliability. Illich-Svitych, uniting the first six of the listed families, or S.

Although some of the individual etymologies are questionable on phonetic grounds, such numbers would seem to clearly support not only the very fact of relationship between these families, but also a rather surprising closeness of this relationship: A closer look, however, reveals that a typical comparison between the reconstructed protolanguages included in S. Opponents of long-range comparison would probably interpret these contradictions as confirmation of the fallacy of Nostratic and similar hypotheses: This is risky, since a family can consist of quite a few subbranches; if our comparison is not really between Proto-Indo-European and Proto-Uralic, but between approximately 10 to 15 daughter branches of Proto-Indo-European and a slightly smaller number of daughter branches of Proto-Uralic, this significantly increases the possibility of accidental similarities being mistaken for genuine cognacy.
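The effect of comparing many daughter branches rather than two protolanguages can be made concrete with a rough calculation. The sketch below is only an illustration under strong assumptions that are not in the source: every pair of branch-level forms has the same small chance `p_single` of looking like a cognate by accident, and the pairs are independent. The function name and the numbers are hypothetical.

```python
def chance_match_probability(p_single: float, branches_a: int, branches_b: int) -> float:
    """Probability that at least one accidental look-alike turns up for one meaning.

    Assumes (hypothetically) that each of the branches_a * branches_b form pairs
    independently has probability p_single of resembling a cognate by chance.
    """
    if not 0.0 <= p_single <= 1.0:
        raise ValueError("p_single must be a probability")
    if branches_a < 1 or branches_b < 1:
        raise ValueError("each family needs at least one branch")
    pairs = branches_a * branches_b
    # Complement of "no pair matches by accident".
    return 1.0 - (1.0 - p_single) ** pairs


# One protolanguage against one protolanguage: a single pair of forms.
proto_vs_proto = chance_match_probability(0.01, 1, 1)

# Twelve branches against ten branches: 120 pairs of forms for the same meaning.
branches_vs_branches = chance_match_probability(0.01, 12, 10)
```

With an illustrative 1% chance per pair, the second figure comes out at about 0.70, compared with 0.01 for a single pair, which is the inflation the paragraph above warns about.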

On the other hand, this certainly does not explain the very fact of widely varying figures: The fact that S. The reconstructed words share the exact same basic meaning and obey regular phonetic correspondences, originally formulated by V. Illich-Svitych for Nostratic. There are two important yet not well-studied parameters in this approach: Here, we derive two statistical principles from stochastic theorems to quantify these parameters. These principles validate the practice of using the Swadesh 100 and 200 word lists to indicate degree of relatedness between languages, and enable a frequency-based, dynamic threshold to detect recurrent sound correspondences.

Using statistical tests, we further evaluate the generality of the Swadesh 100 word list compared to the Swadesh 200 word list and other word lists sampled randomly from the Swadesh 200 word list. All these provide mathematical support for applying lexicostatistics in historical and comparative linguistics.

Introduction

In linguistics, quantitative approaches such as lexicostatistics and glottochronology have been widely applied to detect hypothetical genetic relations among languages (McMahon and McMahon; Campbell). Lexicostatistics refers to the statistical manipulation of lexical materials for historical inferences that abstract away from exact dates (Hymes). Lexicostatistics compares languages for phylogenetic affinity based on the proportion of cognates in a standard basic vocabulary list.
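As a minimal sketch of what "proportion of cognates" means in practice, the following assumes that each language's list maps a meaning to a cognate-class label (such as the indexes A and B used earlier), with `None` for a missing entry. The function name, the data layout, and the toy lists are assumptions made for illustration, not a standard format.

```python
from typing import Dict, Optional


def cognate_share(list_a: Dict[str, Optional[str]],
                  list_b: Dict[str, Optional[str]]) -> float:
    """Share of meanings attested in both lists whose cognate-class labels agree."""
    # Only compare slots that are filled in both languages.
    shared = [m for m in list_a
              if list_a[m] is not None and list_b.get(m) is not None]
    if not shared:
        raise ValueError("no meanings are attested in both lists")
    matches = sum(1 for m in shared if list_a[m] == list_b[m])
    return matches / len(shared)


# Hypothetical toy lists: labels mark cognate classes, not actual word forms.
lang_x = {"ear": "A", "eye": "A", "water": "A", "fire": "B", "dog": None}
lang_y = {"ear": "B", "eye": "A", "water": "A", "fire": "B", "dog": "A"}

share = cognate_share(lang_x, lang_y)  # 3 matches out of 4 comparable slots
```

Skipping slots that are empty in either language is one possible design choice; counting them as non-matches would lower the percentage.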

Each slot in the list is a concept (meaning), and collected items (words) occupying the same slot are compared cross-linguistically. We thus do not distinguish between the terms vocabulary list and meaning list. Unlike lexicostatistics, glottochronology deals in particular with the chronology of divergence among languages (Campbell). Strictly speaking, lexicostatistics is a broader approach than glottochronology, without specific assumptions such as a constant rate of word retention or loss. Computing lexicostatistics generally proceeds in the following steps (McMahon and McMahon; Campbell). It would be ideal to collect every word from the languages being compared, yet it is infeasible to obtain an exhaustive or very large-scale collection of words, especially for endangered or poorly documented languages.
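The constant-rate assumption that separates glottochronology from plain lexicostatistics can be illustrated with the classic formula t = ln(c) / (2 ln(r)), where c is the share of cognates between two languages and r is the assumed share of the list retained per millennium. The default of 0.86 below is the value commonly quoted for the 100-item list; both it and the function name are assumptions of this sketch, not figures taken from the text above.

```python
import math


def divergence_time(shared: float, retention: float = 0.86) -> float:
    """Estimated time since separation, in millennia, under a constant retention rate.

    shared    -- share of cognates between the two lists (0 < shared <= 1)
    retention -- assumed share of the list retained per millennium (0 < retention < 1)
    """
    if not 0.0 < shared <= 1.0:
        raise ValueError("shared must lie in (0, 1]")
    if not 0.0 < retention < 1.0:
        raise ValueError("retention must lie in (0, 1)")
    # Both languages change independently, hence the factor of 2.
    return math.log(shared) / (2.0 * math.log(retention))


# Two lists that each kept 86% of the original over one millennium
# are expected to share 0.86 * 0.86 of their items.
one_millennium = divergence_time(0.86 * 0.86)
```

If the retention rate is not constant across languages, meanings, or periods, the estimate has no firm basis, which is why the text treats lexicostatistics as the more general approach.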

Create family tree

Creation of the language tree is based solely on the table found above. Various sub-grouping methods can be used, but that adopted by Dyen, Kruskal and Black was: Calculations have to be of nucleus and group lexical percentages.

Applications

A leading exponent of lexicostatistics application has been Isidore Dyen. He used lexicostatistics to classify Austronesian languages as well as Indo-European ones. A major study of the latter was reported by Dyen, Kruskal and Black. Studies have also been carried out of Amerindian and African languages.

Criticisms

People such as Hoijer have shown that there were difficulties in finding equivalents to the meaning items, while many have found it necessary to modify Swadesh's lists.
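Returning to the tree-creation step, here is a rough sketch of how a tree might be derived from a table of pairwise cognate percentages. It uses plain average-linkage agglomerative grouping, which is a generic stand-in and not the specific procedure of Dyen, Kruskal and Black; the language names and percentages are invented for illustration.

```python
from itertools import combinations
from typing import Any, Dict, FrozenSet, List, Tuple


def build_tree(percentages: Dict[FrozenSet[str], float]) -> Any:
    """Repeatedly merge the two groups with the highest mean cognate percentage.

    Returns a nested tuple such as (("P", "Q"), ("R", "S")).
    """
    languages = sorted({lang for pair in percentages for lang in pair})
    if not languages:
        raise ValueError("the percentage table is empty")
    # Each group is (subtree, member languages).
    groups: List[Tuple[Any, Tuple[str, ...]]] = [(lang, (lang,)) for lang in languages]

    def mean_share(g1, g2) -> float:
        values = [percentages[frozenset((a, b))] for a in g1[1] for b in g2[1]]
        return sum(values) / len(values)

    while len(groups) > 1:
        # Pick the closest pair of groups by average percentage.
        i, j = max(combinations(range(len(groups)), 2),
                   key=lambda ij: mean_share(groups[ij[0]], groups[ij[1]]))
        merged = ((groups[i][0], groups[j][0]), groups[i][1] + groups[j][1])
        groups = [g for k, g in enumerate(groups) if k not in (i, j)] + [merged]
    return groups[0][0]


# Hypothetical table of shared-cognate percentages for four languages.
table = {
    frozenset(("P", "Q")): 80.0, frozenset(("R", "S")): 70.0,
    frozenset(("P", "R")): 40.0, frozenset(("P", "S")): 35.0,
    frozenset(("Q", "R")): 45.0, frozenset(("Q", "S")): 30.0,
}
tree = build_tree(table)
```

On this toy table, P and Q are grouped first, then R and S, and the two pairs are joined last. Other linkage rules can produce different trees from the same table.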

