This is a modern-English version of Language: An Introduction to the Study of Speech, originally written by Sapir, Edward. It has been thoroughly updated, including changes to sentence structure, words, spelling, and grammar—to ensure clarity for contemporary readers, while preserving the original spirit and nuance. If you click on a paragraph, you will see the original text that we modified, and you can toggle between the two versions.

Scroll to the bottom of this page and you will find a free ePUB download link for this book.

Language

An Introduction to the Study of Speech

by
Edward Sapir

1939

1921

Preface

This little book aims to give a certain perspective on the subject of language rather than to assemble facts about it. It has little to say of the ultimate psychological basis of speech and gives only enough of the actual descriptive or historical facts of particular languages to illustrate principles. Its main purpose is to show what I conceive language to be, what is its variability in place and time, and what are its relations to other fundamental human interests—the problem of thought, the nature of the historical process, race, culture, art.

This small book aims to provide a certain perspective on language rather than just gathering facts about it. It doesn't delve deeply into the ultimate psychological basis of speech and gives just enough actual descriptive or historical facts about specific languages to illustrate its principles. Its primary goal is to show what I believe language is, how it varies over time and in different places, and how it relates to other essential human interests—like the problem of thought, the nature of the historical process, race, culture, and art.

The perspective thus gained will be useful, I hope, both to linguistic students and to the outside public that is half inclined to dismiss linguistic notions as the private pedantries of essentially idle minds. Knowledge of the wider relations of their science is essential to professional students of language if they are to be saved from a sterile and purely technical attitude. Among contemporary writers of influence on liberal thought Croce is one of the very few who have gained an understanding of the fundamental significance of language. He has pointed out its close relation to the problem of art. I am deeply indebted to him for this insight. Quite aside from their intrinsic interest, linguistic forms and historical processes have the greatest possible diagnostic value for the understanding of some of the more difficult and elusive problems in the psychology of thought and in the strange, cumulative drift in the life of the human spirit that we call history or progress or evolution. This value depends chiefly on the unconscious and unrationalized nature of linguistic structure.

The perspective gained here will hopefully be useful to both language students and the general public, who often tend to dismiss language concepts as mere quirks of idle minds. For professional language students, understanding the broader context of their field is crucial to avoid falling into a dry, purely technical mindset. Among today's influential writers on liberal thought, Croce is one of the few who truly grasp the fundamental importance of language. He has highlighted its close connection to the issue of art, and I'm very grateful to him for this insight. Beyond their inherent interest, linguistic forms and historical developments offer immense diagnostic value for grappling with some of the more challenging and elusive issues in the psychology of thought and the strange, gradual changes in human spirit we refer to as history, progress, or evolution. This value mainly stems from the unconscious and unexamined nature of linguistic structure.

I have avoided most of the technical terms and all of the technical symbols of the linguistic academy. There is not a single diacritical mark in the book. Where possible, the discussion is based on English material. It was necessary, however, for the scheme of the book, which includes a consideration of the protean forms in which human thought has found expression, to quote some exotic instances. For these no apology seems necessary. Owing to limitations of space I have had to leave out many ideas or principles that I should have liked to touch upon. Other points have had to be barely hinted at in a sentence or flying phrase. Nevertheless, I trust that enough has here been brought together to serve as a stimulus for the more fundamental study of a neglected field.

I have steered clear of most technical terms and all the symbols from linguistic academia. There isn't a single diacritical mark in this book. Where I could, I based the discussion on English material. However, to fulfill the book's purpose, which includes exploring the various ways human thought has expressed itself, I had to include some unique examples. I don’t think I need to apologize for that. Due to space constraints, I had to exclude many ideas or principles I would have liked to cover. Other points had to be briefly hinted at with just a sentence or a quick phrase. Still, I hope that what I’ve included here will inspire a deeper exploration of this overlooked area of study.

I desire to express my cordial appreciation of the friendly advice and helpful suggestions of a number of friends who have read the work in manuscript, notably Profs. A. L. Kroeber and R. H. Lowie of the University of California, Prof. W. D. Wallis of Reed College, and Prof. J. Zeitlin of the University of Illinois.

I want to sincerely thank the friends who offered friendly advice and helpful suggestions after reading the manuscript, especially Profs. A. L. Kroeber and R. H. Lowie from the University of California, Prof. W. D. Wallis from Reed College, and Prof. J. Zeitlin from the University of Illinois.

Edward Sapir.

Ottawa, Ont.,

April 8, 1921.

Preface
chapter
1. Introductory: Language Defined
  Language a cultural, not a biologically inherited, function. Futility of interjectional and sound-imitative theories of the origin of speech. Definition of language. The psycho-physical basis of speech. Concepts and language. Is thought possible without language? Abbreviations and transfers of the speech process. The universality of language.
2. The Elements of Speech
  Sounds not properly elements of speech. Words and significant parts of words (radical elements, grammatical elements). Types of words. The word a formal, not a functional unit. The word has a real psychological existence. The sentence. The cognitive, volitional, and emotional aspects of speech. Feeling-tones of words.
3. The Sounds of Language
  The vast number of possible sounds. The articulating organs and their share in the production of speech sounds: lungs, glottal cords, nose, mouth and its parts. Vowel articulations. How and where consonants are articulated. The phonetic habits of a language. The “values” of sounds. Phonetic patterns.
4. Form in Language: Grammatical Processes
  Formal processes as distinct from grammatical functions. Intercrossing of the two points of view. Six main types of grammatical process. Word sequence as a method. Compounding of radical elements. Affixing: prefixes and suffixes; infixes. Internal vocalic change; consonantal change. Reduplication. Functional variations of stress; of pitch.
5. Form in Language: Grammatical Concepts
  Analysis of a typical English sentence. Types of concepts illustrated by it. Inconsistent expression of analogous concepts. How the same sentence may be expressed in other languages with striking differences in the selection and grouping of concepts. Essential and non-essential concepts. The mixing of essential relational concepts with secondary ones of more concrete order. Form for form’s sake. Classification of linguistic concepts: basic or concrete, derivational, concrete relational, pure relational. Tendency for these types of concepts to flow into each other. Categories expressed in various grammatical systems. Order and stress as relating principles in the sentence. Concord. Parts of speech: no absolute classification possible; noun and verb.
6. Types of Linguistic Structure
  The possibility of classifying languages. Difficulties. Classification into form-languages and formless languages not valid. Classification according to formal processes used not practicable. Classification according to degree of synthesis. “Inflective” and “agglutinative.” Fusion and symbolism as linguistic techniques. Agglutination. “Inflective” a confused term. Threefold classification suggested: what types of concepts are expressed? what is the prevailing technique? what is the degree of synthesis? Four fundamental conceptual types. Examples tabulated. Historical test of the validity of the suggested conceptual classification.
7. Language as a Historical Product: Drift
  Variability of language. Individual and dialectic variations. Time variation or “drift.” How dialects arise. Linguistic stocks. Direction or “slope” of linguistic drift. Tendencies illustrated in an English sentence. Hesitations of usage as symptomatic of the direction of drift. Leveling tendencies in English. Weakening of case elements. Tendency to fixed position in the sentence. Drift toward the invariable word.
8. Language as a Historical Product: Phonetic Law
  Parallels in drift in related languages. Phonetic law as illustrated in the history of certain English and German vowels and consonants. Regularity of phonetic law. Shifting of sounds without destruction of phonetic pattern. Difficulty of explaining the nature of phonetic drifts. Vowel mutation in English and German. Morphological influence on phonetic change. Analogical levelings to offset irregularities produced by phonetic laws. New morphological features due to phonetic change.
9. How Languages Influence Each Other
  Linguistic influences due to cultural contact. Borrowing of words. Resistances to borrowing. Phonetic modification of borrowed words. Phonetic interinfluencings of neighboring languages. Morphological borrowings. Morphological resemblances as vestiges of genetic relationship.
10. Language, Race, and Culture
  Naïve tendency to consider linguistic, racial, and cultural groupings as congruent. Race and language need not correspond. Cultural and linguistic boundaries not identical. Coincidences between linguistic cleavages and those of language and culture due to historical, not intrinsic psychological, causes. Language does not in any deep sense “reflect” culture.
11. Language and Literature
  Language as the material or medium of literature. Literature may move on the generalized linguistic plane or may be inseparable from specific linguistic conditions. Language as a collective art. Necessary esthetic advantages or limitations in any language. Style as conditioned by inherent features of the language. Prosody as conditioned by the phonetic dynamics of a language.
Index

I

Introductory: Language Defined

Speech is so familiar a feature of daily life that we rarely pause to define it. It seems as natural to man as walking, and only less so than breathing. Yet it needs but a moment’s reflection to convince us that this naturalness of speech is but an illusory feeling. The process of acquiring speech is, in sober fact, an utterly different sort of thing from the process of learning to walk. In the case of the latter function, culture, in other words, the traditional body of social usage, is not seriously brought into play. The child is individually equipped, by the complex set of factors that we term biological heredity, to make all the needed muscular and nervous adjustments that result in walking. Indeed, the very conformation of these muscles and of the appropriate parts of the nervous system may be said to be primarily adapted to the movements made in walking and in similar activities. In a very real sense the normal human being is predestined to walk, not because his elders will assist him to learn the art, but because his organism is prepared from birth, or even from the moment of conception, to take on all those expenditures of nervous energy and all those muscular adaptations that result in walking. To put it concisely, walking is an inherent, biological function of man.

Speech is such a common part of daily life that we rarely stop to define it. It feels as natural to us as walking, and only slightly less so than breathing. But just a moment of thought shows us that this feeling of speech being natural is actually misleading. The process of learning to speak is completely different from learning to walk. In walking, cultural factors, or the traditional norms of society, don't play a significant role. Instead, a child is individually equipped, thanks to the complex set of factors we call biological heredity, to make all the necessary muscular and nervous adjustments needed for walking. In fact, the structure of these muscles and the relevant parts of the nervous system are primarily suited for the movements involved in walking and similar activities. In a very real sense, a typical human is destined to walk, not because adults help them learn the skill, but because their body is ready from birth, or even from the moment of conception, to handle all the nerve energy and muscular changes required for walking. To sum it up, walking is a natural, biological function of humans.

Not so language. It is of course true that in a certain sense the individual is predestined to talk, but that is due entirely to the circumstance that he is born not merely in nature, but in the lap of a society that is certain, reasonably certain, to lead him to its traditions. Eliminate society and there is every reason to believe that he will learn to walk, if, indeed, he survives at all. But it is just as certain that he will never learn to talk, that is, to communicate ideas according to the traditional system of a particular society. Or, again, remove the new-born individual from the social environment into which he has come and transplant him to an utterly alien one. He will develop the art of walking in his new environment very much as he would have developed it in the old. But his speech will be completely at variance with the speech of his native environment. Walking, then, is a general human activity that varies only within circumscribed limits as we pass from individual to individual. Its variability is involuntary and purposeless. Speech is a human activity that varies without assignable limit as we pass from social group to social group, because it is a purely historical heritage of the group, the product of long-continued social usage. It varies as all creative effort varies—not as consciously, perhaps, but none the less as truly as do the religions, the beliefs, the customs, and the arts of different peoples. Walking is an organic, an instinctive, function (not, of course, itself an instinct); speech is a non-instinctive, acquired, “cultural” function.

Not so with language. It’s true that in some sense, a person is destined to speak, but that’s completely because they are born not just into nature, but into a society that will definitely lead them to its traditions. If you remove society, there’s every reason to think that a person will learn to walk, if they even survive at all. But it’s just as certain that they will never learn to speak, that is, to express ideas according to the traditional system of a specific society. Or, take a newborn away from the social environment they were born into and place them in a completely different one. They will learn to walk in their new setting much like they would have in the old one. However, their way of speaking will be completely different from the speech of their original environment. So, walking is a general human activity that changes only within limited boundaries as we move from individual to individual. Its variability happens naturally and without intention. Speech, on the other hand, is a human activity that varies without limit as we move from one social group to another, because it’s a purely historical legacy of the group, shaped by prolonged social use. It varies like all creative efforts do—not necessarily consciously, but just as genuinely as the religions, beliefs, customs, and arts of different cultures. Walking is an organic, instinctive function (not an instinct itself); speech is a non-instinctive, learned, “cultural” function.

There is one fact that has frequently tended to prevent the recognition of language as a merely conventional system of sound symbols, that has seduced the popular mind into attributing to it an instinctive basis that it does not really possess. This is the well-known observation that under the stress of emotion, say of a sudden twinge of pain or of unbridled joy, we do involuntarily give utterance to sounds that the hearer interprets as indicative of the emotion itself. But there is all the difference in the world between such involuntary expression of feeling and the normal type of communication of ideas that is speech. The former kind of utterance is indeed instinctive, but it is non-symbolic; in other words, the sound of pain or the sound of joy does not, as such, indicate the emotion, it does not stand aloof, as it were, and announce that such and such an emotion is being felt. What it does is to serve as a more or less automatic overflow of the emotional energy; in a sense, it is part and parcel of the emotion itself. Moreover, such instinctive cries hardly constitute communication in any strict sense. They are not addressed to any one, they are merely overheard, if heard at all, as the bark of a dog, the sound of approaching footsteps, or the rustling of the wind is heard. If they convey certain ideas to the hearer, it is only in the very general sense in which any and every sound or even any phenomenon in our environment may be said to convey an idea to the perceiving mind. If the involuntary cry of pain which is conventionally represented by “Oh!” be looked upon as a true speech symbol equivalent to some such idea as “I am in great pain,” it is just as allowable to interpret the appearance of clouds as an equivalent symbol that carries the definite message “It is likely to rain.” A definition of language, however, that is so extended as to cover every type of inference becomes utterly meaningless.

There’s one fact that often stops from being recognized as just a conventional system of sound symbols, leading people to mistakenly think it has an instinctive basis that it doesn’t actually have. This is the common observation that when we feel strong emotions—like a sudden jolt of pain or overwhelming joy—we involuntarily make sounds that listeners interpret as showing that emotion. However, there’s a huge difference between this involuntary expression of feeling and the regular way we communicate ideas through speech. The first type of sound is indeed instinctive, but it’s non-symbolic; that is, the sound of pain or joy doesn’t inherently indicate the emotion; it doesn’t step back and declare that a particular emotion is being experienced. Instead, it acts as a more or less automatic overflow of emotional energy; in a way, it’s part of the emotion itself. Furthermore, these instinctive cries hardly count as communication in any strict sense. They aren’t directed at anyone; they’re just overheard—if they’re heard at all—like a dog barking, footsteps approaching, or the wind rustling. If they do convey specific ideas to the listener, it’s only in the general way that any sound or even any phenomenon in our environment might convey an idea to the observing mind. If we treat the involuntary cry of pain that we usually express as “Oh!” as a true speech symbol equivalent to something like “I am in great pain,” it’s just as valid to interpret the sight of clouds as an equivalent symbol that carries the clear message “It’s likely to rain.” However, a definition of language that is so broad that it includes every type of inference becomes entirely meaningless.

The mistake must not be made of identifying our conventional interjections (our oh! and ah! and sh!) with the instinctive cries themselves. These interjections are merely conventional fixations of the natural sounds. They therefore differ widely in various languages in accordance with the specific phonetic genius of each of these. As such they may be considered an integral portion of speech, in the properly cultural sense of the term, being no more identical with the instinctive cries themselves than such words as “cuckoo” and “kill-deer” are identical with the cries of the birds they denote or than Rossini’s treatment of a storm in the overture to “William Tell” is in fact a storm. In other words, the interjections and sound-imitative words of normal speech are related to their natural prototypes as is art, a purely social or cultural thing, to nature. It may be objected that, though the interjections differ somewhat as we pass from language to language, they do nevertheless offer striking family resemblances and may therefore be looked upon as having grown up out of a common instinctive base. But their case is nowise different from that, say, of the varying national modes of pictorial representation. A Japanese picture of a hill both differs from and resembles a typical modern European painting of the same kind of hill. Both are suggested by and both “imitate” the same natural feature. Neither the one nor the other is the same thing as, or, in any intelligible sense, a direct outgrowth of, this natural feature. The two modes of representation are not identical because they proceed from differing historical traditions, are executed with differing pictorial techniques. The interjections of Japanese and English are, just so, suggested by a common natural prototype, the instinctive cries, and are thus unavoidably suggestive of each other. They differ, now greatly, now but little, because they are builded out of historically diverse materials or techniques, the respective linguistic traditions, phonetic systems, speech habits of the two peoples. Yet the instinctive cries as such are practically identical for all humanity, just as the human skeleton or nervous system is to all intents and purposes a “fixed,” that is, an only slightly and “accidentally” variable, feature of man’s organism.

The mistake should not be made of equating our conventional interjections (like oh! and ah! and sh!) with the instinctive cries themselves. These interjections are simply conventional representations of the natural sounds. They vary widely across different languages according to the unique phonetic characteristics of each. Thus, they can be seen as an essential part of speech, in the cultural sense, and are no more the same as the instinctive cries than words like “cuckoo” or “kill-deer” are the same as the sounds made by the birds they refer to, or than Rossini’s portrayal of a storm in the overture to “William Tell” is an actual storm. In other words, interjections and sound-imitating words in normal speech relate to their natural counterparts like art, which is a purely social or cultural entity, relates to nature. One might argue that, even though interjections vary from language to language, they still show notable similarities and could therefore be considered to have developed from a common instinctive origin. However, this situation is no different from that of the different national styles of visual representation. A Japanese painting of a hill both differs from and resembles a typical modern European painting of a similar hill. Both are inspired by and both “imitate” the same natural feature. Neither one is the same as, or directly derived from, this natural feature in any meaningful way. The two ways of representation are not identical because they come from different historical traditions and use different artistic techniques. Similarly, the interjections of Japanese and English are inspired by a common natural source, the instinctive cries, and thus inevitably suggest similarities to one another. They differ, sometimes greatly and sometimes only slightly, because they are constructed from historically varied materials or techniques, reflecting the respective linguistic traditions, phonetic systems, and speech habits of the two cultures. Yet the instinctive cries themselves are practically identical for all humanity, just as the human skeleton or nervous system is essentially a “fixed,” only slightly and “accidentally” variable, aspect of human biology.

Interjections are among the least important of speech elements. Their discussion is valuable mainly because it can be shown that even they, avowedly the nearest of all language sounds to instinctive utterance, are only superficially of an instinctive nature. Were it therefore possible to demonstrate that the whole of language is traceable, in its ultimate historical and psychological foundations, to the interjections, it would still not follow that language is an instinctive activity. But, as a matter of fact, all attempts so to explain the origin of speech have been fruitless. There is no tangible evidence, historical or otherwise, tending to show that the mass of speech elements and speech processes has evolved out of the interjections. These are a very small and functionally insignificant proportion of the vocabulary of language; at no time and in no linguistic province that we have record of do we see a noticeable tendency towards their elaboration into the primary warp and woof of language. They are never more, at best, than a decorative edging to the ample, complex fabric.

Interjections are among the least significant elements of speech. Their discussion is mostly valuable because it shows that even these, supposedly the closest language sounds to instinctive expression, are only superficially instinctive. Therefore, even if it could be proven that all of language traces back to interjections in its ultimate historical and psychological roots, it wouldn't mean that language is an instinctive activity. However, all attempts to explain the origin of speech in this way have been unsuccessful. There's no concrete evidence, historical or otherwise, suggesting that the majority of speech elements and processes evolved from interjections. These represent a very small and functionally insignificant part of the vocabulary of language; at no time and in no linguistic area that we have records of do we see a clear trend towards their development into the core structure of language. They are, at best, just a decorative fringe to the rich, complex fabric.

What applies to the interjections applies with even greater force to the sound-imitative words. Such words as “whippoorwill,” “to mew,” “to caw” are in no sense natural sounds that man has instinctively or automatically reproduced. They are just as truly creations of the human mind, flights of the human fancy, as anything else in language. They do not directly grow out of nature, they are suggested by it and play with it. Hence the onomatopoetic theory of the origin of speech, the theory that would explain all speech as a gradual evolution from sounds of an imitative character, really brings us no nearer to the instinctive level than is language as we know it to-day. As to the theory itself, it is scarcely more credible than its interjectional counterpart. It is true that a number of words which we do not now feel to have a sound-imitative value can be shown to have once had a phonetic form that strongly suggests their origin as imitations of natural sounds. Such is the English word “to laugh.” For all that, it is quite impossible to show, nor does it seem intrinsically reasonable to suppose, that more than a negligible proportion of the elements of speech or anything at all of its formal apparatus is derivable from an onomatopoetic source. However much we may be disposed on general principles to assign a fundamental importance in the languages of primitive peoples to the imitation of natural sounds, the actual fact of the matter is that these languages show no particular preference for imitative words. Among the most primitive peoples of aboriginal America, the Athabaskan tribes of the Mackenzie River speak languages in which such words seem to be nearly or entirely absent, while they are used freely enough in languages as sophisticated as English and German. Such an instance shows how little the essential nature of speech is concerned with the mere imitation of things.

What applies to interjections applies even more strongly to sound-imitative words. Words like “whippoorwill,” “to mew,” and “to caw” are not natural sounds that humans instinctively or automatically reproduce. They are just as much creations of the human mind and flights of human imagination as anything else in language. They don't directly arise from nature; instead, they are inspired by it and play with it. Therefore, the onomatopoetic theory of the origin of speech, which suggests that all speech gradually evolves from imitative sounds, doesn't bring us any closer to understanding instinctive communication than the language we use today. The theory itself is hardly more believable than its counterpart related to interjections. It's true that some words we no longer perceive as imitative might have originally had phonetic forms closely resembling natural sounds. Take the English word “to laugh,” for example. Still, it's pretty much impossible to show—or even reasonable to assume—that more than a tiny fraction of speech elements or any part of its formal structure comes from an onomatopoetic source. While we might generally want to assert the importance of imitating natural sounds in the languages of primitive peoples, the reality is that these languages do not show a particular preference for imitative words. Among the most primitive groups in aboriginal America, like the Athabaskan tribes of the Mackenzie River, their languages seem to lack such words almost entirely, while they are quite common in more sophisticated languages like English and German. This instance illustrates how little the fundamental nature of speech relates to mere imitation of things.

The way is now cleared for a serviceable definition of language. Language is a purely human and non-instinctive method of communicating ideas, emotions, and desires by means of a system of voluntarily produced symbols. These symbols are, in the first instance, auditory and they are produced by the so-called “organs of speech.” There is no discernible instinctive basis in human speech as such, however much instinctive expressions and the natural environment may serve as a stimulus for the development of certain elements of speech, however much instinctive tendencies, motor and other, may give a predetermined range or mold to linguistic expression. Such human or animal communication, if “communication” it may be called, as is brought about by involuntary, instinctive cries is not, in our sense, language at all.

The path is now clear for a practical definition of language. Language is a uniquely human and non-instinctive way to communicate ideas, emotions, and desires through a system of voluntarily created symbols. These symbols are primarily auditory and are produced by the so-called “organs of speech.” There is no clear instinctive basis for human speech itself, even though instinctive expressions and the natural environment can encourage the development of certain speech elements, and despite the fact that instinctive tendencies, both motor and otherwise, may shape how we express ourselves linguistically. Such human or animal communication, if we can even call it “communication,” that occurs through involuntary, instinctive cries is not, in our understanding, language at all.

I have just referred to the “organs of speech,” and it would seem at first blush that this is tantamount to an admission that speech itself is an instinctive, biologically predetermined activity. We must not be misled by the mere term. There are, properly speaking, no organs of speech; there are only organs that are incidentally useful in the production of speech sounds. The lungs, the larynx, the palate, the nose, the tongue, the teeth, and the lips, are all so utilized, but they are no more to be thought of as primary organs of speech than are the fingers to be considered as essentially organs of piano-playing or the knees as organs of prayer. Speech is not a simple activity that is carried on by one or more organs biologically adapted to the purpose. It is an extremely complex and ever-shifting network of adjustments—in the brain, in the nervous system, and in the articulating and auditory organs—tending towards the desired end of communication. The lungs developed, roughly speaking, in connection with the necessary biological function known as breathing; the nose, as an organ of smell; the teeth, as organs useful in breaking up food before it was ready for digestion. If, then, these and other organs are being constantly utilized in speech, it is only because any organ, once existent and in so far as it is subject to voluntary control, can be utilized by man for secondary purposes. Physiologically, speech is an overlaid function, or, to be more precise, a group of overlaid functions. It gets what service it can out of organs and functions, nervous and muscular, that have come into being and are maintained for very different ends than its own.

I just mentioned the “organs of speech,” and at first glance, it might seem like I’m saying that speech is instinctive and biologically predetermined. But let's not be fooled by the term. There aren't actually any specific organs of speech; there are just organs that happen to be useful for producing speech sounds. The lungs, larynx, palate, nose, tongue, teeth, and lips are all used, but they shouldn't be considered primary organs of speech any more than fingers should be seen as essential for playing the piano or knees as organs of prayer. Speech isn’t a simple action carried out by one or more organs naturally suited for it. It's a highly complex and constantly changing network of adjustments—in the brain, nervous system, and the organs involved in speaking and hearing—aimed at achieving effective communication. The lungs developed mainly for the essential biological function of breathing; the nose serves as an organ of smell; the teeth help break down food before digestion. So, these organs and others are continually used in speech, but that’s only because any organ, once it exists and can be voluntarily controlled, can be used by humans for secondary purposes. Physiologically, speech is a complex function, or more precisely, a collection of complex functions. It makes use of organs and functions—nervous and muscular—that were developed and maintained for very different reasons than communication.

It is true that physiological psychologists speak of the localization of speech in the brain. This can only mean that the sounds of speech are localized in the auditory tract of the brain, or in some circumscribed portion of it, precisely as other classes of sounds are localized; and that the motor processes involved in speech (such as the movements of the glottal cords in the larynx, the movements of the tongue required to pronounce the vowels, lip movements required to articulate certain consonants, and numerous others) are localized in the motor tract precisely as are all other impulses to special motor activities. In the same way control is lodged in the visual tract of the brain over all those processes of visual recognition involved in reading. Naturally the particular points or clusters of points of localization in the several tracts that refer to any element of language are connected in the brain by paths of association, so that the outward, or psycho-physical, aspect of language, is of a vast network of associated localizations in the brain and lower nervous tracts, the auditory localizations being without doubt the most fundamental of all for speech. However, a speechsound localized in the brain, even when associated with the particular movements of the “speech organs” that are required to produce it, is very far from being an element of language. It must be further associated with some element or group of elements of experience, say a visual image or a class of visual images or a feeling of relation, before it has even rudimentary linguistic significance. This “element” of experience is the content or “meaning” of the linguistic unit; the associated auditory, motor, and other cerebral processes that lie immediately back of the act of speaking and the act of hearing speech are merely a complicated symbol of or signal for these “meanings,” of which more anon. We see therefore at once that language as such is not and cannot be definitely localized, for it consists of a peculiar symbolic relation—physiologically an arbitrary one—between all possible elements of consciousness on the one hand and certain selected elements localized in the auditory, motor, and other cerebral and nervous tracts on the other. If language can be said to be definitely “localized” in the brain, it is only in that general and rather useless sense in which all aspects of consciousness, all human interest and activity, may be said to be “in the brain.” Hence, we have no recourse but to accept language as a fully formed functional system within man’s psychic or “spiritual” constitution. We cannot define it as an entity in psycho-physical terms alone, however much the psycho-physical basis is essential to its functioning in the individual.

It’s true that physiological psychologists talk about where speech is located in the brain. This means that the sounds we produce while speaking are found in the auditory area of the brain, just like other types of sounds are, and that the motor functions involved in speech (like the movements of the vocal cords in the larynx, the tongue movements needed to pronounce vowels, lip movements necessary for certain consonants, and many others) are found in the motor area, just as all other impulses for specific motor activities are. Similarly, the brain has control over all the processes of visual recognition involved in reading. Naturally, the specific areas of the brain related to different elements of language are interconnected by pathways, forming a vast network of associations in the brain and lower nervous system, with auditory areas being the most essential for speech. However, a sound associated with speech, even when connected to the specific movements of the “speech organs” needed to produce it, isn’t enough to be considered an element of language. It also needs to be linked to some element or set of experiences, like a visual image or a feeling of connection, before it has any basic linguistic meaning. This “element” of experience is the content or “meaning” of the language unit; the related auditory, motor, and other brain processes behind speaking and hearing speech are just complicated symbols or signals for these “meanings.” Thus, we realize right away that language itself isn't something we can definitively localize because it’s a unique symbolic relationship—physiologically arbitrary—between all possible elements of consciousness on one side and specific elements located in the auditory, motor, and other brain and nervous areas on the other. If we say that language can be “localized” in the brain, it’s only in the general and rather unhelpful way that we can say all aspects of consciousness, all human interests and activities, are “in the brain.” Therefore, we have no choice but to accept language as a fully developed functional system within human psychology or “spirit.” We can’t define it only as a physical entity, even though the psycho-physical foundation is crucial for its functioning in individuals.

From the physiologist’s or psychologist’s point of view we may seem to be making an unwarrantable abstraction in desiring to handle the subject of speech without constant and explicit reference to that basis. However, such an abstraction is justifiable. We can profitably discuss the intention, the form, and the history of speech, precisely as we discuss the nature of any other phase of human culture—say art or religion—as an institutional or cultural entity, leaving the organic and psychological mechanisms back of it as something to be taken for granted. Accordingly, it must be clearly understood that this introduction to the study of speech is not concerned with those aspects of physiology and of physiological psychology that underlie speech. Our study of language is not to be one of the genesis and operation of a concrete mechanism; it is, rather, to be an inquiry into the function and form of the arbitrary systems of symbolism that we term languages.

From the perspective of a physiologist or psychologist, it might seem unreasonable to talk about speech without consistently and clearly referring back to its foundation. However, this abstraction is justified. We can effectively discuss the intention, structure, and history of speech, just like we talk about any other aspect of human culture—like art or religion—as a cultural or institutional entity, putting aside the organic and psychological mechanisms behind it as something we take for granted. Therefore, it's important to understand that this introduction to the study of speech does not focus on the physiological aspects or physiological psychology that support speech. Our study of language won't be about the origins and workings of a concrete mechanism; instead, it will be an exploration of the function and structure of the arbitrary systems of symbols that we call languages.

I have already pointed out that the essence of language consists in the assigning of conventional, voluntarily articulated, sounds, or of their equivalents, to the diverse elements of experience. The word “house” is not a linguistic fact if by it is meant merely the acoustic effect produced on the ear by its constituent consonants and vowels, pronounced in a certain order; nor the motor processes and tactile feelings which make up the articulation of the word; nor the visual perception on the part of the hearer of this articulation; nor the visual perception of the word “house” on the written or printed page; nor the motor processes and tactile feelings which enter into the writing of the word; nor the memory of any or all of these experiences. It is only when these, and possibly still other, associated experiences are automatically associated with the image of a house that they begin to take on the nature of a symbol, a word, an element of language. But the mere fact of such an association is not enough. One might have heard a particular word spoken in an individual house under such impressive circumstances that neither the word nor the image of the house ever recur in consciousness without the other becoming present at the same time. This type of association does not constitute speech. The association must be a purely symbolic one; in other words, the word must denote, tag off, the image, must have no other significance than to serve as a counter to refer to it whenever it is necessary or convenient to do so. Such an association, voluntary and, in a sense, arbitrary as it is, demands a considerable exercise of self-conscious attention. At least to begin with, for habit soon makes the association nearly as automatic as any and more rapid than most.

I have already pointed out that the essence of language lies in assigning conventional, intentionally produced sounds, or their equivalents, to the various elements of experience. The word “house” is not just a linguistic fact if it only refers to the sound it makes when pronounced, the physical actions and sensations involved in saying it, the way it looks when seen, or the memories associated with any of these experiences. It is only when these and potentially other related experiences automatically connect with the image of a house that they start to function as a symbol, a word, an element of language. However, just having that association isn’t enough. You might hear a specific word said in a particular house during a significant moment, so that the word and the image of that house always appear together in your mind. This kind of association does not create language. The connection must be purely symbolic; in other words, the word must represent that image, serving solely as a reference to it whenever needed. This kind of association, though voluntary and somewhat arbitrary, requires a considerable amount of conscious effort at first. However, over time, habit makes the association almost automatic and quicker than most.

But we have traveled a little too fast. Were the symbol “house”—whether an auditory, motor, or visual experience or image—attached but to the single image of a particular house once seen, it might perhaps, by an indulgent criticism, be termed an element of speech, yet it is obvious at the outset that speech so constituted would have little or no value for purposes of communication. The world of our experiences must be enormously simplified and generalized before it is possible to make a symbolic inventory of all our experiences of things and relations; and this inventory is imperative before we can convey ideas. The elements of language, the symbols that ticket off experience, must therefore be associated with whole groups, delimited classes, of experience rather than with the single experiences themselves. Only so is communication possible, for the single experience lodges in an individual consciousness and is, strictly speaking, incommunicable. To be communicated it needs to be referred to a class which is tacitly accepted by the community as an identity. Thus, the single impression which I have had of a particular house must be identified with all my other impressions of it. Further, my generalized memory or my “notion” of this house must be merged with the notions that all other individuals who have seen the house have formed of it. The particular experience that we started with has now been widened so as to embrace all possible impressions or images that sentient beings have formed or may form of the house in question. This first simplification of experience is at the bottom of a large number of elements of speech, the so-called proper nouns or names of single individuals or objects. It is, essentially, the type of simplification which underlies, or forms the crude subject of, history and art. But we cannot be content with this measure of reduction of the infinity of experience. We must cut to the bone of things, we must more or less arbitrarily throw whole masses of experience together as similar enough to warrant their being looked upon—mistakenly, but conveniently—as identical. This house and that house and thousands of other phenomena of like character are thought of as having enough in common, in spite of great and obvious differences of detail, to be classed under the same heading. In other words, the speech element “house” is the symbol, first and foremost, not of a single perception, nor even of the notion of a particular object, but of a “concept,” in other words, of a convenient capsule of thought that embraces thousands of distinct experiences and that is ready to take in thousands more. If the single significant elements of speech are the symbols of concepts, the actual flow of speech may be interpreted as a record of the setting of these concepts into mutual relations.

But we've moved a bit too quickly. If the symbol “house”—whether it’s something we hear, do, or see—was only linked to the one image of a specific house we once saw, it might, with some leniency, be called a form of speech. However, it’s clear right away that speech like that would have little or no value for communication. We need to drastically simplify and generalize our experiences before we can create a symbolic catalog of everything we've encountered and the relationships between them. This catalog is essential for us to express ideas. The components of language, the symbols that represent our experiences, must therefore connect to whole groups or defined categories of experience rather than to individual experiences themselves. That's the only way communication can happen because a single experience resides in an individual mind and, technically speaking, cannot be communicated. To share it, it must be related to a category that the community implicitly agrees identifies it. So, the single impression I have of a specific house needs to be linked with all my other impressions of it. Moreover, my generalized memory or “concept” of this house must merge with the concepts that everyone else who has seen the house has created. The specific experience we began with has now expanded to include all possible impressions or images that any sentient being has formed or might form of the house in question. This initial simplification of experience underlies many elements of language, like proper nouns or the names of individual objects. It’s fundamentally the kind of simplification that serves as the basic subject of history and art. But we can’t settle for just this level of reduction of the infinite array of experience. We must dig deep, and we must more or less arbitrarily group large amounts of experiences as similar enough to justify considering them—though mistaken, but conveniently—as identical. This house, that house, and thousands of other similar phenomena are thought to share enough in common, despite significant and obvious differences, to be grouped together. In other words, the speech element “house” symbolizes, first and foremost, not a single perception or even the idea of a specific object, but rather a “concept,” essentially a handy bundle of thought that includes thousands of distinct experiences and is ready to incorporate even more. If the individual crucial elements of speech are the symbols of concepts, the actual flow of speech can be understood as a record of how these concepts relate to one another.

The question has often been raised whether thought is possible without speech; further, if speech and thought be not but two facets of the same psychic process. The question is all the more difficult because it has been hedged about by misunderstandings. In the first place, it is well to observe that whether or not thought necessitates symbolism, that is speech, the flow of language itself is not always indicative of thought. We have seen that the typical linguistic element labels a concept. It does not follow from this that the use to which language is put is always or even mainly conceptual. We are not in ordinary life so much concerned with concepts as such as with concrete particularities and specific relations. When I say, for instance, “I had a good breakfast this morning,” it is clear that I am not in the throes of laborious thought, that what I have to transmit is hardly more than a pleasurable memory symbolically rendered in the grooves of habitual expression. Each element in the sentence defines a separate concept or conceptual relation or both combined, but the sentence as a whole has no conceptual significance whatever. It is somewhat as though a dynamo capable of generating enough power to run an elevator were operated almost exclusively to feed an electric door-bell. The parallel is more suggestive than at first sight appears. Language may be looked upon as an instrument capable of running a gamut of psychic uses. Its flow not only parallels that of the inner content of consciousness, but parallels it on different levels, ranging from the state of mind that is dominated by particular images to that in which abstract concepts and their relations are alone at the focus of attention and which is ordinarily termed reasoning. Thus the outward form only of language is constant; its inner meaning, its psychic value or intensity, varies freely with attention or the selective interest of the mind, also, needless to say, with the mind’s general development. From the point of view of language, thought may be defined as the highest latent or potential content of speech, the content that is obtained by interpreting each of the elements in the flow of language as possessed of its very fullest conceptual value. From this it follows at once that language and thought are not strictly coterminous. At best language can but be the outward facet of thought on the highest, most generalized, level of symbolic expression. To put our viewpoint somewhat differently, language is primarily a pre-rational function. It humbly works up to the thought that is latent in, that may eventually be read into, its classifications and its forms; it is not, as is generally but naïvely assumed, the final label put upon, the finished thought.

The question often comes up whether thought can exist without speech, and if speech and thought are just two sides of the same mental process. This question is even trickier because it's surrounded by misunderstandings. First of all, it's important to note that whether thought requires symbols, which means speech, the flow of language doesn’t always reflect thought. We’ve observed that typical language elements label a concept. However, this doesn’t mean that language is always or primarily conceptual in its use. In everyday life, we’re more focused on specific details and particular relationships than on concepts themselves. For example, when I say, “I had a good breakfast this morning,” it's clear that I’m not deeply engaged in complex thought; what I’m sharing is simply a pleasant memory expressed in familiar language. Each part of the sentence defines a separate concept or relationship, but the sentence as a whole doesn’t carry any significant conceptual meaning. It's like having a dynamo that could generate enough energy to power an elevator but is mostly used to run an electric doorbell. This comparison is more relevant than it might seem at first glance. Language can be viewed as a tool that serves a range of mental purposes. Its flow not only matches the inner content of consciousness but does so across different levels, from states of mind focused on specific images to those where abstract concepts and their relationships take center stage, usually referred to as reasoning. Thus, the outward form of language remains consistent, while its inner meaning, psychic value, or intensity varies freely based on attention or the mind's selective interest and its overall development. From the perspective of language, thought can be defined as the highest potential content of speech, the content one gets by interpreting each element in the flow of language as having its fullest conceptual value. This means that language and thought are not exactly the same. At most, language is just the outer expression of thought at a high, generalized level of symbolic communication. To reframe our perspective, language is primarily a pre-rational function. It cautiously works its way toward the thoughts that are latent within it, which may eventually be drawn out from its classifications and forms; it is not, as is often naively assumed, the final label placed on completed thought.

Most people, asked if they can think without speech, would probably answer, “Yes, but it is not easy for me to do so. Still I know it can be done.” Language is but a garment! But what if language is not so much a garment as a prepared road or groove? It is, indeed, in the highest degree likely that language is an instrument originally put to uses lower than the conceptual plane and that thought arises as a refined interpretation of its content. The product grows, in other words, with the instrument, and thought may be no more conceivable, in its genesis and daily practice, without speech than is mathematical reasoning practicable without the lever of an appropriate mathematical symbolism. No one believes that even the most difficult mathematical proposition is inherently dependent on an arbitrary set of symbols, but it is impossible to suppose that the human mind is capable of arriving at or holding such a proposition without the symbolism. The writer, for one, is strongly of the opinion that the feeling entertained by so many that they can think, or even reason, without language is an illusion. The illusion seems to be due to a number of factors. The simplest of these is the failure to distinguish between imagery and thought. As a matter of fact, no sooner do we try to put an image into conscious relation with another than we find ourselves slipping into a silent flow of words. Thought may be a natural domain apart from the artificial one of speech, but speech would seem to be the only road we know of that leads to it. A still more fruitful source of the illusive feeling that language may be dispensed with in thought is the common failure to realize that language is not identical with its auditory symbolism. The auditory symbolism may be replaced, point for point, by a motor or by a visual symbolism (many people can read, for instance, in a purely visual sense, that is, without the intermediating link of an inner flow of the auditory images that correspond to the printed or written words) or by still other, more subtle and elusive, types of transfer that are not so easy to define. Hence the contention that one thinks without language merely because he is not aware of a coexisting auditory imagery is very far indeed from being a valid one. One may go so far as to suspect that the symbolic expression of thought may in some cases run along outside the fringe of the conscious mind, so that the feeling of a free, nonlinguistic stream of thought is for minds of a certain type a relatively, but only a relatively, justified one. Psycho-physically, this would mean that the auditory or equivalent visual or motor centers in the brain, together with the appropriate paths of association, that are the cerebral equivalent of speech, are touched off so lightly during the process of thought as not to rise into consciousness at all. This would be a limiting case—thought riding lightly on the submerged crests of speech, instead of jogging along with it, hand in hand. The modern psychology has shown us how powerfully symbolism is at work in the unconscious mind. It is therefore easier to understand at the present time than it would have been twenty years ago that the most rarefied thought may be but the conscious counterpart of an unconscious linguistic symbolism.

Most people, when asked if they can think without using words, would probably say, “Yes, but it’s not easy for me. Still, I know it can be done.” Language is just a covering! But what if language isn’t just a covering but more like a prepared pathway? It’s very likely that language is a tool originally used for simpler purposes and that thought develops as a refined understanding of its meaning. In other words, the product evolves with the tool, and thought might be just as unimaginable, in its development and daily application, without speech as mathematical reasoning is without the aid of specific mathematical symbols. No one thinks that even the most complex mathematical concept relies solely on a random set of symbols, but it’s hard to imagine the human mind developing or grasping such a concept without those symbols. I strongly believe that the widespread notion that people can think or reason without language is misleading. This misconception seems to arise from several factors. The simplest is the inability to separate imagery from thought. In fact, as soon as we try to connect one image to another, we often find ourselves slipping into a silent stream of words. Thought may exist in a natural space apart from the artificial space of speech, but speech seems to be the only route we know that leads to it. An even more significant source of the misleading belief that language can be left out of thought is the common misunderstanding that language is the same as its spoken representation. The spoken representation can be replaced, point for point, by motor or visual symbols (many people can read purely visually, for example, without the inner sound of words that correspond to the written text) or by other, more nuanced types of transfer that are harder to define. Thus, the idea that someone thinks without language simply because they don’t notice accompanying auditory imagery is far from valid. One might even suspect that the symbolic expression of thought can sometimes occur just outside the conscious mind, which means that the feeling of a free, nonverbal stream of thought is, for certain minds, somewhat justified, but only somewhat. Psychologically and physiologically, this would imply that the auditory or equivalent visual or motor centers in the brain, along with the relevant pathways of association—the brain's equivalent of speech—are activated so lightly during thought that they don’t reach consciousness at all. This would be a limiting case—thought gliding lightly on the hidden waves of speech, instead of moving along with it, side by side. Modern psychology has revealed how strongly symbolism operates in the unconscious mind. Therefore, it’s easier now than it was twenty years ago to understand that the most abstract thoughts may simply be the conscious equivalent of an unconscious linguistic symbolism.

One word more as to the relation between language and thought. The point of view that we have developed does not by any means preclude the possibility of the growth of speech being in a high degree dependent on the development of thought. We may assume that language arose pre-rationally—just how and on what precise level of mental activity we do not know—but we must not imagine that a highly developed system of speech symbols worked itself out before the genesis of distinct concepts and of thinking, the handling of concepts. We must rather imagine that thought processes set in, as a kind of psychic overflow, almost at the beginning of linguistic expression; further, that the concept, once defined, necessarily reacted on the life of its linguistic symbol, encouraging further linguistic growth. We see this complex process of the interaction of language and thought actually taking place under our eyes. The instrument makes possible the product, the product refines the instrument. The birth of a new concept is invariably foreshadowed by a more or less strained or extended use of old linguistic material; the concept does not attain to individual and independent life until it has found a distinctive linguistic embodiment. In most cases the new symbol is but a thing wrought from linguistic material already in existence in ways mapped out by crushingly despotic precedents. As soon as the word is at hand, we instinctively feel, with something of a sigh of relief, that the concept is ours for the handling. Not until we own the symbol do we feel that we hold a key to the immediate knowledge or understanding of the concept. Would we be so ready to die for “liberty,” to struggle for “ideals,” if the words themselves were not ringing within us? And the word, as we know, is not only a key; it may also be a fetter.

One last thing about the relationship between language and thought. The perspective we've developed doesn’t rule out the possibility that the evolution of language is greatly dependent on the advancement of thought. We can assume that language emerged before rational thought—exactly how and at what specific level of mental activity, we don’t know—but we shouldn’t believe that a complex system of speech symbols was fully formed before clear concepts and thinking came about. Instead, we should think of thought processes as beginning, in a way, as a kind of mental overflow, almost alongside the start of spoken expression. Moreover, once a concept is defined, it inevitably affects the life of its linguistic symbol, promoting further linguistic development. We can observe this intricate process of the interaction between language and thought happening right before us. The tool allows for the creation, and the creation shapes the tool. The emergence of a new concept is usually preceded by a somewhat strained or expanded use of existing linguistic material; the concept doesn’t achieve an individual and independent existence until it has found a unique linguistic expression. In most cases, the new symbol is simply something crafted from linguistic material that already exists, mapped out by overwhelmingly strict precedents. As soon as the word is available, we instinctively feel, with a bit of relief, that the concept is ready for our grasp. We don’t truly feel we have a key to immediate knowledge or understanding of the concept until we possess the symbol. Would we be so eager to die for “liberty” or fight for “ideals” if those words weren’t already resonating within us? And the word, as we know, is not just a key; it can also be a shackle.

Language is primarily an auditory system of symbols. In so far as it is articulated it is also a motor system, but the motor aspect of speech is clearly secondary to the auditory. In normal individuals the impulse to speech first takes effect in the sphere of auditory imagery and is then transmitted to the motor nerves that control the organs of speech. The motor processes and the accompanying motor feelings are not, however, the end, the final resting point. They are merely a means and a control leading to auditory perception in both speaker and hearer. Communication, which is the very object of speech, is successfully effected only when the hearer’s auditory perceptions are translated into the appropriate and intended flow of imagery or thought or both combined. Hence the cycle of speech, in so far as we may look upon it as a purely external instrument, begins and ends in the realm of sounds. The concordance between the initial auditory imagery and the final auditory perceptions is the social seal or warrant of the successful issue of the process. As we have already seen, the typical course of this process may undergo endless modifications or transfers into equivalent systems without thereby losing its essential formal characteristics.

Language is mainly an auditory system of symbols. While it involves articulation, the motor aspect of speech is clearly secondary to the auditory part. In typical individuals, the urge to speak first occurs in the realm of auditory imagery, which is then sent to the motor nerves that control the speech organs. However, the motor processes and the feelings that come with them are not the final goal; they are just a means to facilitate auditory perception for both the speaker and the listener. Communication, which is the purpose of speech, only succeeds when the listener's auditory perceptions are turned into the right flow of imagery or thoughts, or a combination of both. Therefore, the process of speech, viewed as a purely external tool, begins and ends with sounds. The agreement between the initial auditory imagery and the final auditory perceptions is the social proof of the successful outcome of the process. As we've already noted, the typical progression of this process can go through countless changes or adaptations into similar systems without losing its essential formal characteristics.

The most important of these modifications is the abbreviation of the speech process involved in thinking. This has doubtless many forms, according to the structural or functional peculiarities of the individual mind. The least modified form is that known as “talking to one’s self” or “thinking aloud.” Here the speaker and the hearer are identified in a single person, who may be said to communicate with himself. More significant is the still further abbreviated form in which the sounds of speech are not articulated at all. To this belong all the varieties of silent speech and of normal thinking. The auditory centers alone may be excited; or the impulse to linguistic expression may be communicated as well to the motor nerves that communicate with the organs of speech but be inhibited either in the muscles of these organs or at some point in the motor nerves themselves; or, possibly, the auditory centers may be only slightly, if at all, affected, the speech process manifesting itself directly in the motor sphere. There must be still other types of abbreviation. How common is the excitation of the motor nerves in silent speech, in which no audible or visible articulations result, is shown by the frequent experience of fatigue in the speech organs, particularly in the larynx, after unusually stimulating reading or intensive thinking.

The most important of these changes is the shortening of the speech process involved in thinking. This likely takes many forms, depending on the structural or functional characteristics of the individual mind. The least altered form is what we call “talking to oneself” or “thinking aloud.” In this case, the speaker and the listener are the same person, who essentially communicates with themselves. More notably, there is an even shorter version where speech sounds aren’t articulated at all. This includes all types of silent speech and regular thinking. The auditory centers may be activated alone; or the impulse to express language may also reach the motor nerves connected to the speech organs but be held back either in the muscles of these organs or at some point in the motor nerves themselves; alternatively, the auditory centers may be only slightly, if at all, affected, with the speech process showing up directly in the motor area. There must be other types of shortening as well. How common it is for the motor nerves to be activated in silent speech, where no audible or visible articulation occurs, is highlighted by the frequent fatigue felt in the speech organs, especially in the larynx, after particularly stimulating reading or intense thinking.

All the modifications so far considered are directly patterned on the typical process of normal speech. Of very great interest and importance is the possibility of transferring the whole system of speech symbolism into other terms than those that are involved in the typical process. This process, as we have seen, is a matter of sounds and of movements intended to produce these sounds. The sense of vision is not brought into play. But let us suppose that one not only hears the articulated sounds but sees the articulations themselves as they are being executed by the speaker. Clearly, if one can only gain a sufficiently high degree of adroitness in perceiving these movements of the speech organs, the way is opened for a new type of speech symbolism—that in which the sound is replaced by the visual image of the articulations that correspond to the sound. This sort of system has no great value for most of us because we are already possessed of the auditory-motor system of which it is at best but an imperfect translation, not all the articulations being visible to the eye. However, it is well known what excellent use deaf-mutes can make of “reading from the lips” as a subsidiary method of apprehending speech. The most important of all visual speech symbolisms is, of course, that of the written or printed word, to which, on the motor side, corresponds the system of delicately adjusted movements which result in the writing or typewriting or other graphic method of recording speech. The significant feature for our recognition in these new types of symbolism, apart from the fact that they are no longer a by-product of normal speech itself, is that each element (letter or written word) in the system corresponds to a specific element (sound or sound-group or spoken word) in the primary system. Written language is thus a point-to-point equivalence, to borrow a mathematical phrase, to its spoken counterpart. The written forms are secondary symbols of the spoken ones—symbols of symbols—yet so close is the correspondence that they may, not only in theory but in the actual practice of certain eye-readers and, possibly, in certain types of thinking, be entirely substituted for the spoken ones. Yet the auditory-motor associations are probably always latent at the least, that is, they are unconsciously brought into play. Even those who read and think without the slightest use of sound imagery are, at last analysis, dependent on it. They are merely handling the circulating medium, the money, of visual symbols as a convenient substitute for the economic goods and services of the fundamental auditory symbols.

All the changes we've looked at so far are based directly on how normal speech works. It's really interesting and important to think about the possibility of transferring the entire system of speech symbolism into different forms than what’s typically involved. This process, as we've seen, involves sounds and movements intended to create those sounds. Vision doesn't play a role here. But let's imagine that someone not only hears the spoken sounds but also sees the movements happening as the speaker articulates them. Clearly, if one can develop a high level of skill in perceiving these movements of the speech organs, it opens the door for a new kind of speech symbolism—one where the sound is replaced by the visual image of the articulations that correspond to the sound. This kind of system isn’t very valuable for most of us because we already have the auditory-motor system, of which this is just an imperfect translation, since not all articulations are visible. However, it's well known how effectively deaf-mutes can use "lip reading" as a secondary method of understanding speech. The most important visual speech symbolism is, of course, the written or printed word, which corresponds on the motor side to the finely tuned movements that result in writing, typewriting, or other graphical ways of recording speech. The key point to recognize in these new types of symbolism, aside from the fact that they are no longer a direct result of normal speech itself, is that each element (letter or written word) in this system corresponds to a specific element (sound or sound group or spoken word) in the primary system. Written language is thus a direct equivalence, to use a mathematical term, to its spoken counterpart. The written forms are secondary symbols of the spoken ones—symbols of symbols—but the correspondence is so close that, in theory and in the actual practice of certain skilled readers, and possibly in certain kinds of thinking, they can completely replace the spoken ones. Still, the auditory-motor associations are probably always present at least in the background, meaning they are unconsciously activated. Even those who read and think without any sound imagery are ultimately reliant on it. They are just using visual symbols as a convenient substitute for the basic auditory symbols.

The possibilities of linguistic transfer are practically unlimited. A familiar example is the Morse telegraph code, in which the letters of written speech are represented by a conventionally fixed sequence of longer or shorter ticks. Here the transfer takes place from the written word rather than directly from the sounds of spoken speech. The letter of the telegraph code is thus a symbol of a symbol of a symbol. It does not, of course, in the least follow that the skilled operator, in order to arrive at an understanding of a telegraphic message, needs to transpose the individual sequence of ticks into a visual image of the word before he experiences its normal auditory image. The precise method of reading off speech from the telegraphic communication undoubtedly varies widely with the individual. It is even conceivable, if not exactly likely, that certain operators may have learned to think directly, so far as the purely conscious part of the process of thought is concerned, in terms of the tick-auditory symbolism or, if they happen to have a strong natural bent toward motor symbolism, in terms of the correlated tactile-motor symbolism developed in the sending of telegraphic messages.

The possibilities of language transfer are practically endless. A familiar example is Morse code, where the letters of written language are represented by a specific sequence of short and long signals. In this case, the transfer happens from the written word instead of directly from the sounds of spoken language. Therefore, the Morse code letter is a symbol of a symbol of a symbol. However, this doesn’t mean that a skilled operator needs to translate each sequence of signals into a visual representation of the word before understanding its usual auditory form. The exact way individuals interpret speech from telegraphic messages can vary greatly. It’s even possible, though not very likely, that some operators have learned to think directly, at least in terms of the conscious aspect of their thought process, in terms of the signal-auditory symbolism or, if they have a strong inclination towards motor symbolism, in terms of the related tactile-motor symbolism used when sending telegraphic messages.

Still another interesting group of transfers are the different gesture languages, developed for the use of deaf-mutes, of Trappist monks vowed to perpetual silence, or of communicating parties that are within seeing distance of each other but are out of earshot. Some of these systems are one-to-one equivalences of the normal system of speech; others, like military gesture-symbolism or the gesture language of the Plains Indians of North America (understood by tribes of mutually unintelligible forms of speech) are imperfect transfers, limiting themselves to the rendering of such grosser speech elements as are an imperative minimum under difficult circumstances. In these latter systems, as in such still more imperfect symbolisms as those used at sea or in the woods, it may be contended that language no longer properly plays a part but that the ideas are directly conveyed by an utterly unrelated symbolic process or by a quasi-instinctive imitativeness. Such an interpretation would be erroneous. The intelligibility of these vaguer symbolisms can hardly be due to anything but their automatic and silent translation into the terms of a fuller flow of speech.

Another interesting group of transfers includes the various gesture languages developed for deaf-mutes, Trappist monks who take a vow of silence, or for communicating parties who can see each other but are out of earshot. Some of these systems correspond directly to the normal system of speech, while others, like military gesture-codes or the gesture language of the Plains Indians of North America (which is understood by tribes that speak different, unintelligible languages), are less effective transfers, limited to conveying only the most basic speech elements necessary in challenging situations. In these latter systems, as well as in more primitive symbolisms used at sea or in the woods, one could argue that language no longer plays a role; instead, ideas are communicated through an unrelated symbolic process or a sort of instinctual mimicry. Such an interpretation would be incorrect. The clarity of these broader symbolisms can only come from their automatic and silent translation into a more comprehensive flow of speech.

We shall no doubt conclude that all voluntary communication of ideas, aside from normal speech, is either a transfer, direct or indirect, from the typical symbolism of language as spoken and heard or, at the least, involves the intermediary of truly linguistic symbolism. This is a fact of the highest importance. Auditory imagery and the correlated motor imagery leading to articulation are, by whatever devious ways we follow the process, the historic fountain-head of all speech and of all thinking. One other point is of still greater importance. The ease with which speech symbolism can be transferred from one sense to another, from technique to technique, itself indicates that the mere sounds of speech are not the essential fact of language, which lies rather in the classification, in the formal patterning, and in the relating of concepts. Once more, language, as a structure, is on its inner face the mold of thought. It is this abstracted language, rather more than the physical facts of speech, that is to concern us in our inquiry.

We can confidently say that all voluntary communication of ideas, apart from regular speech, is either a direct or indirect transfer from the usual symbols of language as spoken and heard or, at the very least, involves true linguistic symbols as an intermediary. This is an extremely important fact. Auditory imagery and the related motor imagery that leads to speaking are, no matter the complicated paths we take through the process, the historical source of all speech and all thinking. There's one more thing that's even more important. The way speech symbols can easily transfer from one sense to another, and from method to method, indicates that the actual sounds of speech are not the core aspect of language; rather, it lies in the categorization, the formal structure, and the connections between concepts. Once again, language, as a structure, is fundamentally a mold for thought. It is this abstract language, more than the physical aspects of speech, that will be the focus of our study.

There is no more striking general fact about language than its universality. One may argue as to whether a particular tribe engages in activities that are worthy of the name of religion or of art, but we know of no people that is not possessed of a fully developed language. The lowliest South African Bushman speaks in the forms of a rich symbolic system that is in essence perfectly comparable to the speech of the cultivated Frenchman. It goes without saying that the more abstract concepts are not nearly so plentifully represented in the language of the savage, nor is there the rich terminology and the finer definition of nuances that reflect the higher culture. Yet the sort of linguistic development that parallels the historic growth of culture and which, in its later stages, we associate with literature is, at best, but a superficial thing. The fundamental groundwork of language—the development of a clear-cut phonetic system, the specific association of speech elements with concepts, and the delicate provision for the formal expression of all manner of relations—all this meets us rigidly perfected and systematized in every language known to us. Many primitive languages have a formal richness, a latent luxuriance of expression, that eclipses anything known to the languages of modern civilization. Even in the mere matter of the inventory of speech the layman must be prepared for strange surprises. Popular statements as to the extreme poverty of expression to which primitive languages are doomed are simply myths. Scarcely less impressive than the universality of speech is its almost incredible diversity. Those of us that have studied French or German, or, better yet, Latin or Greek, know in what varied forms a thought may run. The formal divergences between the English plan and the Latin plan, however, are comparatively slight in the perspective of what we know of more exotic linguistic patterns. The universality and the diversity of speech lead to a significant inference. We are forced to believe that language is an immensely ancient heritage of the human race, whether or not all forms of speech are the historical outgrowth of a single pristine form. It is doubtful if any other cultural asset of man, be it the art of drilling for fire or of chipping stone, may lay claim to a greater age. I am inclined to believe that it antedated even the lowliest developments of material culture, that these developments, in fact, were not strictly possible until language, the tool of significant expression, had itself taken shape.

There’s no more striking fact about language than its universality. You can debate whether a particular tribe participates in activities that deserve to be called religion or art, but we don’t know of any people lacking a fully developed language. Even the most basic South African Bushman speaks with a rich symbolic system that is essentially comparable to the language of a cultured Frenchman. It’s clear that the more abstract concepts aren’t represented as abundantly in the language of a primitive society, nor is there the extensive terminology and fine distinctions that reflect a higher culture. However, the linguistic development that aligns with the historical growth of culture, and which we later associate with literature, is ultimately just a surface-level aspect. The core elements of language—the development of a clear phonetic system, the specific linking of speech elements with concepts, and the careful structuring for the formal expression of various relationships—are all rigidly perfected and organized in every known language. Many primitive languages possess a richness and a hidden depth of expression that surpass anything found in modern languages. Even in the basic inventory of speech, the average person should be ready for surprising revelations. Claims about the extreme limitations of primitive languages are mere myths. Almost as striking as the universality of speech is its incredible diversity. Those of us who have studied French or German, or even better, Latin or Greek, understand the many ways a thought can be expressed. However, the formal differences between English and Latin are relatively minor compared to the exotic linguistic patterns we know of. The universality and diversity of speech lead to an important conclusion. We must believe that language is an immensely ancient heritage of humanity, whether all forms of speech evolved from a single original form or not. It’s doubtful that any other cultural achievement of humankind, be it the art of creating fire or shaping stone, can claim greater antiquity. I believe it predates even the simplest developments of material culture, which in fact couldn’t have occurred until language, the tool for meaningful expression, had taken form.

II

The Elements of Speech

We have more than once referred to the “elements of speech,” by which we understood, roughly speaking, what are ordinarily called “words.” We must now look more closely at these elements and acquaint ourselves with the stuff of language. The very simplest element of speech—and by “speech” we shall hence-forth mean the auditory system of speech symbolism, the flow of spoken words—is the individual sound, though, as we shall see later on, the sound is not itself a simple structure but the resultant of a series of independent, yet closely correlated, adjustments in the organs of speech. And yet the individual sound is not, properly considered, an element of speech at all, for speech is a significant function and the sound as such has no significance. It happens occasionally that the single sound is an independently significant element (such as French a “has” and à “to” or Latin i “go!”), but such cases are fortuitous coincidences between individual sound and significant word. The coincidence is apt to be fortuitous not only in theory but in point of actual historic fact; thus, the instances cited are merely reduced forms of originally fuller phonetic groups—Latin habet and ad and Indo-European ei respectively. If language is a structure and if the significant elements of language are the bricks of the structure, then the sounds of speech can only be compared to the unformed and unburnt clay of which the bricks are fashioned. In this chapter we shall have nothing further to do with sounds as sounds.

We have referred to the “elements of speech,” which we understood, in simple terms, as what are typically called “words.” Now, we need to take a closer look at these elements and understand the components of language. The most basic element of speech—and by “speech,” we will now mean the auditory system of speech symbolism, the flow of spoken words—is the individual sound. However, as we will see later, the sound itself is not a simple structure but the result of a series of independent, yet closely related, adjustments in the speech organs. Still, the individual sound isn't really an element of speech at all, since speech is a meaningful function and the sound by itself has no meaning. Occasionally, a single sound can be a significant element on its own (like French a for “has” and à for “to” or Latin i for “go!”), but these cases are just random coincidences between individual sounds and meaningful words. This coincidence tends to be random not only in theory but also in actual historical facts; thus, the examples mentioned are just simplified forms of originally longer phonetic groups—Latin habet and ad and Indo-European ei respectively. If language is a structure and the significant elements of language are the building blocks of that structure, then the sounds of speech can only be compared to the unshaped and unbaked clay of from which the bricks are made. In this chapter, we won’t be dealing with sounds as sounds anymore.

The true, significant elements of language are generally sequences of sounds that are either words, significant parts of words, or word groupings. What distinguishes each of these elements is that it is the outward sign of a specific idea, whether of a single concept or image or of a number of such concepts or images definitely connected into a whole. The single word may or may not be the simplest significant element we have to deal with. The English words sing, sings, singing, singer each conveys a perfectly definite and intelligible idea, though the idea is disconnected and is therefore functionally of no practical value. We recognize immediately that these words are of two sorts. The first word, sing, is an indivisible phonetic entity conveying the notion of a certain specific activity. The other words all involve the same fundamental notion but, owing to the addition of other phonetic elements, this notion is given a particular twist that modifies or more closely defines it. They represent, in a sense, compounded concepts that have flowered from the fundamental one. We may, therefore, analyze the words sings, singing, and singer as binary expressions involving a fundamental concept, a concept of subject matter (sing), and a further concept of more abstract order—one of person, number, time, condition, function, or of several of these combined.

The key elements of language are usually sequences of sounds that represent either words, meaningful parts of words, or combinations of words. What makes each element unique is that it serves as a signal of a specific idea, whether it’s a single concept or image or a collection of related concepts or images that come together as a whole. A single word may or may not be the simplest significant element we encounter. The English words sing, sings, singing, and singer each express a clear and understandable idea, though each idea stands alone and isn’t practically useful on its own. We can quickly see that these words fall into two categories. The first word, sing, is an indivisible phonetic unit that represents a specific activity. The other words share the same basic idea, but due to the addition of other sounds, this idea takes on a unique angle that refines or clarifies it. They represent, in a way, complex concepts that have evolved from the basic one. Thus, we can break down the words sings, singing, and singer as binary expressions that include a core concept, a concept of subject matter (sing), and an additional concept that is more abstract—like person, number, time, condition, function, or a combination of these.

If we symbolize such a term as sing by the algebraic formula A, we shall have to symbolize such terms as sings and singer by the formula A + b.[1] The element A may be either a complete and independent word (sing) or the fundamental substance, the so-called root or stem[2] or “radical element” (sing-) of a word. The element b (-s, -ing, -er) is the indicator of a subsidiary and, as a rule, a more abstract concept; in the widest sense of the word “form,” it puts upon the fundamental concept a formal limitation. We may term it a “grammatical element” or affix. As we shall see later on, the grammatical element or the grammatical increment, as we had better put it, need not be suffixed to the radical element. It may be a prefixed element (like the un- of unsingable), it may be inserted into the very body of the stem (like the n of the Latin vinco “I conquer” as contrasted with its absence in vici “I have conquered”), it may be the complete or partial repetition of the stem, or it may consist of some modification of the inner form of the stem (change of vowel, as in sung and song; change of consonant as in dead and death; change of accent; actual abbreviation). Each and every one of these types of grammatical element or modification has this peculiarity, that it may not, in the vast majority of cases, be used independently but needs to be somehow attached to or welded with a radical element in order to convey an intelligible notion. We had better, therefore, modify our formula, A + b, to A + (b), the round brackets symbolizing the incapacity of an element to stand alone. The grammatical element, moreover, is not only non-existent except as associated with a radical one, it does not even, as a rule, obtain its measure of significance unless it is associated with a particular class of radical elements. Thus, the -s of English he hits symbolizes an utterly different notion from the -s of books, merely because hit and book are differently classified as to function. We must hasten to observe, however, that while the radical element may, on occasion, be identical with the word, it does not follow that it may always, or even customarily, be used as a word. Thus, the hort- “garden” of such Latin forms as hortus, horti, and horto is as much of an abstraction, though one yielding a more easily apprehended significance, than the -ing of singing. Neither exists as an independently intelligible and satisfying element of speech. Both the radical element, as such, and the grammatical element, therefore, are reached only by a process of abstraction. It seemed proper to symbolize sing-er as A + (b); hort-us must be symbolized as (A) + (b).

If we represent a term like sing with the algebraic formula A, we should represent terms like sings and singer with the formula A + b.[1] The element A can either be a complete and standalone word (sing) or the fundamental part, known as the root or stem[2] or “radical element” (sing-) of a word. The element b (-s, -ing, -er) indicates an additional, often more abstract concept; broadly speaking, it places a formal limit on the fundamental concept. We can call it a “grammatical element” or affix. As we will discuss later, the grammatical element, or the grammatical increment, as we should probably refer to it, doesn’t always need to be attached to the radical element. It can be a prefix (like the un- in unsingable), it can be inserted into the middle of the stem (like the n in the Latin vinco “I conquer” compared to its absence in vici “I have conquered”), it can be a complete or partial repetition of the stem, or it might involve some modification of the inner form of the stem (like a vowel change, as in sung and song; a consonant change, as in dead and death; a change of accent; actual abbreviation). Each of these types of grammatical elements or modifications has the characteristic that, in most cases, they cannot be used independently and must be attached to or combined with a radical element to convey a clear idea. Therefore, it’s better to revise our formula to A + (b), with the parentheses indicating that the element cannot stand alone. Moreover, the grammatical element not only doesn’t exist unless it’s connected to a radical one, but it typically doesn’t have significant meaning unless associated with a specific class of radical elements. For instance, the -s in he hits represents a completely different concept from the -s in books, simply because hit and book function differently. However, we should note that while the radical element can sometimes be identical to the word, it doesn’t mean it can always or even usually be used as a word. For example, hort- “garden” from Latin forms like hortus, horti, and horto is just as abstract, though easier to understand, as the -ing in singing. Neither serves as an independently meaningful and complete element of speech. Therefore, both the radical element and the grammatical element are derived through a process of abstraction. It seems appropriate to symbolize sing-er as A + (b); hort-us should be represented as (A) + (b).

So far, the first speech element that we have found which we can say actually “exists” is the word. Before defining the word, however, we must look a little more closely at the type of word that is illustrated by sing. Are we, after all, justified in identifying it with a radical element? Does it represent a simple correspondence between concept and linguistic expression? Is the element sing-, that we have abstracted from sings, singing, and singer and to which we may justly ascribe a general unmodified conceptual value, actually the same linguistic fact as the word sing? It would almost seem absurd to doubt it, yet a little reflection only is needed to convince us that the doubt is entirely legitimate. The word sing cannot, as a matter of fact, be freely used to refer to its own conceptual content. The existence of such evidently related forms as sang and sung at once shows that it cannot refer to past time, but that, for at least an important part of its range of usage, it is limited to the present. On the other hand, the use of sing as an “infinitive” (in such locutions as to sing and he will sing) does indicate that there is a fairly strong tendency for the word sing to represent the full, untrammeled amplitude of a specific concept. Yet if sing were, in any adequate sense, the fixed expression of the unmodified concept, there should be no room for such vocalic aberrations as we find in sang and sung and song, nor should we find sing specifically used to indicate present time for all persons but one (third person singular sings).

So far, the first element of speech we've found that we can say actually "exists" is the word. Before defining the word, though, we need to take a closer look at the type of word illustrated by sing. Are we really justified in identifying it as a fundamental element? Does it show a simple relationship between concept and linguistic expression? Is the element sing-, which we've abstracted from sings, singing, and singer, and to which we can fairly assign a general unmodified conceptual value, actually the same linguistic fact as the word sing? It might seem silly to question it, yet a bit of thought is enough to convince us that the doubt is entirely valid. The word sing can't, in fact, be used freely to refer to its own conceptual content. The existence of related forms like sang and sung immediately shows that it can't refer to past time, but rather, for at least a significant part of its usage, it is limited to the present. On the flip side, the use of sing as an "infinitive" (in phrases like to sing and he will sing) suggests that the word sing tends to represent the full, unrestricted scope of a specific concept. Yet if sing were, in any meaningful sense, the fixed expression of the unmodified concept, there shouldn't be any vocalic variations like those we see in sang and sung and song, nor should sing specifically be used to indicate present time for all forms but one (the third person singular sings).

The truth of the matter is that sing is a kind of twilight word, trembling between the status of a true radical element and that of a modified word of the type of singing. Though it has no outward sign to indicate that it conveys more than a generalized idea, we do feel that there hangs about it a variable mist of added value. The formula A does not seem to represent it so well as A + (0). We might suspect sing of belonging to the A + (b) type, with the reservation that the (b) had vanished. This report of the “feel” of the word is far from fanciful, for historical evidence does, in all earnest, show that sing is in origin a number of quite distinct words, of type A + (b), that have pooled their separate values. The (b) of each of these has gone as a tangible phonetic element; its force, however, lingers on in weakened measure. The sing of I sing is the correspondent of the Anglo-Saxon singe; the infinitive sing, of singan; the imperative sing of sing. Ever since the breakdown of English forms that set in about the time of the Norman Conquest, our language has been straining towards the creation of simple concept-words, unalloyed by formal connotations, but it has not yet succeeded in this, apart, possibly, from isolated adverbs and other elements of that sort. Were the typical unanalyzable word of the language truly a pure concept-word (type A) instead of being of a strangely transitional type (type A + [0]), our sing and work and house and thousands of others would compare with the genuine radical-words of numerous other languages.[3] Such a radical-word, to take a random example, is the Nootka[4] word hamot “bone.” Our English correspondent is only superficially comparable. Hamot means “bone” in a quite indefinite sense; to our English word clings the notion of singularity. The Nootka Indian can convey the idea of plurality, in one of several ways, if he so desires, but he does not need to; hamot may do for either singular or plural, should no interest happen to attach to the distinction. As soon as we say “bone” (aside from its secondary usage to indicate material), we not merely specify the nature of the object but we imply, whether we will or no, that there is but one of these objects to be considered. And this increment of value makes all the difference.

The truth is that sing is a kind of in-between word, caught between being a true radical element and a modified word like singing. Although it doesn't have any obvious markers to show that it carries more than a general idea, we can sense that it has a variable mist of added value around it. The formula A doesn't seem to represent it as well as A + (0). We might think of sing as belonging to the A + (b) type, except that the (b) has disappeared. This observation about the "feel" of the word is not just fanciful; historical evidence shows that sing originally came from several distinct words of type A + (b) that combined their separate values. The (b) of each of these has vanished as a clear phonetic element, yet its essence remains in a diminished form. The sing of I sing corresponds to the Anglo-Saxon singe; the infinitive sing relates to singan; the imperative sing comes from sing. Since the breakdown of English forms that began around the time of the Norman Conquest, our language has been trying to create simple concept-words that aren't mixed with formal meanings, but it hasn't fully succeeded, except perhaps for isolated adverbs and similar elements. If the typical unanalyzable word in the language were truly a pure concept-word (type A) instead of a strangely transitional type (type A + [0]), our sing, work, and house—along with thousands of others—would compare to the genuine radical words found in many other languages.[3] A radical word, to give a random example, is the Nootka[4] word hamot, meaning “bone.” Our English equivalent is only superficially similar. Hamot refers to “bone” in a vague sense; our English word carries the idea of singularity. The Nootka Indian can express the concept of plurality in several ways if needed, but he doesn't have to; hamot can refer to either singular or plural as long as there's no interest in making that distinction. As soon as we say “bone” (aside from its secondary use to refer to material), we not only specify what the object is but also imply, whether we mean to or not, that there is only one of these objects being addressed. And this added value makes all the difference.

We now know of four distinct formal types of word: A (Nootka hamot); A + (0) (sing, bone); A + (b) (singing); (A) + (b) (Latin hortus). There is but one other type that is fundamentally possible: A + B, the union of two (or more) independently occurring radical elements into a single term. Such a word is the compound fire-engine or a Sioux form equivalent to eat-stand (i.e., “to eat while standing”). It frequently happens, however, that one of the radical elements becomes functionally so subordinated to the other that it takes on the character of a grammatical element. We may symbolize this by A + b, a type that may gradually, by loss of external connection between the subordinated element b and its independent counterpart B merge with the commoner type A + (b). A word like beautiful is an example of A + b, the -ful barely preserving the impress of its lineage. A word like homely, on the other hand, is clearly of the type A + (b), for no one but a linguistic student is aware of the connection between the -ly and the independent word like.

We now recognize four distinct formal types of words: A (Nootka hamot); A + (0) (sing, bone); A + (b) (singing); (A) + (b) (Latin hortus). There is only one other type that is fundamentally possible: A + B, the combination of two (or more) independently occurring root elements into a single term. Examples include the compound fire-engine or a Sioux term equivalent to eat-stand (i.e., “to eat while standing”). However, it often happens that one of the root elements becomes so functionally subordinate to the other that it takes on the role of a grammatical element. We can represent this as A + b, a type that may gradually, through the loss of external connection between the subordinate element b and its independent counterpart B, merge with the more common type A + (b). A word like beautiful is an example of A + b, where the -ful barely retains a trace of its origin. In contrast, a word like homely clearly falls under the type A + (b), as only someone studying linguistics would recognize the link between -ly and the independent word like.

In actual use, of course, these five (or six) fundamental types may be indefinitely complicated in a number of ways. The (0) may have a multiple value; in other words, the inherent formal modification of the basic notion of the word may affect more than one category. In such a Latin word as cor “heart,” for instance, not only is a concrete concept conveyed, but there cling to the form, which is actually shorter than its own radical element (cord-), the three distinct, yet intertwined, formal concepts of singularity, gender classification (neuter), and case (subjective-objective). The complete grammatical formula for cor is, then, A + (0) + (0) + (0), though the merely external, phonetic formula would be (A)—, (A) indicating the abstracted “stem” cord-, the minus sign a loss of material. The significant thing about such a word as cor is that the three conceptual limitations are not merely expressed by implication as the word sinks into place in a sentence; they are tied up, for good and all, within the very vitals of the word and cannot be eliminated by any possibility of usage.

In actual use, these five (or six) basic types can become infinitely complex in various ways. The (0) can have multiple values; in other words, the inherent formal modification of the word's basic concept can affect more than one category. For example, in the Latin word cor meaning “heart,” it conveys a concrete concept, but the form, which is actually shorter than its own root (cord-), carries three distinct yet interconnected formal concepts of singularity, gender classification (neuter), and case (subjective-objective). The complete grammatical formula for cor is A + (0) + (0) + (0), although the external, phonetic formula would be (A)—, where (A) indicates the abstracted “stem” cord- and the minus sign indicates a loss of material. What's significant about a word like cor is that these three conceptual boundaries aren't just implied as the word fits into a sentence; they're permanently embedded in the very essence of the word and can't be removed by any usage.

Other complications result from a manifolding of parts. In a given word there may be several elements of the order A (we have already symbolized this by the type A + B), of the order (A), of the order b, and of the order (b). Finally, the various types may be combined among themselves in endless ways. A comparatively simple language like English, or even Latin, illustrates but a modest proportion of these theoretical possibilities. But if we take our examples freely from the vast storehouse of language, from languages exotic as well as from those that we are more familiar with, we shall find that there is hardly a possibility that is not realized in actual usage. One example will do for thousands, one complex type for hundreds of possible types. I select it from Paiute, the language of the Indians of the arid plateaus of southwestern Utah. The word wii-to-kuchum-punku-rügani-yugwi-va-ntü-m(ü)[5] is of unusual length even for its own language, but it is no psychological monster for all that. It means “they who are going to sit and cut up with a knife a black cow (or bull),” or, in the order of the Indian elements, “knife-black-buffalo-pet-cut up-sit(plur.)-future-participle-animate plur.” The formula for this word, in accordance with our symbolism, would be (F) + (E) + C + d + A + B + (g) + (h) + (i) + (0). It is the plural of the future participle of a compound verb “to sit and cut up”—A + B. The elements (g)—which denotes futurity—, (h)—a participial suffix—, and (i)—indicating the animate plural—are grammatical elements which convey nothing when detached. The formula (0) is intended to imply that the finished word conveys, in addition to what is definitely expressed, a further relational idea, that of subjectivity; in other words, the form can only be used as the subject of a sentence, not in an objective or other syntactic relation. The radical element A (“to cut up”), before entering into combination with the coördinate element B (“to sit”), is itself compounded with two nominal elements or element-groups—an instrumentally used stem (F) (“knife”), which may be freely used as the radical element of noun forms but cannot be employed as an absolute noun in its given form, and an objectively used group—(E) + C + d (“black cow or bull”). This group in turn consists of an adjectival radical element (E) (“black”), which cannot be independently employed (the absolute notion of “black” can be rendered only as the participle of a verb: “black-be-ing”), and the compound noun C + d (“buffalo-pet”). The radical element C properly means “buffalo,” but the element d, properly an independently occurring noun meaning “horse” (originally “dog” or “domesticated animal” in general), is regularly used as a quasi-subordinate element indicating that the animal denoted by the stem to which it is affixed is owned by a human being. It will be observed that the whole complex (F) + (E) + C + d + A + B is functionally no more than a verbal base, corresponding to the sing- of an English form like singing; that this complex remains verbal in force on the addition of the temporal element (g)—this (g), by the way, must not be understood as appended to B alone, but to the whole basic complex as a unit—; and that the elements (h) + (i) + (0) transform the verbal expression into a formally well-defined noun.

Other complications come from a mix of elements. In a given word, there can be several components of type A (which we previously symbolized as A + B), of type (A), of type b, and of type (b). Furthermore, the different types can combine in countless ways. A relatively simple language like English, or even Latin, reflects only a small fraction of these theoretical possibilities. However, if we pull examples from the vast world of languages, including both exotic languages and those we know well, we'll find that there's hardly a possibility that isn't actually used. One example can represent thousands, and one complex type can represent hundreds of possible types. I’ll take it from Paiute, the language spoken by the Native Americans of the arid plateaus of southwestern Utah. The word wii-to-kuchum-punku-rügani-yugwi-va-ntü-m(ü)[5] is unusually long even for its language, but it's not a psychological monster by any means. It means “they who are going to sit and cut up with a knife a black cow (or bull),” or, according to the order of the Indian components, “knife-black-buffalo-pet-cut up-sit(plur.)-future-participle-animate plur.” The formula for this word, based on our symbols, would be (F) + (E) + C + d + A + B + (g) + (h) + (i) + (0). It represents the plural form of the future participle of the combined verb “to sit and cut up”—A + B. The components (g)—which indicates futurity—, (h)—a participial suffix—, and (i)—denoting the animate plural—are grammatical components that don’t mean anything on their own. The formula (0) suggests that the completed word conveys, in addition to what is clearly stated, an extra relational idea, that of subjectivity; in other words, this form can only function as the subject of a sentence, not in an objective or other grammatical context. The core element A (“to cut up”), before combining with the coordinate element B (“to sit”), is itself combined with two nominal elements or groups—an instrumentally used stem (F) (“knife”), which can be used freely as a noun stem but cannot be used as an independent noun in its current form, and an objectively used group—(E) + C + d (“black cow or bull”). This group consists of an adjectival root (E) (“black”), which cannot be used on its own (the absolute sense of “black” can only be expressed as a verb participle: “black-be-ing”), and the compound noun C + d (“buffalo-pet”). The core element C means “buffalo,” but the element d, which is an independently occurring noun meaning “horse” (originally “dog” or “domesticated animal” in general), is often used as a quasi-subordinate element indicating that the animal referred to by the stem it attaches to is owned by a person. It can be noticed that the entire complex (F) + (E) + C + d + A + B functions essentially as a verbal base, similar to sing- in an English word like singing; that this complex retains its verbal meaning even when the temporal element (g) is added—by the way, (g) shouldn’t be seen as added only to B, but to the entire base complex as a whole; and that the components (h) + (i) + (0) convert the verbal expression into a formally defined noun.

It is high time that we decided just what is meant by a word. Our first impulse, no doubt, would have been to define the word as the symbolic, linguistic counterpart of a single concept. We now know that such a definition is impossible. In truth it is impossible to define the word from a functional standpoint at all, for the word may be anything from the expression of a single concept—concrete or abstract or purely relational (as in of or by or and)—to the expression of a complete thought (as in Latin dico “I say” or, with greater elaborateness of form, in a Nootka verb form denoting “I have been accustomed to eat twenty round objects [e.g., apples] while engaged in [doing so and so]”). In the latter case the word becomes identical with the sentence. The word is merely a form, a definitely molded entity that takes in as much or as little of the conceptual material of the whole thought as the genius of the language cares to allow. Thus it is that while the single radical elements and grammatical elements, the carriers of isolated concepts, are comparable as we pass from language to language, the finished words are not. Radical (or grammatical) element and sentence—these are the primary functional units of speech, the former as an abstracted minimum, the latter as the esthetically satisfying embodiment of a unified thought. The actual formal units of speech, the words, may on occasion identify themselves with either of the two functional units; more often they mediate between the two extremes, embodying one or more radical notions and also one or more subsidiary ones. We may put the whole matter in a nutshell by saying that the radical and grammatical elements of language, abstracted as they are from the realities of speech, respond to the conceptual world of science, abstracted as it is from the realities of experience, and that the word, the existent unit of living speech, responds to the unit of actually apprehended experience, of history, of art. The sentence is the logical counterpart of the complete thought only if it be felt as made up of the radical and grammatical elements that lurk in the recesses of its words. It is the psychological counterpart of experience, of art, when it is felt, as indeed it normally is, as the finished play of word with word. As the necessity of defining thought solely and exclusively for its own sake becomes more urgent, the word becomes increasingly irrelevant as a means. We can therefore easily understand why the mathematician and the symbolic logician are driven to discard the word and to build up their thought with the help of symbols which have, each of them, a rigidly unitary value.

It’s about time we figured out what a word really means. Initially, we might have tried to define it as the symbolic, linguistic equivalent of a single concept. Now we know that such a definition is impossible. In fact, it’s impossible to define a word functionally at all, because a word can range from the expression of a single concept—whether concrete, abstract, or purely relational (like “of,” “by,” or “and”)—to the expression of a complete thought (like the Latin “dico” meaning “I say,” or a more complex form in a Nootka verb indicating “I have been accustomed to eat twenty round objects [like apples] while engaged in [doing so and so]”). In these cases, the word becomes identical with the sentence. A word is simply a form, a distinctly shaped entity that captures as much or as little of the conceptual material of the whole thought as the nature of the language allows. This is why, while the basic radical elements and grammatical elements, which carry isolated concepts, can be compared across languages, finished words cannot. Radical (or grammatical) elements and sentences are the primary functional units of speech, with the former as an abstract minimum and the latter as the aesthetically satisfying expression of a unified thought. The actual formal units of speech, or words, can sometimes align with either of the two functional units; more often, they act as a bridge between the two extremes, incorporating one or more core ideas alongside additional ones. In short, we can say that the radical and grammatical elements of language, removed as they are from the realities of speech, correspond to the conceptual world of science, which is also abstracted from the realities of experience, and that the word, the existing unit of living speech, corresponds to the unit of actual experience, history, and art. A sentence logically represents a complete thought only if it’s perceived as made up of the radical and grammatical elements lurking within its words. It serves as the psychological representation of experience and art when it is experienced, as it usually is, as the complete interaction of word with word. As the need to define thought solely for its own sake grows more urgent, the word itself becomes less relevant as a method. This helps explain why mathematicians and symbolic logicians often feel compelled to abandon words and instead use symbols that each have a strictly singular value.

But is not the word, one may object, as much of an abstraction as the radical element? Is it not as arbitrarily lifted out of the living sentence as is the minimum conceptual element out of the word? Some students of language have, indeed, looked upon the word as such an abstraction, though with very doubtful warrant, it seems to me. It is true that in particular cases, especially in some of the highly synthetic languages of aboriginal America, it is not always easy to say whether a particular element of language is to be interpreted as an independent word or as part of a larger word. These transitional cases, puzzling as they may be on occasion, do not, however, materially weaken the case for the psychological validity of the word. Linguistic experience, both as expressed in standardized, written form and as tested in daily usage, indicates overwhelmingly that there is not, as a rule, the slightest difficulty in bringing the word to consciousness as a psychological reality. No more convincing test could be desired than this, that the naïve Indian, quite unaccustomed to the concept of the written word, has nevertheless no serious difficulty in dictating a text to a linguistic student word by word; he tends, of course, to run his words together as in actual speech, but if he is called to a halt and is made to understand what is desired, he can readily isolate the words as such, repeating them as units. He regularly refuses, on the other hand, to isolate the radical or grammatical element, on the ground that it “makes no sense.”[6] What, then, is the objective criterion of the word? The speaker and hearer feel the word, let us grant, but how shall we justify their feeling? If function is not the ultimate criterion of the word, what is?

But isn’t the word, as someone might argue, just as abstract as the radical element? Isn’t it as arbitrarily taken out of the living sentence as the smallest conceptual element is from the word? Some language students have indeed viewed the word as such an abstraction, although I find that position quite questionable. It’s true that in certain cases, especially among some of the highly synthetic languages of indigenous America, it can be tricky to determine whether a specific part of language should be understood as an independent word or as part of a bigger word. These borderline cases, puzzling as they may sometimes be, don’t significantly undermine the argument for the psychological validity of the word. Linguistic experience, both in standardized written form and in everyday use, overwhelmingly shows that, generally, there’s hardly any trouble in bringing the word into consciousness as a psychological reality. The most convincing evidence is that the naïve Indian, who is not familiar with the concept of the written word, still has no major issue dictating a text to a language student word by word; he tends to blend his words together like in spoken language, but if you stop him and explain what you need, he can easily isolate the words as separate units. He consistently refuses, however, to isolate the radical or grammatical element because he thinks it “makes no sense.”[6] So, what is the objective criterion for the word? The speaker and listener feel the word, let’s agree on that, but how can we justify their feeling? If function isn’t the ultimate criterion for the word, then what is?

It is easier to ask the question than to answer it. The best that we can do is to say that the word is one of the smallest, completely satisfying bits of isolated “meaning” into which the sentence resolves itself. It cannot be cut into without a disturbance of meaning, one or the other or both of the severed parts remaining as a helpless waif on our hands. In practice this unpretentious criterion does better service than might be supposed. In such a sentence as It is unthinkable, it is simply impossible to group the elements into any other and smaller “words” than the three indicated. Think or thinkable might be isolated, but as neither un- nor -able nor is-un yields a measurable satisfaction, we are compelled to leave unthinkable as an integral whole, a miniature bit of art. Added to the “feel” of the word are frequently, but by no means invariably, certain external phonetic characteristics. Chief of these is accent. In many, perhaps in most, languages the single word is marked by a unifying accent, an emphasis on one of the syllables, to which the rest are subordinated. The particular syllable that is to be so distinguished is dependent, needless to say, on the special genius of the language. The importance of accent as a unifying feature of the word is obvious in such English examples as unthinkable, characterizing. The long Paiute word that we have analyzed is marked as a rigid phonetic unit by several features, chief of which are the accent on its second syllable (wii’-“knife”) and the slurring (“unvoicing,” to use the technical phonetic term) of its final vowel (-mü, animate plural). Such features as accent, cadence, and the treatment of consonants and vowels within the body of a word are often useful as aids in the external demarcation of the word, but they must by no means be interpreted, as is sometimes done, as themselves responsible for its psychological existence. They at best but strengthen a feeling of unity that is already present on other grounds.

It's easier to ask a question than to answer it. The best we can say is that a word is one of the smallest, completely satisfying pieces of “meaning” that a sentence can break down into. It can't be divided without losing meaning, with one part or both becoming a useless fragment. In practice, this simple criterion proves to be more useful than one might think. In a sentence like It is unthinkable, it's impossible to break it down into smaller “words” than the three mentioned. Think or thinkable could stand alone, but since neither un- nor -able nor is-un offers any meaningful satisfaction, we have to accept unthinkable as a complete unit, a small piece of art. Along with the “feel” of the word, there are often, but not always, certain external phonetic features. The most important is accent. In many, if not most, languages, a single word is highlighted by a unifying accent, emphasizing one syllable while the others are less important. The specific syllable that gets this emphasis depends, of course, on the unique characteristics of the language. The significance of accent as a unifying aspect of a word is clear in English examples like unthinkable and characterizing. The lengthy Paiute word we examined is marked as a distinct phonetic unit by several features, the main one being the accent on its second syllable (wii’-“knife”) and the slurring (“unvoicing,” in technical terms) of its final vowel (-mü, animate plural). Features like accent, rhythm, and the way consonants and vowels are handled within a word are often helpful in clearly defining the word, but they shouldn't be interpreted, as sometimes happens, as being responsible for its psychological existence. They at best reinforce a sense of unity that is already based on other factors.

We have already seen that the major functional unit of speech, the sentence, has, like the word, a psychological as well as a merely logical or abstracted existence. Its definition is not difficult. It is the linguistic expression of a proposition. It combines a subject of discourse with a statement in regard to this subject. Subject and “predicate” may be combined in a single word, as in Latin dico; each may be expressed independently, as in the English equivalent, I say; each or either may be so qualified as to lead to complex propositions of many sorts. No matter how many of these qualifying elements (words or functional parts of words) are introduced, the sentence does not lose its feeling of unity so long as each and every one of them falls in place as contributory to the definition of either the subject of discourse or the core of the predicate[7]. Such a sentence as The mayor of New York is going to deliver a speech of welcome in French is readily felt as a unified statement, incapable of reduction by the transfer of certain of its elements, in their given form, to the preceding or following sentences. The contributory ideas of of New York, of welcome, and in French may be eliminated without hurting the idiomatic flow of the sentence. The mayor is going to deliver a speech is a perfectly intelligible proposition. But further than this we cannot go in the process of reduction. We cannot say, for instance, Mayor is going to deliver.[8] The reduced sentence resolves itself into the subject of discourse—the mayor—and the predicate—is going to deliver a speech. It is customary to say that the true subject of such a sentence is mayor, the true predicate is going or even is, the other elements being strictly subordinate. Such an analysis, however, is purely schematic and is without psychological value. It is much better frankly to recognize the fact that either or both of the two terms of the sentence-proposition may be incapable of expression in the form of single words. There are languages that can convey all that is conveyed by The-mayor is-going-to-deliver-a-speech in two words, a subject word and a predicate word, but English is not so highly synthetic. The point that we are really making here is that underlying the finished sentence is a living sentence type, of fixed formal characteristics. These fixed types or actual sentence-groundworks may be freely overlaid by such additional matter as the speaker or writer cares to put on, but they are themselves as rigidly “given” by tradition as are the radical and grammatical elements abstracted from the finished word. New words may be consciously created from these fundamental elements on the analogy of old ones, but hardly new types of words. In the same way new sentences are being constantly created, but always on strictly traditional lines. The enlarged sentence, however, allows as a rule of considerable freedom in the handling of what may be called “unessential” parts. It is this margin of freedom which gives us the opportunity of individual style.

We’ve already seen that the main functional unit of speech, the sentence, has a psychological aspect in addition to a purely logical or abstract existence. Its definition is straightforward. It is the linguistic expression of a proposition. It combines a subject of discussion with a statement regarding that subject. The subject and “predicate” can be joined in a single word, as in Latin dico; each can be stated separately, like the English equivalent, I say; either can be elaborated in ways that lead to complex propositions of various kinds. No matter how many of these qualifying elements (words or functional parts of words) are added, the sentence maintains a sense of unity as long as each contributes to defining either the subject of discussion or the core of the predicate . A sentence like The mayor of New York is going to deliver a speech of welcome in French is easily recognized as a coherent statement that cannot be simplified by moving certain elements, in their current form, to earlier or later sentences. The contributing ideas of of New York, of welcome, and in French can be removed without disrupting the natural flow of the sentence. The mayor is going to deliver a speech is a perfectly clear proposition. However, we can’t go any further in simplifying it. We can't say, for instance, Mayor is going to deliver.[8] The simplified sentence breaks down into the subject—the mayor—and the predicate—is going to deliver a speech. It's common to say that the true subject of such a sentence is mayor, and the true predicate is is going or even is, with the other elements being strictly subordinate. However, this analysis is purely schematic and lacks psychological significance. It’s more beneficial to acknowledge that either or both parts of the sentence-proposition may not be expressible as single words. There are languages that can express everything conveyed by The-mayor is-going-to-deliver-a-speech in just two words, one for the subject and one for the predicate, but English isn’t that synthetic. The key point we’re making is that beneath the finished sentence lies a living sentence type with fixed formal characteristics. These fixed types or actual sentence structures can be layered with additional elements that the speaker or writer chooses to include, but they are just as rigidly “given” by tradition as are the fundamental and grammatical elements abstracted from the finished word. New words can be consciously created from these basic elements based on older ones, but it's rare to create entirely new types of words. Similarly, new sentences are constantly being formed, but always within strictly traditional frameworks. The expanded sentence, however, usually allows for considerable freedom in handling what might be considered “unessential” parts. This margin of freedom provides us the opportunity for individual style.

The habitual association of radical elements, grammatical elements, words, and sentences with concepts or groups of concepts related into wholes is the fact itself of language. It is important to note that there is in all languages a certain randomness of association. Thus, the idea of “hide” may be also expressed by the word “conceal,” the notion of “three times” also by “thrice.” The multiple expression of a single concept is universally felt as a source of linguistic strength and variety, not as a needless extravagance. More irksome is a random correspondence between idea and linguistic expression in the field of abstract and relational concepts, particularly when the concept is embodied in a grammatical element. Thus, the randomness of the expression of plurality in such words as books, oxen, sheep, and geese is felt to be rather more, I fancy, an unavoidable and traditional predicament than a welcome luxuriance. It is obvious that a language cannot go beyond a certain point in this randomness. Many languages go incredibly far in this respect, it is true, but linguistic history shows conclusively that sooner or later the less frequently occurring associations are ironed out at the expense of the more vital ones. In other words, all languages have an inherent tendency to economy of expression. Were this tendency entirely inoperative, there would be no grammar. The fact of grammar, a universal trait of language, is simply a generalized expression of the feeling that analogous concepts and relations are most conveniently symbolized in analogous forms. Were a language ever completely “grammatical,” it would be a perfect engine of conceptual expression. Unfortunately, or luckily, no language is tyrannically consistent. All grammars leak.

The regular connection of basic elements, grammar, words, and sentences with ideas or groups of ideas into cohesive units is the very essence of language. It's important to recognize that there's a certain randomness in all languages. For example, the idea of "hide" can also be expressed as "conceal," and "three times" can also be said as "thrice." The various ways to express a single idea are seen as a source of strength and variety in language, rather than unnecessary extravagance. A more annoying issue arises when there’s a random connection between an idea and its language expression, especially with abstract or relational concepts, particularly when the concept is tied to a grammatical element. The randomness in how plurality is expressed in words like books, oxen, sheep, and geese feels more like an unavoidable traditional issue than a welcome complexity. It’s clear that a language can’t be too random beyond a certain point. Many languages can go really far with this, but linguistic history shows that sooner or later, less common associations tend to get smoothed over, leaving the more essential ones. In other words, all languages naturally lean toward being more economical in their expressions. If this tendency didn’t exist, grammar wouldn’t exist. The existence of grammar, a universal characteristic of language, is simply a broad reflection of the idea that similar concepts and relationships are best represented in similar forms. If a language were ever completely “grammatical,” it would be a perfect machine for expressing concepts. Unfortunately, or perhaps fortunately, no language is perfectly consistent. All grammars have their shortcomings.

Up to the present we have been assuming that the material of language reflects merely the world of concepts and, on what I have ventured to call the “pre-rational” plane, of images, which are the raw material of concepts. We have, in other words, been assuming that language moves entirely in the ideational or cognitive sphere. It is time that we amplified the picture. The volitional aspect of consciousness also is to some extent explicitly provided for in language. Nearly all languages have special means for the expression of commands (in the imperative forms of the verb, for example) and of desires, unattained or unattainable (Would he might come! or Would he were here!) The emotions, on the whole, seem to be given a less adequate outlet. Emotion, indeed, is proverbially inclined to speechlessness. Most, if not all, the interjections are to be put to the credit of emotional expression, also, it may be, a number of linguistic elements expressing certain modalities, such as dubitative or potential forms, which may be interpreted as reflecting the emotional states of hesitation or doubt—attenuated fear. On the whole, it must be admitted that ideation reigns supreme in language, that volition and emotion come in as distinctly secondary factors. This, after all, is perfectly intelligible. The world of image and concept, the endless and ever-shifting picture of objective reality, is the unavoidable subject-matter of human communication, for it is only, or mainly, in terms of this world that effective action is possible. Desire, purpose, emotion are the personal color of the objective world; they are applied privately by the individual soul and are of relatively little importance to the neighboring one. All this does not mean that volition and emotion are not expressed. They are, strictly speaking, never absent from normal speech, but their expression is not of a truly linguistic nature. The nuances of emphasis, tone, and phrasing, the varying speed and continuity of utterance, the accompanying bodily movements, all these express something of the inner life of impulse and feeling, but as these means of expression are, at last analysis, but modified forms of the instinctive utterance that man shares with the lower animals, they cannot be considered as forming part of the essential cultural conception of language, however much they may be inseparable from its actual life. And this instinctive expression of volition and emotion is, for the most part, sufficient, often more than sufficient, for the purposes of communication.

Up to now, we’ve been assuming that the way we use language only reflects the world of ideas and, on what I’ve dared to call the “pre-rational” level, of images, which are the raw building blocks of those ideas. In other words, we’ve been thinking that language exists solely in the realm of ideas or cognition. It’s time to broaden our perspective. The willful aspect of consciousness is also to some degree represented in language. Almost all languages have specific ways to express commands (like the imperative forms of verbs) and desires, whether fulfilled or not (like “Would he might come!” or “Would he were here!”). Emotions, for the most part, seem to have a less adequate way of being expressed. Emotion is often said to struggle to find words. Most, if not all, interjections are results of emotional expression, and there are also various linguistic elements that show certain modalities, such as doubtful or potential forms, which might reflect emotional states of hesitation or doubt—mild fear. Overall, it should be acknowledged that ideas dominate language, while will and emotion play a distinctly secondary role. This is completely understandable. The world of images and concepts, the endless and constantly changing picture of objective reality, is the essential subject of human communication because it’s primarily through this world that effective action can occur. Desire, intention, and emotion add a personal touch to the objective world; they are privately applied by the individual and are relatively less significant to those around them. However, this doesn’t mean that will and emotion aren’t expressed. They’re, strictly speaking, never entirely absent from normal conversation, but their expression isn’t purely linguistic. The nuances of emphasis, tone, and phrasing, along with varying speed and continuity of speech, and accompanying body movements all communicate something of our inner impulses and feelings. Yet, because these methods of expression are ultimately just modified forms of the instinctive expressions that humans share with animals, they can’t be considered an essential part of the cultural understanding of language, even though they are inseparable from its actual use. This instinctive way of expressing will and emotion is usually sufficient, often more than enough, for effective communication.

There are, it is true, certain writers on the psychology of language[9] who deny its prevailingly cognitive character but attempt, on the contrary, to demonstrate the origin of most linguistic elements within the domain of feeling. I confess that I am utterly unable to follow them. What there is of truth in their contentions may be summed up, it seems to me, by saying that most words, like practically all elements of consciousness, have an associated feeling-tone, a mild, yet none the less real and at times insidiously powerful, derivative of pleasure or pain. This feeling-tone, however, is not as a rule an inherent value in the word itself; it is rather a sentimental growth on the word’s true body, on its conceptual kernel. Not only may the feeling-tone change from one age to another (this, of course, is true of the conceptual content as well), but it varies remarkably from individual to individual according to the personal associations of each, varies, indeed, from time to time in a single individual’s consciousness as his experiences mold him and his moods change. To be sure, there are socially accepted feeling-tones, or ranges of feeling-tone, for many words over and above the force of individual association, but they are exceedingly variable and elusive things at best. They rarely have the rigidity of the central, primary fact. We all grant, for instance, that storm, tempest, and hurricane, quite aside from their slight differences of actual meaning, have distinct feeling-tones, tones that are felt by all sensitive speakers and readers of English in a roughly equivalent fashion. Storm, we feel, is a more general and a decidedly less “magnificent” word than the other two; tempest is not only associated with the sea but is likely, in the minds of many, to have obtained a softened glamour from a specific association with Shakespeare’s great play; hurricane has a greater forthrightness, a directer ruthlessness than its synonyms. Yet the individual’s feeling-tones for these words are likely to vary enormously. To some tempest and hurricane may seem “soft,” literary words, the simpler storm having a fresh, rugged value which the others do not possess (think of storm and stress). If we have browsed much in our childhood days in books of the Spanish Main, hurricane is likely to have a pleasurably bracing tone; if we have had the misfortune to be caught in one, we are not unlikely to feel the word as cold, cheerless, sinister.

There are definitely some writers on the psychology of language[9] who argue against its mainly cognitive nature, instead trying to show that most linguistic elements come from feelings. I admit that I can’t really follow their reasoning. The truth in their claims, as I see it, is that most words, like almost all elements of consciousness, have an associated feeling-tone, a mild yet still real and sometimes insidiously powerful derivative of pleasure or pain. However, this feeling-tone isn’t usually an inherent value of the word itself; it's more like a sentimental layer added to the word’s true essence, its conceptual core. The feeling-tone can change from one era to the next (this is true for the conceptual content as well), and it can vary significantly from person to person based on individual associations. It can even change for the same person over time as their experiences shape them and their moods change. Of course, there are socially accepted feeling-tones or ranges of feeling-tone for many words beyond individual associations, but they are generally quite variable and elusive. They rarely have the firmness of the central, primary fact. For example, we all agree that storm, tempest, and hurricane, despite their slight differences in actual meaning, carry distinct feeling-tones, which are perceived by all sensitive speakers and readers of English in a roughly similar way. Storm feels more general and definitely less “grand” than the other two; tempest is not only linked to the sea but is often associated with a certain charm from Shakespeare’s great play; hurricane has a more direct, ruthless quality than its synonyms. Yet, individuals’ feeling-tones for these words can vary greatly. For some, tempest and hurricane might seem like “soft,” literary words, while the simpler storm feels fresh and rugged in a way the others can’t match (think of storm and stress). If we've spent our childhoods reading about the Spanish Main, hurricane is likely to evoke a pleasantly invigorating tone; if we’ve unfortunately been caught in one, we might instead perceive the word as cold, bleak, and sinister.

The feeling-tones of words are of no use, strictly speaking, to science; the philosopher, if he desires to arrive at truth rather than merely to persuade, finds them his most insidious enemies. But man is rarely engaged in pure science, in solid thinking. Generally his mental activities are bathed in a warm current of feeling and he seizes upon the feeling-tones of words as gentle aids to the desired excitation. They are naturally of great value to the literary artist. It is interesting to note, however, that even to the artist they are a danger. A word whose customary feeling-tone is too unquestioningly accepted becomes a plushy bit of furniture, a cliché. Every now and then the artist has to fight the feeling-tone, to get the word to mean what it nakedly and conceptually should mean, depending for the effect of feeling on the creative power of an individual juxtaposition of concepts or images.

The emotional aspects of words are not really useful for science; a philosopher who wants to uncover truth instead of just persuading finds them to be his sneakiest foes. But people rarely engage in pure science or rigorous thinking. Usually, their thoughts are wrapped in a warm wave of feelings, and they grab onto the emotional aspects of words as gentle pushes toward the desired reaction. These aspects are obviously very important for a literary artist. However, it's interesting to point out that even for artists, they can be a risk. A word that is too readily accepted for its usual emotional tone becomes like a frumpy piece of furniture, a cliché. Now and then, artists have to challenge the emotional tone to make the word mean what it should clearly and conceptually mean, relying on the creative power of uniquely pairing concepts or images for the emotional effect.

III

The Sounds of Language

We have seen that the mere phonetic framework of speech does not constitute the inner fact of language and that the single sound of articulated speech is not, as such, a linguistic element at all. For all that, speech is so inevitably bound up with sounds and their articulation that we can hardly avoid giving the subject of phonetics some general consideration. Experience has shown that neither the purely formal aspects of a language nor the course of its history can be fully understood without reference to the sounds in which this form and this history are embodied. A detailed survey of phonetics would be both too technical for the general reader and too loosely related to our main theme to warrant the needed space, but we can well afford to consider a few outstanding facts and ideas connected with the sounds of language.

We have seen that the basic sound structure of speech doesn’t represent the true essence of language, and that the individual sound of spoken words isn’t actually a linguistic element by itself. Still, speech is so closely linked to sounds and how they’re formed that we can hardly avoid discussing phonetics in general. Experience shows that neither the formal aspects of a language nor the details of its history can be fully understood without considering the sounds that express this form and history. A thorough study of phonetics would be too technical for the average reader and too loosely related to our main topic to justify the necessary space, but we can certainly look at a few key facts and ideas related to the sounds of language.

The feeling that the average speaker has of his language is that it is built up, acoustically speaking, of a comparatively small number of distinct sounds, each of which is rather accurately provided for in the current alphabet by one letter or, in a few cases, by two or more alternative letters. As for the languages of foreigners, he generally feels that, aside from a few striking differences that cannot escape even the uncritical ear, the sounds they use are the same as those he is familiar with but that there is a mysterious “accent” to these foreign languages, a certain unanalyzed phonetic character, apart from the sounds as such, that gives them their air of strangeness. This naïve feeling is largely illusory on both scores. Phonetic analysis convinces one that the number of clearly distinguishable sounds and nuances of sounds that are habitually employed by the speakers of a language is far greater than they themselves recognize. Probably not one English speaker out of a hundred has the remotest idea that the t of a word like sting is not at all the same sound as the t of teem, the latter t having a fullness of “breath release” that is inhibited in the former case by the preceding s; that the ea of meat is of perceptibly shorter duration than the ea of mead; or that the final s of a word like heads is not the full, buzzing z sound of the s in such a word as please. It is the frequent failure of foreigners, who have acquired a practical mastery of English and who have eliminated all the cruder phonetic shortcomings of their less careful brethren, to observe such minor distinctions that helps to give their English pronunciation the curiously elusive “accent” that we all vaguely feel. We do not diagnose the “accent” as the total acoustic effect produced by a series of slight but specific phonetic errors for the very good reason that we have never made clear to ourselves our own phonetic stock in trade. If two languages taken at random, say English and Russian, are compared as to their phonetic systems, we are more apt than not to find that very few of the phonetic elements of the one find an exact analogue in the other. Thus, the t of a Russian word like tam “there” is neither the English t of sting nor the English t of teem. It differs from both in its “dental” articulation, in other words, in being produced by contact of the tip of the tongue with the upper teeth, not, as in English, by contact of the tongue back of the tip with the gum ridge above the teeth; moreover, it differs from the t of teem also in the absence of a marked “breath release” before the following vowel is attached, so that its acoustic effect is of a more precise, “metallic” nature than in English. Again, the English l is unknown in Russian, which possesses, on the other hand, two distinct l-sounds that the normal English speaker would find it difficult exactly to reproduce—a “hollow,” guttural-like l and a “soft,” palatalized l-sound that is only very approximately rendered, in English terms, as ly. Even so simple and, one would imagine, so invariable a sound as m differs in the two languages. In a Russian word like most “bridge” the m is not the same as the m of the English word most; the lips are more fully rounded during its articulation, so that it makes a heavier, more resonant impression on the ear. The vowels, needless to say, differ completely in English and Russian, hardly any two of them being quite the same.

The average speaker feels that their language is made up, acoustically, of a relatively small set of distinct sounds, each of which is usually represented in the current alphabet by a single letter or, in a few cases, by two or more alternative letters. When it comes to foreign languages, they generally think that, aside from a few noticeable differences that even an untrained ear can catch, the sounds are similar to those they know, but there’s a mysterious “accent” to these languages, a certain unexamined phonetic quality, apart from the actual sounds, that makes them seem strange. This simple impression is mostly an illusion. Phonetic analysis shows that the number of clearly distinguishable sounds and nuances in a language is much greater than speakers realize. Probably not one English speaker in a hundred knows that the t in sting is not the same as the t in teem; the latter has a fuller “breath release” that is held back in the former because of the preceding s; that the ea in meat lasts noticeably shorter than in mead; or that the final s in heads is not the fully resonant z sound of the s in please. The frequent failure of foreigners, who have become fairly fluent in English and who have fixed many of the more obvious phonetic mistakes of their less careful peers, to catch these minor distinctions contributes to the peculiar “accent” we all sense. We don’t think of this “accent” as the overall acoustic effect produced by a series of subtle but specific phonetic errors for the simple reason that we haven’t clarified our own phonetic inventory. If we randomly compare two languages, like English and Russian, in terms of their phonetic systems, we’re likely to see that very few phonetic elements in one have direct equivalents in the other. For instance, the t in the Russian word tam (“there”) is neither the t of sting nor the t of teem. It is different from both because it is articulated with the tip of the tongue against the upper teeth, while in English, it’s produced with the tongue’s back against the ridge above the teeth. Additionally, it doesn’t have a pronounced “breath release” before the following vowel, giving it a clearer, “metallic” sound compared to English. Furthermore, the English l doesn't exist in Russian, which has two distinct l sounds that a typical English speaker would struggle to reproduce perfectly—a “hollow,” guttural-like l and a “soft,” palatalized l sound that’s roughly represented in English as ly. Even something as basic and seemingly consistent as the sound m differs in the two languages. In the Russian word most (“bridge”), the m is not the same as the m in the English word most; the lips are rounded more fully during its production, making a heavier, more resonant impression on the ear. And, of course, the vowels vary completely in English and Russian, with hardly any two being exactly alike.

I have gone into these illustrative details, which are of little or no specific interest for us, merely in order to provide something of an experimental basis to convince ourselves of the tremendous variability of speech sounds. Yet a complete inventory of the acoustic resources of all the European languages, the languages nearer home, while unexpectedly large, would still fall far short of conveying a just idea of the true range of human articulation. In many of the languages of Asia, Africa, and aboriginal America there are whole classes of sounds that most of us have no knowledge of. They are not necessarily more difficult of enunciation than sounds more familiar to our ears; they merely involve such muscular adjustments of the organs of speech as we have never habituated ourselves to. It may be safely said that the total number of possible sounds is greatly in excess of those actually in use. Indeed, an experienced phonetician should have no difficulty in inventing sounds that are unknown to objective investigation. One reason why we find it difficult to believe that the range of possible speech sounds is indefinitely large is our habit of conceiving the sound as a simple, unanalyzable impression instead of as the resultant of a number of distinct muscular adjustments that take place simultaneously. A slight change in any one of these adjustments gives us a new sound which is akin to the old one, because of the continuance of the other adjustments, but which is acoustically distinct from it, so sensitive has the human ear become to the nuanced play of the vocal mechanism. Another reason for our lack of phonetic imagination is the fact that, while our ear is delicately responsive to the sounds of speech, the muscles of our speech organs have early in life become exclusively accustomed to the particular adjustments and systems of adjustment that are required to produce the traditional sounds of the language. All or nearly all other adjustments have become permanently inhibited, whether through inexperience or through gradual elimination. Of course the power to produce these inhibited adjustments is not entirely lost, but the extreme difficulty we experience in learning the new sounds of foreign languages is sufficient evidence of the strange rigidity that has set in for most people in the voluntary control of the speech organs. The point may be brought home by contrasting the comparative lack of freedom of voluntary speech movements with the all but perfect freedom of voluntary gesture.[10] Our rigidity in articulation is the price we have had to pay for easy mastery of a highly necessary symbolism. One cannot be both splendidly free in the random choice of movements and selective with deadly certainty.[11]

I’ve included these detailed examples, which might not be particularly interesting to us, just to give us a solid experimental foundation to understand the incredible variety of speech sounds. However, even a complete list of the acoustic features of all European languages, or those closer to us, while surprisingly extensive, would still not fully capture the true range of human speech. Many languages in Asia, Africa, and indigenous America have entire categories of sounds that most of us aren’t familiar with. These sounds aren’t necessarily harder to pronounce than the ones we know; they just require different muscle adjustments in our speech organs that we haven’t gotten used to. It’s safe to say that the total number of possible sounds is far greater than the ones currently in use. In fact, an experienced phonetician could easily create sounds that haven’t been discovered through objective study. One reason we struggle to believe that the range of potential speech sounds is virtually limitless is because we typically think of sound as a simple, unbreakable impression rather than the result of multiple distinct muscle adjustments happening at once. A slight change in any one of these adjustments can create a new sound that is similar to the old one because the other adjustments remain, but it's acoustically distinct—our ears are finely tuned to the subtle variations in vocal performance. Another reason we lack phonetic creativity is that, while our ears are highly responsive to speech sounds, the muscles in our speech organs tend to become exclusively adapted to the specific adjustments needed to produce the familiar sounds of our language at an early age. Almost all other adjustments are permanently suppressed, whether due to lack of experience or gradual elimination. Of course, we haven’t completely lost the ability to make these suppressed adjustments, but the significant difficulty we face in learning new sounds from foreign languages shows how rigid most people have become in voluntarily controlling their speech organs. This point can be illustrated by comparing the limited freedom of voluntary speech movements with the near-perfect freedom of voluntary gestures. Our rigidity in articulation is the price we've paid for mastering a crucial form of communication. One cannot be completely free in choosing movements randomly while also being precise with absolute certainty.

There are, then, an indefinitely large number of articulated sounds available for the mechanics of speech; any given language makes use of an explicit, rigidly economical selection of these rich resources; and each of the many possible sounds of speech is conditioned by a number of independent muscular adjustments that work together simultaneously towards its production. A full account of the activity of each of the organs of speech—in so far as its activity has a bearing on language—is impossible here, nor can we concern ourselves in a systematic way with the classification of sounds on the basis of their mechanics.[12] A few bold outlines are all that we can attempt. The organs of speech are the lungs and bronchial tubes; the throat, particularly that part of it which is known as the larynx or, in popular parlance, the “Adam’s apple”; the nose; the uvula, which is the soft, pointed, and easily movable organ that depends from the rear of the palate; the palate, which is divided into a posterior, movable “soft palate” or velum and a “hard palate”; the tongue; the teeth; and the lips. The palate, lower palate, tongue, teeth, and lips may be looked upon as a combined resonance chamber, whose constantly varying shape, chiefly due to the extreme mobility of the tongue, is the main factor in giving the outgoing breath its precise quality[13] of sound.

There are, therefore, an unlimited number of distinct sounds available for speech mechanics; each language utilizes a specific, efficiently selected set of these abundant resources; and each potential speech sound is influenced by various independent muscle adjustments that work together at the same time to produce it. A detailed description of how each speech organ functions—particularly in relation to language—is not feasible here, nor can we systematically categorize sounds based on their mechanics.[12] We can only provide a few general outlines. The speech organs include the lungs and bronchial tubes; the throat, especially the part known as the larynx or, in everyday language, the “Adam’s apple”; the nose; the uvula, which is the soft, pointed, and easily movable structure hanging from the back of the palate; the palate itself, which is divided into a movable “soft palate” (or velum) and a “hard palate”; the tongue; the teeth; and the lips. The palate, lower palate, tongue, teeth, and lips can be seen as a combined resonance chamber, with its ever-changing shape—mainly due to the tongue's high mobility—being the key factor in determining the precise quality[13] of the outgoing sound.

The lungs and bronchial tubes are organs of speech only in so far as they supply and conduct the current of outgoing air without which audible articulation is impossible. They are not responsible for any specific sound or acoustic feature of sounds except, possibly, accent or stress. It may be that differences of stress are due to slight differences in the contracting force of the lung muscles, but even this influence of the lungs is denied by some students, who explain the fluctuations of stress that do so much to color speech by reference to the more delicate activity of the glottal cords. These glottal cords are two small, nearly horizontal, and highly sensitive membranes within the larynx, which consists, for the most part, of two large and several smaller cartilages and of a number of small muscles that control the action of the cords.

The lungs and bronchial tubes are involved in speech only to the extent that they provide and manage the flow of outgoing air, which is essential for producing audible sounds. They don't create any specific sounds or acoustic characteristics, except perhaps for accent or stress. It's possible that variations in stress come from minor differences in the lung muscle contractions, but some researchers dispute this and attribute the stress variations, which greatly impact speech, to the finer movements of the glottal cords. These glottal cords are two small, nearly horizontal, and very sensitive membranes located in the larynx, which mainly consists of two large and several smaller cartilages and multiple small muscles that control the movements of the cords.

The cords, which are attached to the cartilages, are to the human speech organs what the two vibrating reeds are to a clarinet or the strings to a violin. They are capable of at least three distinct types of movement, each of which is of the greatest importance for speech. They may be drawn towards or away from each other, they may vibrate like reeds or strings, and they may become lax or tense in the direction of their length. The last class of these movements allows the cords to vibrate at different “lengths” or degrees of tenseness and is responsible for the variations in pitch which are present not only in song but in the more elusive modulations of ordinary speech. The two other types of glottal action determine the nature of the voice, “voice” being a convenient term for breath as utilized in speech. If the cords are well apart, allowing the breath to escape in unmodified form, we have the condition technically known as “voicelessness.” All sounds produced under these circumstances are “voiceless” sounds. Such are the simple, unmodified breath as it passes into the mouth, which is, at least approximately, the same as the sound that we write h, also a large number of special articulations in the mouth chamber, like p and s. On the other hand, the glottal cords may be brought tight together, without vibrating. When this happens, the current of breath is checked for the time being. The slight choke or “arrested cough” that is thus made audible is not recognized in English as a definite sound but occurs nevertheless not infrequently.[14] This momentary check, technically known as a “glottal stop,” is an integral element of speech in many languages, as Danish, Lettish, certain Chinese dialects, and nearly all American Indian languages. Between the two extremes of voicelessness, that of completely open breath and that of checked breath, lies the position of true voice. In this position the cords are close together, but not so tightly as to prevent the air from streaming through; the cords are set vibrating and a musical tone of varying pitch results. A tone so produced is known as a “voiced sound.” It may have an indefinite number of qualities according to the precise position of the upper organs of speech. Our vowels, nasals (such as m and n), and such sounds as b, z, and l are all voiced sounds. The most convenient test of a voiced sound is the possibility of pronouncing it on any given pitch, in other words, of singing on it.[15] The voiced sounds are the most clearly audible elements of speech. As such they are the carriers of practically all significant differences in stress, pitch, and syllabification. The voiceless sounds are articulated noises that break up the stream of voice with fleeting moments of silence. Acoustically intermediate between the freely unvoiced and the voiced sounds are a number of other characteristic types of voicing, such as murmuring and whisper.[16] These and still other types of voice are relatively unimportant in English and most other European languages, but there are languages in which they rise to some prominence in the normal flow of speech.

The cords connected to the cartilages function for human speech like the vibrating reeds do for a clarinet or strings do for a violin. They can perform at least three distinct types of movement, each vital for speech. They can be pulled toward or away from each other, vibrate like reeds or strings, and become either lax or tense along their length. The last type of movement enables the cords to vibrate at different “lengths” or levels of tension, which is what creates variations in pitch found not only in music but also in the subtle shifts of ordinary speech. The other two types of glottal action affect the quality of the voice, “voice” being a convenient term for breath as it is used in speech. If the cords are separated, allowing breath to escape freely, this condition is technically termed “voicelessness.” All sounds made under these conditions are “voiceless” sounds, such as the simple, unmodified breath as it enters the mouth, which is roughly equivalent to the sound we write as h, along with many specific articulations in the mouth, like p and s. Conversely, the glottal cords can be pressed tightly together without vibrating. When this occurs, the breath is temporarily blocked. The slight choke or “arrested cough” produced this way isn’t recognized in English as a distinct sound, but it does happen quite often.[14] This brief blockage, called a “glottal stop,” is a key component of speech in many languages, including Danish, Lettish, certain Chinese dialects, and nearly all American Indian languages. Between the two extremes of voicelessness—completely open breath and checked breath—lies the position of true voice. In this position, the cords are close together, but not so tight that they stop the air from flowing through; the cords vibrate, producing a musical tone of varying pitch. This tone is referred to as a “voiced sound.” It can take on countless qualities depending on the exact position of the upper speech organs. Our vowels, nasals (like m and n), and sounds like b, z, and l are all voiced sounds. A simple test for a voiced sound is the ability to pronounce it on any given pitch—in other words, singing it.[15] Voiced sounds are the most clearly audible elements of speech, carrying almost all significant differences in stress, pitch, and syllabification. Voiceless sounds are articulated noises that disrupt the flow of voice with brief moments of silence. Acoustically sitting between the freely unvoiced and voiced sounds are several other typical types of voicing, such as murmuring and whispering.[16] These and other types of voice are relatively unimportant in English and most other European languages, but in some languages, they play a significant role in normal speech.

The nose is not an active organ of speech, but it is highly important as a resonance chamber. It may be disconnected from the mouth, which is the other great resonance chamber, by the lifting of the movable part of the soft palate so as to shut off the passage of the breath into the nasal cavity; or, if the soft palate is allowed to hang down freely and unobstructively, so that the breath passes into both the nose and the mouth, these make a combined resonance chamber. Such sounds as b and a (as in father) are voiced “oral” sounds, that is, the voiced breath does not receive a nasal resonance. As soon as the soft palate is lowered, however, and the nose added as a participating resonance chamber, the sounds b and a take on a peculiar “nasal” quality and become, respectively, m and the nasalized vowel written an in French (e.g., sang, tant). The only English sounds[17] that normally receive a nasal resonance are m, n, and the ng sound of sing. Practically all sounds, however, may be nasalized, not only the vowels—nasalized vowels are common in all parts of the world—but such sounds as l or z. Voiceless nasals are perfectly possible. They occur, for instance, in Welsh and in quite a number of American Indian languages.

The nose isn't an active speech organ, but it's really important as a resonance chamber. It can be separated from the mouth, which is another major resonance chamber, by raising the movable part of the soft palate to block airflow into the nasal cavity. If the soft palate hangs down freely and unobstructed, allowing breath to flow into both the nose and mouth, they create a combined resonance chamber. Sounds like b and a (as in father) are voiced "oral" sounds, meaning the voiced breath doesn't get a nasal resonance. However, when the soft palate is lowered and the nose acts as a resonance chamber, the sounds b and a take on a unique "nasal" quality and turn into m and the nasalized vowel written as an in French (e.g., sang, tant). The only English sounds[17] that usually get a nasal resonance are m, n, and the ng sound in sing. Almost any sound can be nasalized, not just vowels—nasalized vowels are common worldwide—but also sounds like l or z. Voiceless nasals are completely possible. They appear, for example, in Welsh and in several American Indian languages.

The organs that make up the oral resonance chamber may articulate in two ways. The breath, voiced or unvoiced, nasalized or unnasalized, may be allowed to pass through the mouth without being checked or impeded at any point; or it may be either momentarily checked or allowed to stream through a greatly narrowed passage with resulting air friction. There are also transitions between the two latter types of articulation. The unimpeded breath takes on a particular color or quality in accordance with the varying shape of the oral resonance chamber. This shape is chiefly determined by the position of the movable parts—the tongue and the lips. As the tongue is raised or lowered, retracted or brought forward, held tense or lax, and as the lips are pursed (“rounded”) in varying degree or allowed to keep their position of rest, a large number of distinct qualities result. These oral qualities are the vowels. In theory their number is infinite, in practice the ear can differentiate only a limited, yet a surprisingly large, number of resonance positions. Vowels, whether nasalized or not, are normally voiced sounds; in not a few languages, however, “voiceless vowels”[18] also occur.

The organs that form the oral resonance chamber can function in two ways. The breath, whether voiced or unvoiced, nasal or not, can pass through the mouth freely without being blocked or restricted at any point; or it can be either briefly stopped or allowed to flow through a tightly narrowed pathway, creating air friction. There are also transitions between these two types of articulation. Unobstructed breath takes on a specific color or quality based on the shape of the oral resonance chamber. This shape is mainly influenced by the position of the movable parts—the tongue and the lips. As the tongue is raised or lowered, pulled back or pushed forward, held tense or relaxed, and as the lips are shaped (rounded) to varying degrees or left in their resting position, a wide range of distinct qualities emerges. These oral qualities are the vowels. In theory, the number of vowels is infinite, but in practice, the ear can only distinguish a limited, yet surprisingly large, number of resonance positions. Vowels, whether nasalized or not, are usually voiced sounds; however, in some languages, “voiceless vowels”[18] also exist.

The remaining oral sounds are generally grouped together as “consonants.” In them the stream of breath is interfered with in some way, so that a lesser resonance results, and a sharper, more incisive quality of tone. There are four main types of articulation generally recognized within the consonantal group of sounds. The breath may be completely stopped for a moment at some definite point in the oral cavity. Sounds so produced, like t or d or p, are known as “stops” or “explosives.”[19] Or the breath may be continuously obstructed through a narrow passage, not entirely checked. Examples of such “spirants” or “fricatives,” as they are called, are s and z and y. The third class of consonants, the “laterals,” are semi-stopped. There is a true stoppage at the central point of articulation, but the breath is allowed to escape through the two side passages or through one of them. Our English d, for instance, may be readily transformed into l, which has the voicing and the position of d, merely by depressing the sides of the tongue on either side of the point of contact sufficiently to allow the breath to come through. Laterals are possible in many distinct positions. They may be unvoiced (the Welsh ll is an example) as well as voiced. Finally, the stoppage of the breath may be rapidly intermittent; in other words, the active organ of contact—generally the point of the tongue, less often the uvula[20]—may be made to vibrate against or near the point of contact. These sounds are the “trills” or “rolled consonants,” of which the normal English r is a none too typical example. They are well developed in many languages, however, generally in voiced form, sometimes, as in Welsh and Paiute, in unvoiced form as well.

The remaining sounds we produce with our mouths are usually called “consonants.” In these sounds, the airflow is blocked in some way, resulting in lower resonance and a sharper, more cutting tone. There are four main types of articulation recognized within consonants. First, the airflow may be completely stopped for a brief moment at a specific point in the mouth. Sounds created this way, like t, d, or p, are referred to as “stops” or “explosives.”[19] Alternatively, the airflow may be continuously blocked through a narrow passage, but not completely stopped. Sounds like s, z, and y are examples of these “spirants” or “fricatives.” The third type of consonants, known as “laterals,” involves a partial blockage: there’s a complete stop at the center of articulation, but the air can escape through one or both sides. For instance, our English d can easily be transformed into l, which shares the voicing and position of d, simply by lowering the sides of the tongue enough to let the air flow through. Laterals can occur in various positions and can be either voiced (like the English l) or unvoiced (as in the Welsh ll). Finally, airflow may be stopped quickly and repeatedly; in other words, the active sound-producing part—usually the tip of the tongue, less often the uvula[20]—may vibrate against or near the contact point. These sounds are called “trills” or “rolled consonants,” and English r is a not-so-typical example. However, these sounds are well-developed in many languages, usually in voiced forms, and sometimes, as in Welsh and Paiute, in unvoiced forms as well.

The oral manner of articulation is naturally not sufficient to define a consonant. The place of articulation must also be considered. Contacts may be formed at a large number of points, from the root of the tongue to the lips. It is not necessary here to go at length into this somewhat complicated matter. The contact is either between the root of the tongue and the throat,[21] some part of the tongue and a point on the palate (as in k or ch or l), some part of the tongue and the teeth (as in the English th of thick and then), the teeth and one of the lips (practically always the upper teeth and lower lip, as in f), or the two lips (as in p or English w). The tongue articulations are the most complicated of all, as the mobility of the tongue allows various points on its surface, say the tip, to articulate against a number of opposed points of contact. Hence arise many positions of articulation that we are not familiar with, such as the typical “dental” position of Russian or Italian t and d; or the “cerebral” position of Sanskrit and other languages of India, in which the tip of the tongue articulates against the hard palate. As there is no break at any point between the rims of the teeth back to the uvula nor from the tip of the tongue back to its root, it is evident that all the articulations that involve the tongue form a continuous organic (and acoustic) series. The positions grade into each other, but each language selects a limited number of clearly defined positions as characteristic of its consonantal system, ignoring transitional or extreme positions. Frequently a language allows a certain latitude in the fixing of the required position. This is true, for instance, of the English k sound, which is articulated much further to the front in a word like kin than in cool. We ignore this difference, psychologically, as a non-essential, mechanical one. Another language might well recognize the difference, or only a slightly greater one, as significant, as paralleling the distinction in position between the k of kin and the t of tin.

The way we produce sounds with our mouths isn’t enough to fully define a consonant. We also need to think about where the sounds are produced. Contacts can happen at many different points, from the back of the tongue to the lips. We don’t need to dive too deep into this complex topic right now. The contact can be between the back of the tongue and the throat,[21] some part of the tongue and a spot on the roof of the mouth (like in k or ch or l), some part of the tongue and the teeth (like the English th in thick and then), the teeth and one of the lips (usually the upper teeth and lower lip, as in f), or both lips together (like in p or English w). Tongue articulations are the most complicated since the flexible tongue can hit various points on its surface, like the tip, against many different contact points. This leads to many articulation positions that we might not recognize, such as the typical “dental” spot for Russian or Italian t and d; or the “cerebral” position in Sanskrit and other Indian languages, where the tip of the tongue touches the hard palate. Because there’s no gap between the rims of the teeth going back to the uvula, or from the tip of the tongue back to its root, it’s clear that all tongue-related articulations form a continuous organic (and acoustic) series. The positions blend into each other, but each language chooses a limited number of distinct positions that define its system of consonants, mostly ignoring transitional or extreme positions. Often, a language allows some flexibility in setting the precise position. This is true, for example, for the English k sound, which is pronounced much closer to the front in a word like kin compared to cool. We mentally overlook this difference as unimportant, a minor detail. Another language might see this difference, or even a slightly larger one, as significant, similar to the difference in position between the k of kin and the t of tin.

The organic classification of speech sounds is a simple matter after what we have learned of their production. Any such sound may be put into its proper place by the appropriate answer to four main questions:—What is the position of the glottal cords during its articulation? Does the breath pass into the mouth alone or is it also allowed to stream into the nose? Does the breath pass freely through the mouth or is it impeded at some point and, if so, in what manner? What are the precise points of articulation in the mouth?[22] This fourfold classification of sounds, worked out in all its detailed ramifications,[23] is sufficient to account for all, or practically all, the sounds of language.[24]

The organic classification of speech sounds is straightforward based on what we have learned about how they are produced. Any sound can be categorized correctly by answering four main questions:—What is the position of the vocal cords during its articulation? Does the airflow go into the mouth only, or is it also allowed to flow into the nose? Does the airflow move freely through the mouth, or is it blocked at some point, and if so, how? What are the specific points of articulation in the mouth? [22] This fourfold classification of sounds, developed in all its detailed aspects, [23] is enough to explain all, or almost all, the sounds of language.[24]

The phonetic habits of a given language are not exhaustively defined by stating that it makes use of such and such particular sounds out of the all but endless gamut that we have briefly surveyed. There remains the important question of the dynamics of these phonetic elements. Two languages may, theoretically, be built up of precisely the same series of consonants and vowels and yet produce utterly different acoustic effects. One of them may not recognize striking variations in the lengths or “quantities” of the phonetic elements, the other may note such variations most punctiliously (in probably the majority of languages long and short vowels are distinguished; in many, as in Italian or Swedish or Ojibwa, long consonants are recognized as distinct from short ones). Or the one, say English, may be very sensitive to relative stresses, while in the other, say French, stress is a very minor consideration. Or, again, the pitch differences which are inseparable from the actual practice of language may not affect the word as such, but, as in English, may be a more or less random or, at best, but a rhetorical phenomenon, while in other languages, as in Swedish, Lithuanian, Chinese, Siamese, and the majority of African languages, they may be more finely graduated and felt as integral characteristics of the words themselves. Varying methods of syllabifying are also responsible for noteworthy acoustic differences. Most important of all, perhaps, are the very different possibilities of combining the phonetic elements. Each language has its peculiarities. The ts combination, for instance, is found in both English and German, but in English it can only occur at the end of a word (as in hats), while it occurs freely in German as the psychological equivalent of a single sound (as in Zeit, Katze). Some languages allow of great heapings of consonants or of vocalic groups (diphthongs), in others no two consonants or no two vowels may ever come together. Frequently a sound occurs only in a special position or under special phonetic circumstances. In English, for instance, the z-sound of azure cannot occur initially, while the peculiar quality of the t of sting is dependent on its being preceded by the s. These dynamic factors, in their totality, are as important for the proper understanding of the phonetic genius of a language as the sound system itself, often far more so.

The phonetic habits of a language aren't fully captured just by listing the specific sounds it uses from the seemingly infinite range we've briefly explored. There's an important aspect regarding how these phonetic elements function. Two languages might theoretically consist of the same set of consonants and vowels, yet produce completely different sound experiences. One language might not pay attention to noticeable differences in the lengths or “quantities” of these sounds, while another language might focus on these differences very closely (most languages distinguish between long and short vowels; in many, like Italian, Swedish, or Ojibwa, long consonants are also seen as distinct from short ones). For example, English might be highly sensitive to relative stresses, whereas in French, stress plays a minor role. Similarly, the pitch variations that come with actual speech may not change the word itself, but in English, they could be somewhat random or merely rhetorical, while in other languages, like Swedish, Lithuanian, Chinese, Siamese, and the majority of African languages, pitch variations might be more subtle and considered essential characteristics of the words. Different methods of forming syllables also contribute to noticeable acoustic differences. Most importantly, each language has unique ways of combining phonetic elements. For example, the ts combination exists in both English and German, but in English, it only appears at the end of a word (like in hats), whereas in German, it's used freely as if it were a single sound (like in Zeit, Katze). Some languages permit long strings of consonants or vocalic groups (diphthongs), while in others, no two consonants or vowels can sit next to each other. Often, a sound can only appear in certain positions or under particular phonetic conditions. For instance, in English, the z-sound in azure cannot come at the beginning of a word, while the specific quality of the t in sting depends on it being preceded by the s. These dynamic factors are just as crucial for truly understanding the phonetic nature of a language as the sound system itself, often even more so.

We have already seen, in an incidental way, that phonetic elements or such dynamic features as quantity and stress have varying psychological “values.” The English ts of fiats is merely a t followed by a functionally independent s, the ts of the German word Zeit has an integral value equivalent, say, to the t of the English word tide. Again, the t of time is indeed noticeably distinct from that of sting, but the difference, to the consciousness of an English-speaking person, is quite irrelevant. It has no “value.” If we compare the t-sounds of Haida, the Indian language spoken in the Queen Charlotte Islands, we find that precisely the same difference of articulation has a real value. In such a word as sting “two,” the t is pronounced precisely as in English, but in sta “from” the t is clearly “aspirated,” like that of time. In other words, an objective difference that is irrelevant in English is of functional value in Haida; from its own psychological standpoint the t of sting is as different from that of sta as, from our standpoint, is the t of time from the d of divine. Further investigation would yield the interesting result that the Haida ear finds the difference between the English t of sting and the d of divine as irrelevant as the naïve English ear finds that of the t-sounds of sting and time. The objective comparison of sounds in two or more languages is, then, of no psychological or historical significance unless these sounds are first “weighted,” unless their phonetic “values” are determined. These values, in turn, flow from the general behavior and functioning of the sounds in actual speech.

We’ve already noticed, in a casual way, that phonetic elements or dynamic features like length and stress have different psychological “values.” The English ts in fiats is just a t followed by a separate s, whereas the ts in the German word Zeit has a combined value similar to the t in the English word tide. Similarly, the t in time sounds noticeably different from that in sting, but this difference, for an English speaker, isn't significant. It has no “value.” In contrast, when we compare the t sounds in Haida, the indigenous language from the Queen Charlotte Islands, we see that the same articulation difference actually matters. In the word sting meaning “two,” the t is pronounced exactly like in English, but in sta meaning “from,” the t is definitely “aspirated,” like in time. This means that a difference that is unimportant in English has functional significance in Haida; from its own psychological perspective, the t in sting is as different from that in sta as the t in time is from the d in divine. Further investigation would show that a Haida speaker finds the difference between the English t in sting and d in divine as insignificant as an English speaker finds the difference between the t sounds in sting and time. Therefore, comparing sounds across different languages is not psychologically or historically relevant unless these sounds are first “weighted,” and their phonetic “values” are established. These values come from how the sounds behave and function in actual speech.

These considerations as to phonetic value lead to an important conception. Back of the purely objective system of sounds that is peculiar to a language and which can be arrived at only by a painstaking phonetic analysis, there is a more restricted “inner” or “ideal” system which, while perhaps equally unconscious as a system to the naïve speaker, can far more readily than the other be brought to his consciousness as a finished pattern, a psychological mechanism. The inner sound-system, overlaid though it may be by the mechanical or the irrelevant, is a real and an immensely important principle in the life of a language. It may persist as a pattern, involving number, relation, and functioning of phonetic elements, long after its phonetic content is changed. Two historically related languages or dialects may not have a sound in common, but their ideal sound-systems may be identical patterns. I would not for a moment wish to imply that this pattern may not change. It may shrink or expand or change its functional complexion, but its rate of change is infinitely less rapid than that of the sounds as such. Every language, then, is characterized as much by its ideal system of sounds and by the underlying phonetic pattern (system, one might term it, of symbolic atoms) as by a definite grammatical structure. Both the phonetic and conceptual structures show the instinctive feeling of language for form.[25]

These ideas about phonetic value lead to an important concept. Behind the purely objective system of sounds that defines a language and can only be understood through detailed phonetic analysis, there's a more specific "inner" or "ideal" system. This system, while perhaps just as unconscious to the casual speaker, can be much more easily recognized as a complete pattern or psychological mechanism. The inner sound system, even if it's influenced by external or irrelevant factors, is a real and highly significant principle in the life of a language. It can last as a pattern, involving the number, relationships, and functions of phonetic elements, long after its phonetic content has changed. Two historically connected languages or dialects might not share a sound, but their ideal sound systems could have identical patterns. I wouldn’t want to suggest that this pattern can’t change. It can shrink, expand, or alter its function, but it changes much more slowly than the sounds themselves do. So, every language is defined not only by its ideal system of sounds and by its underlying phonetic pattern (which could be called a system of symbolic units) but also by its specific grammatical structure. Both phonetic and conceptual structures reflect the innate sense of language for form.[25]

IV

Form in Language: Grammatical Processes

The question of form in language presents itself under two aspects. We may either consider the formal methods employed by a language, its “grammatical processes,” or we may ascertain the distribution of concepts with reference to formal expression. What are the formal patterns of the language? And what types of concepts make up the content of these formal patterns? The two points of view are quite distinct. The English word unthinkingly is, broadly speaking, formally parallel to the word reformers, each being built up on a radical element which may occur as an independent verb (think, form), this radical element being preceded by an element (un-, re-) that conveys a definite and fairly concrete significance but that cannot be used independently, and followed by two elements (-ing, -ly; -er, -s) that limit the application of the radical concept in a relational sense. This formal pattern—(b) + A + (c) + (d)[26]—is a characteristic feature of the language. A countless number of functions may be expressed by it; in other words, all the possible ideas conveyed by such prefixed and suffixed elements, while tending to fall into minor groups, do not necessarily form natural, functional systems. There is no logical reason, for instance, why the numeral function of -s should be formally expressed in a manner that is analogous to the expression of the idea conveyed by -ly. It is perfectly conceivable that in another language the concept of manner (-ly) may be treated according to an entirely different pattern from that of plurality. The former might have to be expressed by an independent word (say, thus unthinking), the latter by a prefixed element (say, plural[27]-reform-er). There are, of course, an unlimited number of other possibilities. Even within the confines of English alone the relative independence of form and function can be made obvious. Thus, the negative idea conveyed by un- can be just as adequately expressed by a suffixed element (-less) in such a word as thoughtlessly. Such a twofold formal expression of the negative function would be inconceivable in certain languages, say Eskimo, where a suffixed element would alone be possible. Again, the plural notion conveyed by the -s of reformers is just as definitely expressed in the word geese, where an utterly distinct method is employed. Furthermore, the principle of vocalic change (goose—geese) is by no means confined to the expression of the idea of plurality; it may also function as an indicator of difference of time (e.g., sing—sang, throw—threw). But the expression in English of past time is not by any means always bound up with a change of vowel. In the great majority of cases the same idea is expressed by means of a distinct suffix (die-d, work-ed). Functionally, died and sang are analogous; so are reformers and geese. Formally, we must arrange these words quite otherwise. Both die-d and re-form-er-s employ the method of suffixing grammatical elements; both sang and geese have grammatical form by virtue of the fact that their vowels differ from the vowels of other words with which they are closely related in form and meaning (goose; sing, sung).

The issue of form in language can be looked at in two ways. We can examine the formal methods a language uses, its “grammatical processes,” or we can analyze how concepts are represented through formal expression. What are the formal structures of the language? And what kinds of concepts form the content of these structures? These two perspectives are quite different. The English word unthinkingly is, in broad terms, formally similar to the word reformers, as both are constructed from a root element that can stand alone as a verb (think, form). This root element is preceded by a prefix (un-, re-) that has a clear, specific meaning but cannot be used by itself, and is followed by two suffixes (-ing, -ly; -er, -s) that narrow down the application of the root concept in a relational way. This structure—(b) + A + (c) + (d)[26]—is a defining characteristic of the language. A vast number of functions can be expressed through it; in other words, all the ideas conveyed by such prefixes and suffixes, while often clustering into smaller groups, do not necessarily create obvious, functional systems. There is no logical reason, for example, why the plural function of -s should be expressed in a way that is similar to the way the idea expressed by -ly is formed. It’s entirely possible that in another language the idea of manner (-ly) could be expressed using a completely different pattern than plurality. The former might have to be communicated with an independent word (like thus unthinking), while the latter might use a prefix (like plural[27]-reform-er). There are, of course, countless other possibilities. Even just within English, the relative independence of form and function can be clearly seen. For instance, the negative idea expressed by un- can just as easily be conveyed through a suffix (-less) in a word like thoughtlessly. This dual formal expression of the negative function wouldn’t be possible in certain languages, such as Eskimo, where only a suffix would be feasible. Similarly, the plural concept signaled by the -s in reformers is clearly represented in the word geese, which uses a completely different method. Moreover, the principle of vowel change (goose—geese) isn't limited to expressing plurality; it can also indicate a change in tense (e.g., sing—sang, throw—threw). However, expressing past time in English doesn’t always require a vowel change. In most cases, the same idea is conveyed with a distinct suffix (die-d, work-ed). Functionally, died and sang are similar; the same goes for reformers and geese. Formally, we must categorize these words differently. Both die-d and re-form-er-s use suffixing to form grammatical elements; both sang and geese have grammatical forms primarily because their vowels differ from those of other words that are closely related in form and meaning (goose; sing, sung).

Every language possesses one or more formal methods or indicating the relation of a secondary concept to the main concept of the radical element. Some of these grammatical processes, like suffixing, are exceedingly wide-spread; others, like vocalic change, are less common but far from rare; still others, like accent and consonantal change, are somewhat exceptional as functional processes. Not all languages are as irregular as English in the assignment of functions to its stock of grammatical processes. As a rule, such basic concepts as those of plurality and time are rendered by means of one or other method alone, but the rule has so many exceptions that we cannot safely lay it down as a principle. Wherever we go we are impressed by the fact that pattern is one thing, the utilization of pattern quite another. A few further examples of the multiple expression of identical functions in other languages than English may help to make still more vivid this idea of the relative independence of form and function.

Every language has one or more formal ways to show the relationship between a secondary concept and the main concept of the root element. Some of these grammatical methods, like adding suffixes, are very common; others, like vowel changes, are not as frequent but still fairly common; and some, like accent and consonant changes, are somewhat rare as functional processes. Not all languages are as irregular as English in how they assign functions to their grammatical processes. Generally, basic concepts such as plurality and tense are expressed using only one method, but there are so many exceptions that we can't reliably say this is a rule. Everywhere we look, we notice that the pattern is one thing, while how the pattern is used is quite another. A few more examples of the multiple ways to express the same function in languages other than English can help clarify this idea of the relative independence of form and function.

In Hebrew, as in other Semitic languages, the verbal idea as such is expressed by three, less often by two or four, characteristic consonants. Thus, the group sh-m-r expresses the idea of “guarding,” the group g-n-b that of “stealing,” n-t-n that of “giving.” Naturally these consonantal sequences are merely abstracted from the actual forms. The consonants are held together in different forms by characteristic vowels that vary according to the idea that it is desired to express. Prefixed and suffixed elements are also frequently used. The method of internal vocalic change is exemplified in shamar “he has guarded,” shomer “guarding,” shamur “being guarded,” shmor “(to) guard.” Analogously, ganab “he has stolen,” goneb “stealing,” ganub “being stolen,” gnob “(to) steal.” But not all infinitives are formed according to the type of shmor and gnob or of other types of internal vowel change. Certain verbs suffix a t-element for the infinitive, e.g., ten-eth “to give,” heyo-th “to be.” Again, the pronominal ideas may be expressed by independent words (e.g., anoki “I”), by prefixed elements (e.g., e-shmor “I shall guard”), or by suffixed elements (e.g., shamar-ti “I have guarded”). In Nass, an Indian language of British Columbia, plurals are formed by four distinct methods. Most nouns (and verbs) are reduplicated in the plural, that is, part of the radical element is repeated, e.g., gyat “person,” gyigyat “people.” A second method is the use of certain characteristic prefixes, e.g., an’on “hand,” ka-an’on “hands”; wai “one paddles,” lu-wai “several paddle.” Still other plurals are formed by means of internal vowel change, e.g., gwula “cloak,” gwila “cloaks.” Finally, a fourth class of plurals is constituted by such nouns as suffix a grammatical element, e.g., waky “brother,” wakykw “brothers.”

In Hebrew, like in other Semitic languages, the basic idea of a verb is represented by three, and less commonly two or four, specific consonants. For example, the group sh-m-r conveys the concept of "guarding," g-n-b means "stealing," and n-t-n means "giving." These sequences of consonants are just abstractions from the actual forms. The consonants are combined in various forms with specific vowels that change based on the idea being expressed. Prefixes and suffixes are also commonly used. The internal vowel change method is illustrated by shamar "he has guarded," shomer "guarding," shamur "being guarded," and shmor "to guard." Similarly, ganab "he has stolen," goneb "stealing," ganub "being stolen," and gnob "to steal." However, not all infinitives follow the pattern of shmor and gnob or other forms of internal vowel change. Some verbs add a t element for the infinitive, such as ten-eth "to give" and heyo-th "to be." Additionally, pronominal ideas can be expressed using independent words (e.g., anoki "I"), prefixed elements (e.g., e-shmor "I shall guard"), or suffixed elements (e.g., shamar-ti "I have guarded"). In Nass, a language spoken by an Indigenous group in British Columbia, plurals are created using four different methods. Most nouns (and verbs) are made plural by reduplication, where part of the root is repeated, such as gyat "person," becoming gyigyat "people." Another method involves specific prefixes, e.g., an’on "hand," which becomes ka-an’on "hands"; wai "one paddles," changing to lu-wai "several paddle." Some plurals are formed through internal vowel change, like gwula "cloak," becoming gwila "cloaks." Finally, a fourth type of plural includes nouns that have a grammatical element added, such as waky "brother," which changes to wakykw "brothers."

From such groups of examples as these—and they might be multiplied ad nauseam—we cannot but conclude that linguistic form may and should be studied as types of patterning, apart from the associated functions. We are the more justified in this procedure as all languages evince a curious instinct for the development of one or more particular grammatical processes at the expense of others, tending always to lose sight of any explicit functional value that the process may have had in the first instance, delighting, it would seem, in the sheer play of its means of expression. It does not matter that in such a case as the English goose—geese, foul—defile, sing—sang—sung we can prove that we are dealing with historically distinct processes, that the vocalic alternation of sing and sang, for instance, is centuries older as a specific type of grammatical process than the outwardly parallel one of goose and geese. It remains true that there is (or was) an inherent tendency in English, at the time such forms as geese came into being, for the utilization of vocalic change as a significant linguistic method. Failing the precedent set by such already existing types of vocalic alternation as sing—sang—sung, it is highly doubtful if the detailed conditions that brought about the evolution of forms like teeth and geese from tooth and goose would have been potent enough to allow the native linguistic feeling to win through to an acceptance of these new types of plural formation as psychologically possible. This feeling for form as such, freely expanding along predetermined lines and greatly inhibited in certain directions by the lack of controlling types of patterning, should be more clearly understood than it seems to be. A general survey of many diverse types of languages is needed to give us the proper perspective on this point. We saw in the preceding chapter that every language has an inner phonetic system of definite pattern. We now learn that it has also a definite feeling for patterning on the level of grammatical formation. Both of these submerged and powerfully controlling impulses to definite form operate as such, regardless of the need for expressing particular concepts or of giving consistent external shape to particular groups of concepts. It goes without saying that these impulses can find realization only in concrete functional expression. We must say something to be able to say it in a certain manner.

From groups of examples like these—and they could be expanded ad nauseam—it's clear that we can and should analyze linguistic forms as patterns, separate from their functions. We're justified in this approach because all languages show a strange tendency to develop one or more specific grammatical processes while neglecting others, often losing track of any clear functional value those processes might have originally had, seemingly enjoying the simple play of their means of expression. It doesn't matter that in the case of the English goose—geese, foul—defile, sing—sang—sung, we can demonstrate that we are dealing with historically distinct processes, or that the vowel change in sing and sang is centuries older as a type of grammatical process than the outwardly similar one of goose and geese. It remains true that there is (or was) a natural tendency in English, at the time forms like geese emerged, to use vowel change as a significant linguistic method. Without the examples set by existing types of vowel alternation like sing—sang—sung, it's unlikely that the specific conditions that led to the evolution of forms like teeth and geese from tooth and goose would have been strong enough for the native linguistic intuition to recognize these new plural formations as psychologically feasible. This sense of form, freely developing along certain lines and significantly limited by the absence of controlling patterns, should be understood more clearly than it currently is. A broad examination of various types of languages is necessary to give us the right perspective on this matter. In the previous chapter, we observed that every language has an internal phonetic system with a definite pattern. We now learn that it also has a strong intuition for grammatical patterns. Both of these underlying and influential impulses toward a definite form operate independently of the need to express specific concepts or to give a consistent external shape to particular groups of concepts. It’s obvious that these impulses can only be realized through concrete functional expression. We need to say something to communicate it in a specific way.

Let us now take up a little more systematically, however briefly, the various grammatical processes that linguistic research has established. They may be grouped into six main types: word order; composition; affixation, including the use of prefixes, suffixes, and infixes; internal modification of the radical or grammatical element, whether this affects a vowel or a consonant; reduplication; and accentual differences, whether dynamic (stress) or tonal (pitch). There are also special quantitative processes, like vocalic lengthening or shortening and consonantal doubling, but these may be looked upon as particular sub-types of the process of internal modification. Possibly still other formal types exist, but they are not likely to be of importance in a general survey. It is important to bear in mind that a linguistic phenomenon cannot be looked upon as illustrating a definite “process“ unless it has an inherent functional value. The consonantal change in English, for instance, of book-s and bag-s (s in the former, z in the latter) is of no functional significance. It is a purely external, mechanical change induced by the presence of a preceding voiceless consonant, k, in the former case, of a voiced consonant, g, in the latter. This mechanical alternation is objectively the same as that between the noun house and the verb to house. In the latter case, however, it has an important grammatical function, that of transforming a noun into a verb. The two alternations belong, then, to entirely different psychological categories. Only the latter is a true illustration of consonantal modification as a grammatical process.

Let's take a more systematic look, even if briefly, at the various grammatical processes that linguistic research has identified. These can be categorized into six main types: word order; composition; affixation, which includes using prefixes, suffixes, and infixes; internal modification of the base or grammatical component, whether this involves changing a vowel or a consonant; reduplication; and accentual differences, which can be dynamic (stress) or tonal (pitch). There are also specific quantitative processes, like lengthening or shortening vowels and doubling consonants, but these can be considered special sub-types of internal modification. It's possible that there are other formal types, but they are unlikely to matter in a general overview. It's essential to remember that a linguistic phenomenon cannot be viewed as demonstrating a definite “process“ unless it has a functional value. For example, the consonantal change in English, such as book-s and bag-s (with s in the first case and z in the second) is not functionally significant. It's a purely external, mechanical change caused by the presence of a preceding voiceless consonant, k, in the first case, and a voiced consonant, g, in the second. This mechanical alternation is objectively the same as that between the noun house and the verb to house. In the latter instance, however, it serves an important grammatical function, transforming a noun into a verb. Therefore, the two alternations belong to entirely different psychological categories. Only the latter is a true example of consonantal modification as a grammatical process.

The simplest, at least the most economical, method of conveying some sort of grammatical notion is to juxtapose two or more words in a definite sequence without making any attempt by inherent modification of these words to establish a connection between them. Let us put down two simple English words at random, say sing praise. This conveys no finished thought in English, nor does it clearly establish a relation between the idea of singing and that of praising. Nevertheless, it is psychologically impossible to hear or see the two words juxtaposed without straining to give them some measure of coherent significance. The attempt is not likely to yield an entirely satisfactory result, but what is significant is that as soon as two or more radical concepts are put before the human mind in immediate sequence it strives to bind them together with connecting values of some sort. In the case of sing praise different individuals are likely to arrive at different provisional results. Some of the latent possibilities of the juxtaposition, expressed in currently satisfying form, are: sing praise (to him)! or singing praise, praise expressed in a song or to sing and praise or one who sings a song of praise (compare such English compounds as killjoy, i.e., one who kills joy) or he sings a song of praise (to him). The theoretical possibilities in the way of rounding out these two concepts into a significant group of concepts or even into a finished thought are indefinitely numerous. None of them will quite work in English, but there are numerous languages where one or other of these amplifying processes is habitual. It depends entirely on the genius of the particular language what function is inherently involved in a given sequence of words.

The simplest, and definitely the most cost-effective, way to express a grammatical idea is to place two or more words together in a specific order without trying to modify them to show a connection. Let’s take two random English words, for example, sing praise. This doesn’t create a complete thought in English, nor does it clearly relate the concepts of singing and praising. However, it is almost impossible to hear or see those two words next to each other without trying to find some meaningful connection. The result might not be completely satisfying, but what’s important is that as soon as two or more basic ideas are placed in front of someone, the mind tries to link them together somehow. In the case of sing praise, different people will likely come up with different temporary interpretations. Some of the possible interpretations of the juxtaposition, expressed in currently satisfying ways, include: sing praise (to him)! or singing praise, praise expressed in a song or to sing and praise or one who sings a song of praise (similar to English compounds like killjoy, meaning one who kills joy) or he sings a song of praise (to him). The theoretical possibilities for combining these two concepts into a coherent group or even a complete thought are countless. None of them perfectly fit in English, but there are many languages where one or another of these ways of expanding meaning is common. It entirely depends on the characteristics of the specific language what role is intended in a certain sequence of words.

Some languages, like Latin, express practically all relations by means of modifications within the body of the word itself. In these, sequence is apt to be a rhetorical rather than a strictly grammatical principle. Whether I say in Latin hominem femina videt or femina hominem videt or hominem videt femina or videt femina hominem makes little or no difference beyond, possibly, a rhetorical or stylistic one. The woman sees the man is the identical significance of each of these sentences. In Chinook, an Indian language of the Columbia River, one can be equally free, for the relation between the verb and the two nouns is as inherently fixed as in Latin. The difference between the two languages is that, while Latin allows the nouns to establish their relation to each other and to the verb, Chinook lays the formal burden entirely on the verb, the full content of which is more or less adequately rendered by she-him-sees. Eliminate the Latin case suffixes (-a and -em) and the Chinook pronominal prefixes (she-him-) and we cannot afford to be so indifferent to our word order. We need to husband our resources. In other words, word order takes on a real functional value. Latin and Chinook are at one extreme. Such languages as Chinese, Siamese, and Annamite, in which each and every word, if it is to function properly, falls into its assigned place, are at the other extreme. But the majority of languages fall between these two extremes. In English, for instance, it may make little grammatical difference whether I say yesterday the man saw the dog or the man saw the dog yesterday, but it is not a matter of indifference whether I say yesterday the man saw the dog or yesterday the dog saw the man or whether I say he is here or is he here? In the one case, of the latter group of examples, the vital distinction of subject and object depends entirely on the placing of certain words of the sentence, in the latter a slight difference of sequence makes all the difference between statement and question. It goes without saying that in these cases the English principle of word order is as potent a means of expression as is the Latin use of case suffixes or of an interrogative particle. There is here no question of functional poverty, but of formal economy.

Some languages, like Latin, express almost all relationships by changing the words themselves. In these languages, the order of words tends to be more about style than strict grammar. Whether I say in Latin hominem femina videt or femina hominem videt or hominem videt femina or videt femina hominem, there’s little or no difference in meaning beyond perhaps a stylistic one. The woman sees the man carries the same meaning in each of these sentences. In Chinook, an Indigenous language from the Columbia River, you can also be flexible because the relationship between the verb and the two nouns is just as fixed as in Latin. The difference is that while Latin allows the nouns to define their relationship with each other and the verb, Chinook puts all the formal responsibility on the verb, which is reasonably captured by she-him-sees. If we remove the Latin case endings (-a and -em) and the Chinook pronoun prefixes (she-him-), we can't afford to be so casual about word order. We need to use our resources wisely. In other words, word order becomes really important. Latin and Chinook are on one end of the spectrum. Languages like Chinese, Siamese, and Annamite, where every word must fit its specific place to work properly, are on the opposite end. Most languages, however, are somewhere in the middle. In English, for example, it may not make much difference grammatically whether I say yesterday the man saw the dog or the man saw the dog yesterday, but it does matter if I say yesterday the man saw the dog versus yesterday the dog saw the man, or if I say he is here or is he here? In the second set of examples, the crucial distinction between subject and object relies entirely on the arrangement of certain words, while a small change in order shifts the sentence from a statement to a question. It’s clear that in these cases, the English principle of word order is just as powerful a tool for expression as the Latin use of case endings or an interrogative word. This isn't about lacking functionality, but rather about being efficient with form.

We have already seen something of the process of composition, the uniting into a single word of two or more radical elements. Psychologically this process is closely allied to that of word order in so far as the relation between the elements is implied, not explicitly stated. It differs from the mere juxtaposition of words in the sentence in that the compounded elements are felt as constituting but parts of a single word-organism. Such languages as Chinese and English, in which the principle of rigid sequence is well developed, tend not infrequently also to the development of compound words. It is but a step from such a Chinese word sequence as jin tak “man virtue,” i.e., “the virtue of men,” to such more conventionalized and psychologically unified juxtapositions as t’ien tsz “heaven son,” i.e., “emperor,” or shui fu “water man,” i.e., “water carrier.” In the latter case we may as well frankly write shui-fu as a single word, the meaning of the compound as a whole being as divergent from the precise etymological values of its component elements as is that of our English word typewriter from the merely combined values of type and writer. In English the unity of the word typewriter is further safeguarded by a predominant accent on the first syllable and by the possibility of adding such a suffixed element as the plural -s to the whole word. Chinese also unifies its compounds by means of stress. However, then, in its ultimate origins the process of composition may go back to typical sequences of words in the sentence, it is now, for the most part, a specialized method of expressing relations. French has as rigid a word order as English but does not possess anything like its power of compounding words into more complex units. On the other hand, classical Greek, in spite of its relative freedom in the placing of words, has a very considerable bent for the formation of compound terms.

We have already looked at how words are formed by combining two or more root elements. This process is closely related to word order because the relationship between the elements is implied rather than explicitly stated. It’s different from just placing words next to each other in a sentence because the combined elements are perceived as parts of a single word. Languages like Chinese and English, where a strict word sequence is important, often develop compound words as well. For instance, from a Chinese word sequence like jin tak meaning “man virtue,” or “the virtue of men,” it’s a natural progression to more conventional and psychologically unified phrases like t’ien tsz meaning “heaven son,” or “emperor,” and shui fu meaning “water man,” or “water carrier.” In this case, we might as well write shui-fu as a single word, as the meaning of the whole compound differs significantly from the specific meanings of its individual parts, much like the English word typewriter differs from the combined meanings of type and writer. In English, the unity of the word typewriter is further emphasized by the stress on the first syllable and the ability to add a suffix like -s to make it plural. Chinese also creates unity in its compounds through stress. Although the origins of this composition process may trace back to typical word sequences in sentences, it is primarily a specialized way of expressing relationships today. French has as strict a word order as English but lacks the same ability to combine words into more complex units. Meanwhile, classical Greek, despite having relatively free word placement, has a strong tendency to form compound terms.

It is curious to observe how greatly languages differ in their ability to make use of the process of composition. One would have thought on general principles that so simple a device as gives us our typewriter and blackbird and hosts of other words would be an all but universal grammatical process. Such is not the case. There are a great many languages, like Eskimo and Nootka and, aside from paltry exceptions, the Semitic languages, that cannot compound radical elements. What is even stranger is the fact that many of these languages are not in the least averse to complex word-formations, but may on the contrary effect a synthesis that far surpasses the utmost that Greek and Sanskrit are capable of. Such a Nootka word, for instance, as “when, as they say, he had been absent for four days” might be expected to embody at least three radical elements corresponding to the concepts of “absent,” “four,” and “day.” As a matter of fact the Nootka word is utterly incapable of composition in our sense. It is invariably built up out of a single radical element and a greater or less number of suffixed elements, some of which may have as concrete a significance as the radical element itself. In, the particular case we have cited the radical element conveys the idea of “four,” the notions of “day” and “absent” being expressed by suffixes that are as inseparable from the radical nucleus of the word as is an English element like -er from the sing or hunt of such words as singer and hunter. The tendency to word synthesis is, then, by no means the same thing as the tendency to compounding radical elements, though the latter is not infrequently a ready means for the synthetic tendency to work with.

It’s interesting to see how much languages differ in their ability to use composition. One might think that such a simple tool, which gives us words like typewriter and blackbird, would be a nearly universal grammatical method. However, that's not the case. Many languages, like Eskimo and Nootka—and with a few exceptions, the Semitic languages—can't combine basic elements. What’s even stranger is that a lot of these languages aren’t against creating complex words. In fact, they can combine elements in ways that far exceed what Greek and Sanskrit can do. For example, a Nootka word that means “when, as they say, he had been absent for four days” would seem to include at least three basic elements corresponding to the ideas of “absent,” “four,” and “day.” In reality, the Nootka word can’t be compounded in our sense. It’s always formed from a single basic element with a varying number of suffixes, some of which can be just as concrete as the basic element itself. In the particular case we’ve mentioned, the basic element conveys “four,” while the ideas of “day” and “absent” are represented by suffixes that are as inseparable from the basic part of the word as the English element -er is from sing or hunt in words like singer and hunter. The tendency to combine words is not the same as the tendency to merge basic elements, though the latter often provides an easy way for the former to function.

There is a bewildering variety of types of composition. These types vary according to function, the nature of the compounded elements, and order. In a great many languages composition is confined to what we may call the delimiting function, that is, of the two or more compounded elements one is given a more precisely qualified significance by the others, which contribute nothing to the formal build of the sentence. In English, for instance, such compounded elements as red in redcoat or over in overlook merely modify the significance of the dominant coat or look without in any way sharing, as such, in the predication that is expressed by the sentence. Some languages, however, such as Iroquois and Nahuatl,[28] employ the method of composition for much heavier work than this. In Iroquois, for instance, the composition of a noun, in its radical form, with a following verb is a typical method of expressing case relations, particularly of the subject or object. I-meat-eat for instance, is the regular Iroquois method of expressing the sentence I am eating meat. In other languages similar forms may express local or instrumental or still other relations. Such English forms as killjoy and marplot also illustrate the compounding of a verb and a noun, but the resulting word has a strictly nominal, not a verbal, function. We cannot say he marplots. Some languages allow the composition of all or nearly all types of elements. Paiute, for instance, may compound noun with noun, adjective with noun, verb with noun to make a noun, noun with verb to make a verb, adverb with verb, verb with verb. Yana, an Indian language of California, can freely compound noun with noun and verb with noun, but not verb with verb. On the other hand, Iroquois can compound only noun with verb, never noun and noun as in English or verb and verb as in so many other languages. Finally, each language has its characteristic types of order of composition. In English the qualifying element regularly precedes; in certain other languages it follows. Sometimes both types are used in the same language, as in Yana, where “beef” is “bitter-venison” but “deer-liver” is expressed by “liver-deer.” The compounded object of a verb precedes the verbal element in Paiute, Nahuatl, and Iroquois, follows it in Yana, Tsimshian,[29] and the Algonkin languages.

There is a confusing variety of types of composition. These types depend on function, the nature of the combined elements, and order. In many languages, composition is limited to what we might call the delimiting function; that is, of the two or more combined elements, one gets a more precisely defined meaning from the others, which don’t contribute to the grammatical structure of the sentence. In English, for example, combined elements like red in redcoat or over in overlook simply modify the meaning of the main coat or look without actually being part of the statement expressed by the sentence. However, some languages, like Iroquois and Nahuatl,[28] use composition for much more complex purposes. In Iroquois, for instance, combining a noun in its basic form with a following verb is a common way to express case relations, especially for the subject or object. I-meat-eat is the usual Iroquois way of saying I am eating meat. In other languages, similar forms can express local, instrumental, or other relations. English words like killjoy and marplot also show the combination of a verb and a noun, but the resulting word has a strictly nominal function, not a verbal one. We can’t say he marplots. Some languages allow the combination of almost all types of elements. Paiute, for example, can combine noun with noun, adjective with noun, verb with noun to form a noun, noun with verb to create a verb, adverb with verb, and verb with verb. Yana, a Native American language from California, can freely combine noun with noun and verb with noun, but not verb with verb. On the other hand, Iroquois can only combine noun with verb, never noun with noun like English or verb with verb as in many other languages. Lastly, each language has its own typical order of composition. In English, the qualifying element usually comes first; in some other languages, it comes after. Sometimes both types appear in the same language, as in Yana, where “beef” translates to “bitter-venison” but “deer-liver” is expressed as “liver-deer.” The combined object of a verb comes before the verb in Paiute, Nahuatl, and Iroquois, while it comes after in Yana, Tsimshian,[29] and the Algonkin languages.

Of all grammatical processes affixing is incomparably the most frequently employed. There are languages, like Chinese and Siamese, that make no grammatical use of elements that do not at the same time possess an independent value as radical elements, but such languages are uncommon. Of the three types of affixing—the use of prefixes, suffixes, and infixes—suffixing is much the commonest. Indeed, it is a fair guess that suffixes do more of the formative work of language than all other methods combined. It is worth noting that there are not a few affixing languages that make absolutely no use of prefixed elements but possess a complex apparatus of suffixes. Such are Turkish, Hottentot, Eskimo, Nootka, and Yana. Some of these, like the three last mentioned, have hundreds of suffixed elements, many of them of a concreteness of significance that would demand expression in the vast majority of languages by means of radical elements. The reverse case, the use of prefixed elements to the complete exclusion of suffixes, is far less common. A good example is Khmer (or Cambodgian), spoken in French Cochin-China, though even here there are obscure traces of old suffixes that have ceased to function as such and are now felt to form part of the radical element.

Of all grammatical processes, affixing is by far the most commonly used. There are languages, like Chinese and Siamese, that don’t use elements in a grammatical way unless those elements also have independent value as root elements, but those languages are rare. Of the three types of affixing—prefixes, suffixes, and infixes—suffixing is by far the most frequent. In fact, it’s a reasonable estimate that suffixes do more to shape language than all other methods combined. It’s interesting to note that there are quite a few affixing languages that don’t use prefixed elements at all but have a complex system of suffixes. Examples include Turkish, Hottentot, Eskimo, Nootka, and Yana. Some of these, like the last three mentioned, have hundreds of suffix elements, many of which have meanings that most languages would express using root elements. The opposite scenario, using prefixed elements entirely without suffixes, is much less common. A good example is Khmer (or Cambodian), spoken in French Cochinchina, though even here there are faint traces of old suffixes that no longer work as such and are now seen as part of the root elements.

A considerable majority of known languages are prefixing and suffixing at one and the same time, but the relative importance of the two groups of affixed elements naturally varies enormously. In some languages, such as Latin and Russian, the suffixes alone relate the word to the rest of the sentence, the prefixes being confined to the expression of such ideas as delimit the concrete significance of the radical element without influencing its bearing in the proposition. A Latin form like remittebantur “they were being sent back” may serve as an illustration of this type of distribution of elements. The prefixed element re- “back” merely qualifies to a certain extent the inherent significance of the radical element mitt- “send,” while the suffixes -eba-, -nt-, and -ur convey the less concrete, more strictly formal, notions of time, person, plurality, and passivity.

A significant majority of known languages use both prefixes and suffixes at the same time, but the importance of these two types of affixed elements varies widely. In some languages, like Latin and Russian, the suffixes alone connect the word to the rest of the sentence, while the prefixes are mainly used to define ideas that clarify the specific meaning of the root element without affecting its role in the sentence. A Latin example like remittebantur “they were being sent back” illustrates this distribution of elements. The prefixed element re- “back” slightly modifies the basic meaning of the root mitt- “send,” whereas the suffixes -eba-, -nt-, and -ur express the less concrete and more formal ideas of time, person, plurality, and passivity.

On the other hand, there are languages, like the Bantu group of Africa or the Athabaskan languages[30] of North America, in which the grammatically significant elements precede, those that follow the radical element forming a relatively dispensable class. The Hupa word te-s-e-ya-te “I will go,” for example, consists of a radical element -ya- “to go,” three essential prefixes and a formally subsidiary suffix. The element te- indicates that the act takes place here and there in space or continuously over space; practically, it has no clear-cut significance apart from such verb stems as it is customary to connect it with. The second prefixed element, -s-, is even less easy to define. All we can say is that it is used in verb forms of “definite” time and that it marks action as in progress rather than as beginning or coming to an end. The third prefix, -e-, is a pronominal element, “I,” which can be used only in “definite” tenses. It is highly important to understand that the use of -e- is conditional on that of -s- or of certain alternative prefixes and that te- also is in practice linked with -s-. The group te-s-e-ya is a firmly knit grammatical unit. The suffix -te, which indicates the future, is no more necessary to its formal balance than is the prefixed re- of the Latin word; it is not an element that is capable of standing alone but its function is materially delimiting rather than strictly formal.[31]

On the other hand, there are languages, like the Bantu group in Africa or the Athabaskan languages[30] of North America, where the important grammatical elements come before those that follow the root element, which forms a relatively optional class. The Hupa word te-s-e-ya-te means “I will go.” It consists of a root element -ya- meaning “to go,” three key prefixes, and a less important suffix. The prefix te- shows that the action happens here and there in space or continuously over space; it doesn't have a clear meaning on its own apart from the verb stems it is commonly associated with. The second prefix, -s-, is even harder to define. All we can say is that it's used in verb forms indicating “definite” time and marks actions as ongoing rather than starting or stopping. The third prefix, -e-, is a pronoun meaning “I,” and it can only be used in “definite” tenses. It's important to understand that using -e- depends on using -s- or certain other prefixes, and that te- is also typically linked with -s-. The group te-s-e-ya is a tightly connected grammatical unit. The suffix -te, which indicates the future, is as unnecessary for its formal structure as the prefixed re- in the Latin word; it's not an element that can stand alone, but its role is more about defining the meaning than just maintaining a formal structure.[31]

It is not always, however, that we can clearly set off the suffixes of a language as a group against its prefixes. In probably the majority of languages that use both types of affixes each group has both delimiting and formal or relational functions. The most that we can say is that a language tends to express similar functions in either the one or the other manner. If a certain verb expresses a certain tense by suffixing, the probability is strong that it expresses its other tenses in an analogous fashion and that, indeed, all verbs have suffixed tense elements. Similarly, we normally expect to find the pronominal elements, so far as they are included in the verb at all, either consistently prefixed or suffixed. But these rules are far from absolute. We have already seen that Hebrew prefixes its pronominal elements in certain cases, suffixes them in others. In Chimariko, an Indian language of California, the position of the pronominal affixes depends on the verb; they are prefixed for certain verbs, suffixed for others.

It’s not always easy to clearly distinguish the suffixes of a language from its prefixes as distinct groups. In fact, in most languages that use both types of affixes, each group serves both delimiting and formal or relational functions. All we can really say is that a language tends to express similar functions in one way or the other. If a specific verb indicates a certain tense with a suffix, it's likely that it will express its other tenses in a similar way, and that, in fact, all verbs will have suffixed tense elements. Likewise, we usually expect to find pronominal elements, when they are included in the verb at all, consistently either prefixed or suffixed. But these rules aren’t absolute. We’ve already seen that Hebrew prefixes its pronominal elements in some cases and suffixes them in others. In Chimariko, a Native American language from California, the position of the pronominal affixes depends on the verb; they are prefixed for certain verbs and suffixed for others.

It will not be necessary to give many further examples of prefixing and suffixing. One of each category will suffice to illustrate their formative possibilities. The idea expressed in English by the sentence I came to give it to her is rendered in Chinook[32] by i-n-i-a-l-u-d-am. This word—and it is a thoroughly unified word with a clear-cut accent on the first a—consists of a radical element, -d- “to give,” six functionally distinct, if phonetically frail, prefixed elements, and a suffix. Of the prefixes, i- indicates recently past time; n-, the pronominal subject “I”; -i-, the pronominal object “it”;[33] -a-, the second pronominal object “her”; -l-, a prepositional element indicating that the preceding pronominal prefix is to be understood as an indirect object (-her-to-, i.e., “to her”); and -u-, an element that it is not easy to define satisfactorily but which, on the whole, indicates movement away from the speaker. The suffixed -am modifies the verbal content in a local sense; it adds to the notion conveyed by the radical element that of “arriving” or “going (or coming) for that particular purpose.” It is obvious that in Chinook, as in Hupa, the greater part of the grammatical machinery resides in the prefixes rather than in the suffixes.

It won't be necessary to provide many more examples of prefixes and suffixes. One from each category will be enough to demonstrate their forming possibilities. The idea conveyed in English by the sentence I came to give it to her is expressed in Chinook[32] as i-n-i-a-l-u-d-am. This word—and it’s a completely unified word with a clear accent on the first a—contains a root element, -d- meaning “to give,” six functionally distinct, though phonetically weak, prefixes, and a suffix. Among the prefixes, i- shows recently past time; n- indicates the pronoun “I”; -i- represents the pronoun “it”;[33] -a- stands for the second pronoun “her”; -l- is a prepositional element indicating that the preceding pronoun is to be understood as an indirect object (-her-to-, meaning “to her”); and -u- is an element that’s hard to define satisfactorily but generally indicates movement away from the speaker. The suffix -am modifies the verbal content in a local sense; it adds to the idea expressed by the root the notion of “arriving” or “going (or coming) for that specific purpose.” It’s clear that in Chinook, just like in Hupa, most of the grammatical structure is found in the prefixes rather than the suffixes.

A reverse case, one in which the grammatically significant elements cluster, as in Latin, at the end of the word is yielded by Fox, one of the better known Algonkin languages of the Mississippi Valley. We may take the form eh-kiwi-n-a-m-oht-ati-wa-ch(i) “then they together kept (him) in flight from them.” The radical element here is kiwi-, a verb stem indicating the general notion of “indefinite movement round about, here and there.” The prefixed element eh- is hardly more than an adverbial particle indicating temporal subordination; it may be conveniently rendered as “then.” Of the seven suffixes included in this highly-wrought word, -n- seems to be merely a phonetic element serving to connect the verb stem with the following -a-;[34] -a- is a “secondary stem”[35] denoting the idea of “flight, to flee”; -m- denotes causality with reference to an animate object;[36] -o(ht)- indicates activity done for the subject (the so-called “middle” or “medio-passive” voice of Greek); -(a)ti- is a reciprocal element, “one another”; -wa-ch(i) is the third person animate plural (-wa-, plural; -chi, more properly personal) of so-called “conjunctive” forms. The word may be translated more literally (and yet only approximately as to grammatical feeling) as “then they (animate) caused some animate being to wander about in flight from one another of themselves.” Eskimo, Nootka, Yana, and other languages have similarly complex arrays of suffixed elements, though the functions performed by them and their principles of combination differ widely.

A reverse case, where the important grammatical elements cluster at the end of the word, like in Latin, can be seen in Fox, one of the better-known Algonkin languages of the Mississippi Valley. We can look at the form eh-kiwi-n-a-m-oht-ati-wa-ch(i), which means “then they together kept (him) in flight from them.” The core part here is kiwi-, a verb stem that refers to the general idea of “indefinite movement around, here and there.” The prefixed element eh- is mostly an adverbial particle suggesting temporal subordination; it can be translated conveniently as “then.” Of the seven suffixes in this complex word, -n- seems to be just a phonetic element that connects the verb stem to the following -a-; -a- serves as a “secondary stem” that means “flight, to flee”; -m- indicates causality relating to an animate subject; -o(ht)- suggests action performed for the subject (similar to the “middle” or “medio-passive” voice in Greek); -(a)ti- is a reciprocal element meaning “one another”; -wa-ch(i) is the third person animate plural (-wa-, plural; -chi, more precisely personal) of the so-called “conjunctive” forms. The word can be more literally translated (though only approximately in terms of grammatical feel) as “then they (animate) caused some animate being to wander around in flight from one another of themselves.” Eskimo, Nootka, Yana, and other languages have similarly complex arrays of suffixed elements, although the functions they serve and how they combine differ greatly.

We have reserved the very curious type of affixation known as “infixing” for separate illustration. It is utterly unknown in English, unless we consider the -n- of stand (contrast stood) as an infixed element. The earlier Indo-European languages, such as Latin, Greek and Sanskrit, made a fairly considerable use of infixed nasals to differentiate the present tense of a certain class of verbs from other forms (contrast Latin vinc-o “I conquer” with vic-i “I conquered”; Greek lamb-an-o “I take” with e-lab-on “I took”). There are, however, more striking examples of the process, examples in which it has assumed a more clearly defined function than in these Latin and Greek cases. It is particularly prevalent in many languages of southeastern Asia and of the Malay archipelago. Good examples from Khmer (Cambodgian) are tmeu “one who walks” and daneu “walking” (verbal noun), both derived from deu “to walk.” Further examples may be quoted from Bontoc Igorot, a Filipino language. Thus, an infixed -in- conveys the idea of the product of an accomplished action, e.g., kayu “wood,” kinayu “gathered wood.” Infixes are also freely used in the Bontoc Igorot verb. Thus, an infixed -um- is characteristic of many intransitive verbs with personal pronominal suffixes, e.g., sad- “to wait,” sumid-ak “I wait”; kineg “silent,” kuminek-ak “I am silent.” In other verbs it indicates futurity, e.g., tengao- “to celebrate a holiday,” tumengao-ak “I shall have a holiday.” The past tense is frequently indicated by an infixed -in-; if there is already an infixed -um-, the two elements combine to -in-m-, e.g., kinminek-ak “I am silent.” Obviously the infixing process has in this (and related) languages the same vitality that is possessed by the commoner prefixes and suffixes of other languages. The process is also found in a number of aboriginal American languages. The Yana plural is sometimes formed by an infixed element, e.g., k’uruwi “medicine-men,” k’uwi “medicine-man”; in Chinook an infixed -l- is used in certain verbs to indicate repeated activity, e.g., ksik’ludelk “she keeps looking at him,” iksik’lutk “she looked at him” (radical element -tk). A peculiarly interesting type of infixation is found in the Siouan languages, in which certain verbs insert the pronominal elements into the very body of the radical element, e.g., Sioux cheti “to build a fire,” chewati “I build a fire”; shuta “to miss,” shuunta-pi “we miss.”

We have set aside a very interesting type of affixation known as “infixing” for separate discussion. It is completely unknown in English, unless we think of the -n- in stand (compare with stood) as an infixed element. The earlier Indo-European languages, like Latin, Greek, and Sanskrit, used infixed nasals quite extensively to distinguish the present tense of certain verbs from other forms (compare Latin vinc-o “I conquer” with vic-i “I conquered”; Greek lamb-an-o “I take” with e-lab-on “I took”). However, there are more striking examples of this process where it has a clearer function than in the Latin and Greek cases. It is especially common in many languages of southeastern Asia and the Malay archipelago. Good examples from Khmer (Cambodian) are tmeu “one who walks” and daneu “walking” (verbal noun), both derived from deu “to walk.” More examples can be found in Bontoc Igorot, a Filipino language. In this language, an infixed -in- indicates the result of a completed action, like kayu “wood,” kinayu “gathered wood.” Infixes are also commonly used in Bontoc Igorot verbs. For example, an infixed -um- is typical of many intransitive verbs with personal pronoun suffixes, e.g., sad- “to wait,” sumid-ak “I wait”; kineg “silent,” kuminek-ak “I am silent.” In other verbs, it conveys future tense, e.g., tengao- “to celebrate a holiday,” tumengao-ak “I shall have a holiday.” The past tense is often indicated by an infixed -in-; if there’s already an infixed -um-, the two elements combine to -in-m-, e.g., kinminek-ak “I am silent.” Clearly, the infixing process exhibits the same vitality in this (and similar) languages as the more common prefixes and suffixes found in other languages. This process is also present in several indigenous American languages. The Yana plural is sometimes formed using an infixed element, e.g., k’uruwi “medicine-men,” k’uwi “medicine-man”; in Chinook, an infixed -l- is used in specific verbs to indicate repeated actions, e.g., ksik’ludelk “she keeps looking at him,” iksik’lutk “she looked at him” (radical element -tk). An especially interesting type of infixation is found in the Siouan languages, where certain verbs insert pronoun elements directly into the structure of the root element, e.g., Sioux cheti “to build a fire,” chewati “I build a fire”; shuta “to miss,” shuunta-pi “we miss.”

A subsidiary but by no means unimportant grammatical process is that of internal vocalic or consonantal change. In some languages, as in English (sing, sang, sung, song; goose, geese), the former of these has become one of the major methods of indicating fundamental changes of grammatical function. At any rate, the process is alive enough to lead our children into untrodden ways. We all know of the growing youngster who speaks of having brung something, on the analogy of such forms as sung and flung. In Hebrew, as we have seen, vocalic change is of even greater significance than in English. What is true of Hebrew is of course true of all other Semitic languages. A few examples of so-called “broken” plurals from Arabic[37] will supplement the Hebrew verb forms that I have given in another connection. The noun balad “place” has the plural form bilad;[38] gild “hide” forms the plural gulud; ragil “man,” the plural rigal; shibbak “window,” the plural shababik. Very similar phenomena are illustrated by the Hamitic languages of Northern Africa, e.g., Shilh[39] izbil “hair,” plural izbel; a-slem “fish,” plural i-slim-en; sn “to know,” sen “to be knowing”; rmi “to become tired,” rumni “to be tired”; ttss[40] “to fall asleep,” ttoss “to sleep.” Strikingly similar to English and Greek alternations of the type sing—sang and leip-o “I leave,” leloip-a “I have left,” are such Somali[41] cases as al “I am,” il “I was”; i-dah-a “I say,” i-di “I said,” deh “say!”

A less obvious but still important grammatical process is internal vowel or consonant change. In some languages, like English (sing, sang, sung, song; goose, geese), this process has become a key way to show significant changes in grammatical function. In any case, this process is active enough that it leads our kids to create new forms. We all know that growing child who says they brung something, following examples like sung and flung. In Hebrew, as we’ve seen, vowel change is even more important than in English. What’s true for Hebrew is also true for all other Semitic languages. A few examples of so-called “broken” plurals from Arabic[37] will complement the Hebrew verb forms I shared before. The noun balad meaning “place” has the plural form bilad; gild meaning “hide” becomes the plural gulud; ragil meaning “man” has the plural rigal; shibbak meaning “window” turns into the plural shababik. Very similar patterns appear in the Hamitic languages of Northern Africa, such as Shilh[39] where izbil means “hair,” with the plural izbel; a-slem meaning “fish,” forms the plural i-slim-en; sn meaning “to know” becomes sen meaning “to be knowing”; rmi meaning “to become tired” becomes rumni meaning “to be tired”; ttss[40] meaning “to fall asleep” turns into ttoss meaning “to sleep.” Strikingly similar to English and Greek variations like sing—sang and leip-o meaning “I leave,” leloip-a meaning “I have left,” are examples in Somali[41] such as al meaning “I am,” il meaning “I was”; i-dah-a meaning “I say,” i-di meaning “I said,” deh meaning “say!”

Vocalic change is of great significance also in a number of American Indian languages. In the Athabaskan group many verbs change the quality or quantity of the vowel of the radical element as it changes its tense or mode. The Navaho verb for “I put (grain) into a receptacle” is bi-hi-sh-ja, in which -ja is the radical element; the past tense, bi-hi-ja’, has a long a-vowel, followed by the “glottal stop”[42]; the future is bi-h-de-sh-ji with complete change of vowel. In other types of Navaho verbs the vocalic changes follow different lines, e.g., yah-a-ni-ye “you carry (a pack) into (a stable)”; past, yah-i-ni-yin (with long i in -yin; -n is here used to indicate nasalization); future, yah-a-di-yehl (with long e). In another Indian language, Yokuts[43], vocalic modifications affect both noun and verb forms. Thus, buchong “son” forms the plural bochang-i (contrast the objective buchong-a); enash “grandfather,” the plural inash-a; the verb engtyim “to sleep” forms the continuative ingetym-ad “to be sleeping” and the past ingetym-ash.

Vocalic change is also really important in several American Indian languages. In the Athabaskan group, many verbs change the quality or length of the vowel in the root as they switch between tenses or moods. The Navaho verb for “I put (grain) into a receptacle” is bi-hi-sh-ja, where -ja is the root; the past tense, bi-hi-ja’, has a long a vowel, followed by a “glottal stop”[42]; the future form is bi-h-de-sh-ji with a completely different vowel. In other types of Navaho verbs, the vowel changes vary, such as in yah-a-ni-ye “you carry (a pack) into (a stable)”; in the past, it becomes yah-i-ni-yin (with a long i in -yin; -n is used here to show nasalization); the future form is yah-a-di-yehl (with a long e). In another Indian language, Yokuts[43], vowel modifications affect both noun and verb forms. For example, buchong “son” becomes the plural bochang-i (contrast this with the objective buchong-a); enash “grandfather” has the plural inash-a; the verb engtyim “to sleep” forms the continuative ingetym-ad “to be sleeping” and the past ingetym-ash.

Consonantal change as a functional process is probably far less common than vocalic modifications, but it is not exactly rare. There is an interesting group of cases in English, certain nouns and corresponding verbs differing solely in that the final consonant is voiceless or voiced. Examples are wreath (with th as in think), but to wreathe (with th as in then); house, but to house (with s pronounced like z). That we have a distinct feeling for the interchange as a means of distinguishing the noun from the verb is indicated by the extension of the principle by many Americans to such a noun as rise (e.g., the rise of democracy)—pronounced like rice—in contrast to the verb to rise (s like z).

Consonant changes as a functional process are likely much less common than vowel changes, but they’re not exactly rare. There is an interesting group of cases in English where certain nouns and their corresponding verbs differ only in whether the final consonant is voiceless or voiced. For example, wreath (with th as in think) but to wreathe (with th as in then); house, but to house (with s pronounced like z). The fact that we clearly recognize this interchange as a way to distinguish the noun from the verb is shown by how many Americans extend this principle to a noun like rise (e.g., the rise of democracy)—pronounced like rice—in contrast to the verb to rise (s like z).

In the Celtic languages the initial consonants undergo several types of change according to the grammatical relation that subsists between the word itself and the preceding word. Thus, in modern Irish, a word like bo “ox” may under the appropriate circumstances, take the forms bho (pronounce wo) or mo (e.g., an bo “the ox,” as a subject, but tir na mo “land of the oxen,” as a possessive plural). In the verb the principle has as one of its most striking consequences the “aspiration” of initial consonants in the past tense. If a verb begins with t, say, it changes the t to th (now pronounced h) in forms of the past; if it begins with g, the consonant changes, in analogous forms, to gh (pronounced like a voiced spirant[44] g or like y, according to the nature of the following vowel). In modern Irish the principle of consonantal change, which began in the oldest period of the language as a secondary consequence of certain phonetic conditions, has become one of the primary grammatical processes of the language.

In the Celtic languages, the initial consonants change in different ways depending on the grammatical relationship between the word and the one before it. For example, in modern Irish, a word like bo “ox” can take the forms bho (pronounced wo) or mo under certain conditions (e.g., an bo means “the ox” as a subject, but tir na mo means “land of the oxen” as a possessive plural). In verbs, one of the most noticeable effects of this principle is the “aspiration” of initial consonants in the past tense. For instance, if a verb starts with t, it changes to th (now pronounced h) in past forms; if it begins with g, that consonant changes to gh (pronounced like a voiced spirant[44] g or like y, depending on the next vowel). In modern Irish, the principle of consonantal change, which originated in the earliest period of the language as a secondary result of specific phonetic conditions, has evolved into one of the main grammatical processes of the language.

Perhaps as remarkable as these Irish phenomena are the consonantal interchanges of Ful, an African language of the Soudan. Here we find that all nouns belonging to the personal class form the plural by changing their initial g, j, d, b, k, ch, and p to y (or w), y, r, w, h, s and f respectively; e.g., jim-o “companion,” yim-’be “companions”; pio-o “beater,” fio-’be “beaters.” Curiously enough, nouns that belong to the class of things form their singular and plural in exactly reverse fashion, e.g., yola-re “grass-grown place,” jola-je “grass-grown places”; fitan-du “soul,” pital-i “souls.” In Nootka, to refer to but one other language in which the process is found, the t or tl[45] of many verbal suffixes becomes hl in forms denoting repetition, e.g., hita-’ato “to fall out,” hita-’ahl “to keep falling out”; mat-achisht-utl “to fly on to the water,” mat-achisht-ohl “to keep flying on to the water.” Further, the hl of certain elements changes to a peculiar h-sound in plural forms, e.g., yak-ohl “sore-faced,” yak-oh “sore-faced (people).”

Perhaps as remarkable as these Irish phenomena are the consonantal changes in Ful, an African language from the Soudan. Here, we see that all nouns in the personal class form the plural by changing their initial g, j, d, b, k, ch, and p to y (or w), y, r, w, h, s, and f respectively; for example, jim-o “companion,” yim-’be “companions”; pio-o “beater,” fio-’be “beaters.” Interestingly, nouns that belong to the class of things form their singular and plural in exactly the opposite way, e.g., yola-re “grass-grown place,” jola-je “grass-grown places”; fitan-du “soul,” pital-i “souls.” In Nootka, to mention just one other language where this process occurs, the t or tl[45] in many verbal suffixes changes to hl in forms indicating repetition, e.g., hita-’ato “to fall out,” hita-’ahl “to keep falling out”; mat-achisht-utl “to fly on to the water,” mat-achisht-ohl “to keep flying on to the water.” Furthermore, the hl in certain forms changes to a unique h-sound in plural forms, e.g., yak-ohl “sore-faced,” yak-oh “sore-faced (people).”

Nothing is more natural than the prevalence of reduplication, in other words, the repetition of all or part of the radical element. The process is generally employed, with self-evident symbolism, to indicate such concepts as distribution, plurality, repetition, customary activity, increase of size, added intensity, continuance. Even in English it is not unknown, though it is not generally accounted one of the typical formative devices of our language. Such words as goody-goody and to pooh-pooh have become accepted as part of our normal vocabulary, but the method of duplication may on occasion be used more freely than is indicated by such stereotyped examples. Such locutions as a big big man or Let it cool till it’s thick thick are far more common, especially in the speech of women and children, than our linguistic text-books would lead one to suppose. In a class by themselves are the really enormous number of words, many of them sound-imitative or contemptuous in psychological tone, that consist of duplications with either change of the vowel or change of the initial consonant—words of the type sing-song, riff-raff, wishy-washy, harum-skarum, roly-poly. Words of this type are all but universal. Such examples as the Russian Chudo-Yudo (a dragon), the Chinese ping-pang “rattling of rain on the roof,”[46] the Tibetan kyang-kyong “lazy,” and the Manchu porpon parpan “blear-eyed” are curiously reminiscent, both in form and in psychology, of words nearer home. But it can hardly be said that the duplicative process is of a distinctively grammatical significance in English. We must turn to other languages for illustration. Such cases as Hottentot go-go “to look at carefully” (from go “to see”), Somali fen-fen “to gnaw at on all sides” (from fen “to gnaw at”), Chinook iwi iwi “to look about carefully, to examine” (from iwi “to appear”), or Tsimshian am’am “several (are) good” (from am “good”) do not depart from the natural and fundamental range of significance of the process. A more abstract function is illustrated in Ewe,[47] in which both infinitives and verbal adjectives are formed from verbs by duplication; e.g., yi “to go,” yiyi “to go, act of going”; wo “to do,” wowo[48] “done”; mawomawo “not to do” (with both duplicated verb stem and duplicated negative particle). Causative duplications are characteristic of Hottentot, e.g., gam-gam[49] “to cause to tell” (from gam “to tell”). Or the process may be used to derive verbs from nouns, as in Hottentot khoe-khoe “to talk Hottentot” (from khoe-b “man, Hottentot”), or as in Kwakiutl metmat “to eat clams” (radical element met- “clam”).

Nothing is more natural than the use of reduplication, or the repetition of all or part of the root element. This process is typically used, with clear symbolism, to express ideas like distribution, plurality, repetition, customary action, size increase, added intensity, and continuation. Even in English, it's not uncommon, although it isn't generally recognized as a typical formative device in our language. Words like goody-goody and to pooh-pooh have become accepted in our everyday vocabulary, but the method of duplication might sometimes be used more freely than suggested by those typical examples. Phrases like a big big man or Let it cool till it’s thick thick are much more common, especially among women and children, than our linguistic textbooks would imply. There are also a significant number of words, many of which are sound-imitative or convey contempt, that consist of duplications with either a change in the vowel or the initial consonant—words like sing-song, riff-raff, wishy-washy, harum-skarum, and roly-poly. These types of words are almost universal. Examples like the Russian Chudo-Yudo (a dragon), the Chinese ping-pang “rattling of rain on the roof,”[46] the Tibetan kyang-kyong “lazy,” and the Manchu porpon parpan “blear-eyed” are intriguingly similar, both in form and psychology, to words closer to home. However, it's hard to say that the duplicative process has a distinct grammatical significance in English. We must look to other languages for examples. Cases like Hottentot go-go “to look at carefully” (from go “to see”), Somali fen-fen “to gnaw at on all sides” (from fen “to gnaw at”), Chinook iwi iwi “to look around carefully, to examine” (from iwi “to appear”), or Tsimshian am’am “several (are) good” (from am “good”) stay within the natural and essential range of meaning of the process. A more abstract function appears in Ewe,[47] where both infinitives and verbal adjectives are formed from verbs by duplication; for example, yi “to go,” yiyi “to go, act of going”; wo “to do,” wowo[48] “done”; mawomawo “not to do” (with both the duplicated verb stem and duplicated negative particle). Causative duplications are common in Hottentot, such as gam-gam[49] “to cause to tell” (from gam “to tell”). The process can also be used to create verbs from nouns, as in Hottentot khoe-khoe “to talk Hottentot” (from khoe-b “man, Hottentot”), or in Kwakiutl metmat “to eat clams” (the root element met- “clam”).

The most characteristic examples of reduplication are such as repeat only part of the radical element. It would be possible to demonstrate the existence of a vast number of formal types of such partial duplication, according to whether the process makes use of one or more of the radical consonants, preserves or weakens or alters the radical vowel, or affects the beginning, the middle, or the end of the radical element. The functions are even more exuberantly developed than with simple duplication, though the basic notion, at least in origin, is nearly always one of repetition or continuance. Examples illustrating this fundamental function can be quoted from all parts of the globe. Initially reduplicating are, for instance, Shilh ggen “to be sleeping” (from gen “to sleep”); Ful pepeu-’do “liar” (i.e., “one who always lies”), plural fefeu-’be (from fewa “to lie”); Bontoc Igorot anak “child,” ananak “children”; kamu-ek “I hasten,” kakamu-ek “I hasten more”; Tsimshian gyad “person,” gyigyad “people”; Nass gyibayuk “to fly,” gyigyibayuk “one who is flying.” Psychologically comparable, but with the reduplication at the end, are Somali ur “body,” plural urar; Hausa suna “name,” plural sunana-ki; Washo[50] gusu “buffalo,” gususu “buffaloes”; Takelma[51] himi-d- “to talk to,” himim-d- “to be accustomed to talk to.” Even more commonly than simple duplication, this partial duplication of the radical element has taken on in many languages functions that seem in no way related to the idea of increase. The best known examples are probably the initial reduplication of our older Indo-European languages, which helps to form the perfect tense of many verbs (e.g., Sanskrit dadarsha “I have seen,” Greek leloipa “I have left,” Latin tetigi “I have touched,” Gothic lelot “I have let”). In Nootka reduplication of the radical element is often employed in association with certain suffixes; e.g., hluch- “woman” forms hluhluch-’ituhl “to dream of a woman,” hluhluch-k’ok “resembling a woman.” Psychologically similar to the Greek and Latin examples are many Takelma cases of verbs that exhibit two forms of the stem, one employed in the present or past, the other in the future and in certain modes and verbal derivatives. The former has final reduplication, which is absent in the latter; e.g., al-yebeb-i’n “I show (or showed) to him,” al-yeb-in “I shall show him.”

The most typical examples of reduplication are those that repeat only part of the base element. It's possible to showcase a large variety of formal types of this kind of partial duplication, depending on whether the process uses one or more of the base consonants, maintains, weakens, or alters the base vowel, or affects the start, middle, or end of the base element. The functions are even more richly developed than with simple duplication, though the basic idea, at least in its origin, is almost always one of repetition or continuation. Examples that illustrate this fundamental function can be found all over the world. For instance, in Shilh, ggen means “to be sleeping” (from gen “to sleep”); in Ful, pepeu-’do means “liar” (meaning “one who always lies”), with the plural being fefeu-’be (from fewa “to lie”); in Bontoc Igorot, anak means “child,” while ananak means “children”; kamu-ek means “I hasten,” and kakamu-ek translates to “I hasten more”; in Tsimshian, gyad means “person,” and gyigyad means “people”; in Nass, gyibayuk means “to fly,” and gyigyibayuk means “one who is flying.” Psychologically similar, but with the reduplication at the end, are Somali ur meaning “body,” and its plural urar; Hausa suna means “name,” with the plural sunana-ki; Washo gusu means “buffalo,” while gususu means “buffaloes”; Takelma himi-d- means “to talk to,” and himim-d- means “to be accustomed to talk to.” Even more commonly than simple duplication, this partial duplication of the base element has in many languages taken on functions that seem unrelated to the idea of increase. The most well-known examples are likely the initial reduplication in our older Indo-European languages, which helps to form the perfect tense of many verbs (e.g., Sanskrit dadarsha means “I have seen,” Greek leloipa means “I have left,” Latin tetigi means “I have touched,” Gothic lelot means “I have let”). In Nootka, reduplication of the base element is often used with certain suffixes; e.g., hluch- meaning “woman” forms hluhluch-’ituhl meaning “to dream of a woman,” and hluhluch-k’ok meaning “resembling a woman.” Psychologically similar to the Greek and Latin examples are many Takelma cases of verbs that show two forms of the stem, one used in the present or past, and the other in the future and certain modes and verbal derivatives. The former has final reduplication, which does not appear in the latter; e.g., al-yebeb-i’n means “I show (or showed) to him,” while al-yeb-in means “I shall show him.”

We come now to the subtlest of all grammatical processes, variations in accent, whether of stress or pitch. The chief difficulty in isolating accent as a functional process is that it is so often combined with alternations in vocalic quantity or quality or complicated by the presence of affixed elements that its grammatical value appears as a secondary rather than as a primary feature. In Greek, for instance, it is characteristic of true verbal forms that they throw the accent back as far as the general accentual rules will permit, while nouns may be more freely accented. There is thus a striking accentual difference between a verbal form like eluthemen “we were released,” accented on the second syllable of the word, and its participial derivative lutheis “released,” accented on the last. The presence of the characteristic verbal elements e- and -men in the first case and of the nominal -s in the second tends to obscure the inherent value of the accentual alternation. This value comes out very neatly in such English doublets as to refund and a refund, to extract and an extract, to come down and a come down, to lack luster and lack-luster eyes, in which the difference between the verb and the noun is entirely a matter of changing stress. In the Athabaskan languages there are not infrequently significant alternations of accent, as in Navaho ta-di-gis “you wash yourself” (accented on the second syllable), ta-di-gis “he washes himself” (accented on the first).[52]

We now come to one of the most subtle grammatical processes: variations in accent, whether it's stress or pitch. The main challenge in isolating accent as a functional process is that it often gets mixed up with changes in vowel length or quality, or is complicated by added elements, making its grammatical value seem secondary instead of primary. In Greek, for example, true verb forms tend to push the accent back as far as the general accent rules allow, while nouns can have a more flexible accent. This creates a noticeable accent difference between a verb like eluthemen “we were released,” which has the accent on the second syllable, and its participial form lutheis “released,” which is accented on the last syllable. The presence of the specific verbal elements e- and -men in the first case, and the nominal -s in the second, tends to hide the essential value of the accentual change. This distinction becomes very clear in English pairs like to refund and a refund, to extract and an extract, to come down and a come down, to lack luster and lack-luster eyes, where the difference between the verb and the noun relies solely on changing stress. In the Athabaskan languages, there are often significant changes in accent as seen in Navaho ta-di-gis “you wash yourself” (accented on the second syllable) and ta-di-gis “he washes himself” (accented on the first).[52]

Pitch accent may be as functional as stress and is perhaps more often so. The mere fact, however, that pitch variations are phonetically essential to the language, as in Chinese (e.g., feng “wind” with a level tone, feng “to serve” with a falling tone) or as in classical Greek (e.g., lab-on “having taken” with a simple or high tone on the suffixed participial -on, gunaik-on “of women” with a compound or falling tone on the case suffix -on) does not necessarily constitute a functional, or perhaps we had better say grammatical, use of pitch. In such cases the pitch is merely inherent in the radical element or affix, as any vowel or consonant might be. It is different with such Chinese alternations as chung (level) “middle” and chung (falling) “to hit the middle”; mai (rising) “to buy” and mai (falling) “to sell”; pei (falling) “back” and pei (level) “to carry on the back.” Examples of this type are not exactly common in Chinese and the language cannot be said to possess at present a definite feeling for tonal differences as symbolic of the distinction between noun and verb.

Pitch accent can be just as functional as stress, and it might be even more so. However, the fact that pitch variations are phonemically important in languages like Chinese (e.g., feng “wind” with a level tone, feng “to serve” with a falling tone) or in classical Greek (e.g., lab-on “having taken” with a simple or high tone on the suffixed participial -on, gunaik-on “of women” with a compound or falling tone on the case suffix -on) doesn’t necessarily mean it serves a functional, or perhaps we should say grammatical, role for pitch. In those instances, the pitch is just a natural part of the root element or affix, similar to any vowel or consonant. It's different with Chinese variations like chung (level) “middle” and chung (falling) “to hit the middle”; mai (rising) “to buy” and mai (falling) “to sell”; pei (falling) “back” and pei (level) “to carry on the back.” Such examples aren't exactly common in Chinese, and the language can't be said to really have a clear sense of tonal differences as a symbol of the distinction between noun and verb.

There are languages, however, in which such differences are of the most fundamental grammatical importance. They are particularly common in the Soudan. In Ewe, for instance, there are formed from subo “to serve” two reduplicated forms, an infinitive subosubo “to serve,” with a low tone on the first two syllables and a high one on the last two, and an adjectival subosubo “serving,” in which all the syllables have a high tone. Even more striking are cases furnished by Shilluk, one of the languages of the headwaters of the Nile. The plural of the noun often differs in tone from the singular, e.g., yit (high) “ear” but yit (low) “ears.” In the pronoun three forms may be distinguished by tone alone; e “he” has a high tone and is subjective, -e “him” (e.g., a chwol-e “he called him”) has a low tone and is objective, -e “his” (e.g., wod-e “his house”) has a middle tone and is possessive. From the verbal element gwed- “to write” are formed gwed-o “(he) writes” with a low tone, the passive gwet “(it was) written” with a falling tone, the imperative gwet “write!” with a rising tone, and the verbal noun gwet “writing” with a middle tone. In aboriginal America also pitch accent is known to occur as a grammatical process. A good example of such a pitch language is Tlingit, spoken by the Indians of the southern coast of Alaska. In this language many verbs vary the tone of the radical element according to tense; hun “to sell,” sin “to hide,” tin “to see,” and numerous other radical elements, if low-toned, refer to past time, if high-toned, to the future. Another type of function is illustrated by the Takelma forms hel “song,” with falling pitch, but hel “sing!” with a rising inflection; parallel to these forms are sel (falling) “black paint,” sel (rising) “paint it!” All in all it is clear that pitch accent, like stress and vocalic or consonantal modifications, is far less infrequently employed as a grammatical process than our own habits of speech would prepare us to believe probable.

There are languages, however, where such differences are extremely important for grammar. They are especially common in the Soudan. For example, in Ewe, the word subo “to serve” produces two reduplicated forms: an infinitive subosubo “to serve,” which has a low tone on the first two syllables and a high tone on the last two, and an adjectival subosubo “serving,” where all the syllables have a high tone. Even more notable are examples from Shilluk, one of the languages spoken in the headwaters of the Nile. The plural form of a noun often has a different tone from the singular, e.g., yit (high) “ear” and yit (low) “ears.” The pronoun has three forms that can be distinguished by tone alone; e “he” is high-toned and subjective, -e “him” (e.g., a chwol-e “he called him”) has a low tone and is objective, while -e “his” (e.g., wod-e “his house”) has a middle tone and is possessive. From the verb gwed- “to write,” we get gwed-o “(he) writes” with a low tone, the passive gwet “(it was) written” with a falling tone, the imperative gwet “write!” with a rising tone, and the verbal noun gwet “writing” with a middle tone. In native America, pitch accent also appears as a grammatical feature. A good example of such a pitch language is Tlingit, spoken by the Indigenous people along the southern coast of Alaska. In this language, many verbs change the tone of their root depending on the tense; hun “to sell,” sin “to hide,” tin “to see,” and several other root elements, when low-toned, indicate past time, while high-toned ones refer to the future. Another function type is illustrated by the Takelma forms hel “song” with a falling pitch, but hel “sing!” with a rising inflection; similar to these forms are sel (falling) “black paint” and sel (rising) “paint it!” Overall, it's clear that pitch accent, like stress and modifications in vowels or consonants, is used as a grammatical process much more often than we might expect based on our own speech patterns.

V

Form in Language: Grammatical Concepts

We have seen that the single word expresses either a simple concept or a combination of concepts so interrelated as to form a psychological unity. We have, furthermore, briefly reviewed from a strictly formal standpoint the main processes that are used by all known languages to affect the fundamental concepts—those embodied in unanalyzable words or in the radical elements of words—by the modifying or formative influence of subsidiary concepts. In this chapter we shall look a little more closely into the nature of the world of concepts, in so far as that world is reflected and systematized in linguistic structure.

We’ve seen that a single word can express either a simple idea or a combination of ideas that are so closely linked they create a psychological unity. Additionally, we’ve briefly examined from a purely formal perspective the main processes that all known languages use to convey fundamental concepts—those found in unanalyzable words or in the core elements of words—through the modifying or shaping influence of secondary concepts. In this chapter, we’ll explore more closely the nature of the world of concepts as it is reflected and organized in language structure.

Let us begin with a simple sentence that involves various kinds of concepts—the farmer kills the duckling. A rough and ready analysis discloses here the presence of three distinct and fundamental concepts that are brought into connection with each other in a number of ways. These three concepts are “farmer” (the subject of discourse), “kill” (defining the nature of the activity which the sentence informs us about), and “duckling” (another subject[53] of discourse that takes an important though somewhat passive part in this activity). We can visualize the farmer and the duckling and we have also no difficulty in constructing an image of the killing. In other words, the elements farmer, kill, and duckling define concepts of a concrete order.

Let’s start with a straightforward sentence involving different concepts—the farmer kills the duckling. A basic analysis reveals three distinct and essential concepts that connect with each other in various ways. These three concepts are “farmer” (the subject of discussion), “kill” (which describes the activity the sentence talks about), and “duckling” (another subject[53] of discussion that plays an important but somewhat passive role in this activity). We can easily picture both the farmer and the duckling, and we also have no trouble imagining the act of killing. In other words, the elements farmer, kill, and duckling define concepts that are concrete.

But a more careful linguistic analysis soon brings us to see that the two subjects of discourse, however simply we may visualize them, are not expressed quite as directly, as immediately, as we feel them. A “farmer” is in one sense a perfectly unified concept, in another he is “one who farms.” The concept conveyed by the radical element (farm-) is not one of personality at all but of an industrial activity (to farm), itself based on the concept of a particular type of object (a farm). Similarly, the concept of duckling is at one remove from that which is expressed by the radical element of the word, duck. This element, which may occur as an independent word, refers to a whole class of animals, big and little, while duckling is limited in its application to the young of that class. The word farmer has an “agentive” suffix -er that performs the function of indicating the one that carries out a given activity, in this case that of farming. It transforms the verb to farm into an agentive noun precisely as it transforms the verbs to sing, to paint, to teach into the corresponding agentive nouns singer, painter, teacher. The element -ling is not so freely used, but its significance is obvious. It adds to the basic concept the notion of smallness (as also in gosling, fledgeling) or the somewhat related notion of “contemptible” (as in weakling, princeling, hireling). The agentive -er and the diminutive -ling both convey fairly concrete ideas (roughly those of “doer” and “little”), but the concreteness is not stressed. They do not so much define distinct concepts as mediate between concepts. The -er of farmer does not quite say “one who (farms)” it merely indicates that the sort of person we call a “farmer” is closely enough associated with activity on a farm to be conventionally thought of as always so occupied. He may, as a matter of fact, go to town and engage in any pursuit but farming, yet his linguistic label remains “farmer.” Language here betrays a certain helplessness or, if one prefers, a stubborn tendency to look away from the immediately suggested function, trusting to the imagination and to usage to fill in the transitions of thought and the details of application that distinguish one concrete concept (to farm) from another “derived” one (farmer). It would be impossible for any language to express every concrete idea by an independent word or radical element. The concreteness of experience is infinite, the resources of the richest language are strictly limited. It must perforce throw countless concepts under the rubric of certain basic ones, using other concrete or semi-concrete ideas as functional mediators. The ideas expressed by these mediating elements—they may be independent words, affixes, or modifications of the radical element—may be called “derivational” or “qualifying.” Some concrete concepts, such as kill, are expressed radically; others, such as farmer and duckling, are expressed derivatively. Corresponding to these two modes of expression we have two types of concepts and of linguistic elements, radical (farm, kill, duck) and derivational (-er, -ling). When a word (or unified group of words) contains a derivational element (or word) the concrete significance of the radical element (farm-, duck-) tends to fade from consciousness and to yield to a new concreteness (farmer, duckling) that is synthetic in expression rather than in thought. In our sentence the concepts of farm and duck are not really involved at all; they are merely latent, for formal reasons, in the linguistic expression.

But a closer look at the language soon shows us that the two topics we're discussing, no matter how simply we picture them, aren’t expressed as directly or as immediately as we feel them. A “farmer” is, in one way, a completely unified idea, but in another, it means “one who farms.” The core idea behind the root word (farm-) isn't about personality at all but rather an industrial activity (to farm), which itself is based on the idea of a specific object (a farm). Similarly, the concept of duckling is a step removed from what the root of the word duck expresses. This root, which can stand alone as a word, refers to a whole category of animals, big and small, while duckling is specifically about the young of that category. The word farmer has an “agentive” suffix -er that serves to indicate the person who performs a specific action, in this case, farming. It turns the verb to farm into an agentive noun just like it turns the verbs to sing, to paint, to teach into the agentive nouns singer, painter, teacher. The element -ling isn’t used as freely, but its meaning is clear. It adds to the original idea a sense of smallness (as in gosling, fledgling) or a somewhat related idea of being “contemptible” (as in weakling, princeling, hireling). The agentive -er and the diminutive -ling both convey fairly tangible ideas (essentially “doer” and “little”), but the concrete nature isn’t emphasized. They don't so much define separate concepts as connect them. The -er in farmer doesn’t explicitly mean “one who (farms),” it just indicates that the type of person we refer to as a “farmer” is closely associated with the activity on a farm to be conventionally seen as always doing that. In reality, he may go to town and do anything but farming, yet his label stays “farmer.” Language here shows a certain limitation or, if you prefer, a stubborn tendency to overlook the immediately suggested function, relying on imagination and usage to clarify the transitions of thought and the details that differentiate one concrete idea (to farm) from another “derived” one (farmer). It would be impossible for any language to express every specific idea with an independent word or root. The concreteness of experience is infinite, while even the richest language has limited resources. It must inevitably group countless ideas under certain basic ones, using other concrete or semi-concrete ideas as functional connectors. The ideas expressed by these connecting elements—they can be independent words, affixes, or changes to the root—can be labeled as “derivational” or “qualifying.” Some concrete ideas, like kill, are expressed directly; others, like farmer and duckling, are expressed derivatively. Corresponding to these two ways of expression, we have two types of concepts and linguistic elements: radical (farm, kill, duck) and derivational (-er, -ling). When a word (or a unified group of words) includes a derivational element (or word), the concrete meaning of the root element (farm-, duck-) tends to fade from awareness and gives way to a new concreteness (farmer, duckling) that is more synthetic in expression than in thought. In our sentence, the ideas of farm and duck aren’t actually involved at all; they are simply latent, for formal reasons, in the linguistic expression.

Returning to this sentence, we feel that the analysis of farmer and duckling are practically irrelevant to an understanding of its content and entirely irrelevant to a feeling for the structure of the sentence as a whole. From the standpoint of the sentence the derivational elements -er and -ling are merely details in the local economy of two of its terms (farmer, duckling) that it accepts as units of expression. This indifference of the sentence as such to some part of the analysis of its words is shown by the fact that if we substitute such radical words as man and chick for farmer and duckling, we obtain a new material content, it is true, but not in the least a new structural mold. We can go further and substitute another activity for that of “killing,” say “taking.” The new sentence, the man takes the chick, is totally different from the first sentence in what it conveys, not in how it conveys it. We feel instinctively, without the slightest attempt at conscious analysis, that the two sentences fit precisely the same pattern, that they are really the same fundamental sentence, differing only in their material trappings. In other words, they express identical relational concepts in an identical manner. The manner is here threefold—the use of an inherently relational word (the) in analogous positions, the analogous sequence (subject; predicate, consisting of verb and object) of the concrete terms of the sentence, and the use of the suffixed element -s in the verb.

Returning to this sentence, we believe that analyzing farmer and duckling is nearly irrelevant for understanding its content and completely irrelevant for grasping the structure of the sentence as a whole. From the perspective of the sentence, the derivational elements -er and -ling are just details in the local economy of two of its terms (farmer, duckling) that it accepts as units of expression. This indifference of the sentence itself to some parts of the analysis of its words is evident because if we swap out the words farmer and duckling for more general words like man and chick, we get a new material content, but not a new structural framework. We can even replace the action of “killing” with “taking.” The new sentence, the man takes the chick, is completely different from the original in terms of meaning, but not in terms of structure. Instinctively, without even trying to consciously analyze it, we feel that the two sentences fit exactly the same pattern, that they are fundamentally the same sentence, differing only in their material disguises. In other words, they express the same relational concepts in the same way. The manner is threefold: the use of a relational word (the) in similar positions, the similar order (subject; predicate, with a verb and object) of the concrete terms in the sentence, and the use of the suffixed element -s in the verb.

Change any of these features of the sentence and it becomes modified, slightly or seriously, in some purely relational, non-material regard. If the is omitted (farmer kills duckling, man takes chick), the sentence becomes impossible; it falls into no recognized formal pattern and the two subjects of discourse seem to hang incompletely in the void. We feel that there is no relation established between either of them and what is already in the minds of the speaker and his auditor. As soon as a the is put before the two nouns, we feel relieved. We know that the farmer and duckling which the sentence tells us about are the same farmer and duckling that we had been talking about or hearing about or thinking about some time before. If I meet a man who is not looking at and knows nothing about the farmer in question, I am likely to be stared at for my pains if I announce to him that “the farmer [what farmer?] the duckling [didn’t know he had any, whoever he is].” If the fact nevertheless seems interesting enough to communicate, I should be compelled to speak of “a farmer up my way” and of “a duckling of his.” These little words, the and a, have the important function of establishing a definite or an indefinite reference.

Change any of these features of the sentence and it becomes modified, either slightly or significantly, in some purely relational, non-material way. If the is left out (farmer kills duckling, man takes chick), the sentence becomes impossible; it doesn’t fit any recognized formal pattern and the two subjects of discussion seem to dangle incomplete in the void. We feel that there is no relation established between either of them and what is already in the minds of the speaker and the listener. As soon as a the is added before the two nouns, we feel relieved. We know that the farmer and duckling mentioned in the sentence are the same farmer and duckling that we had been talking about, hearing about, or thinking about earlier. If I meet someone who isn’t aware of or paying attention to the farmer in question, I’m likely to get confused looks if I say, “the farmer [what farmer?] the duckling [didn’t know he had any, whoever he is].” If the fact still seems interesting enough to share, I would have to refer to “a farmer in my area” and “a duckling of his.” These little words, the and a, play a crucial role in establishing definite or indefinite references.

If I omit the first the and also leave out the suffixed -s, I obtain an entirely new set of relations. Farmer, kill the duckling implies that I am now speaking to the farmer, not merely about him; further, that he is not actually killing the bird, but is being ordered by me to do so. The subjective relation of the first sentence has become a vocative one, one of address, and the activity is conceived in terms of command, not of statement. We conclude, therefore, that if the farmer is to be merely talked about, the little the must go back into its place and the -s must not be removed. The latter element clearly defines, or rather helps to define, statement as contrasted with command. I find, moreover, that if I wish to speak of several farmers, I cannot say the farmers kills the duckling, but must say the farmers kill the duckling. Evidently -s involves the notion of singularity in the subject. If the noun is singular, the verb must have a form to correspond; if the noun is plural, the verb has another, corresponding form.[54] Comparison with such forms as I kill and you kill shows, moreover, that the -s has exclusive reference to a person other than the speaker or the one spoken to. We conclude, therefore, that it connotes a personal relation as well as the notion of singularity. And comparison with a sentence like the farmer killed the duckling indicates that there is implied in this overburdened -s a distinct reference to present time. Statement as such and personal reference may well be looked upon as inherently relational concepts. Number is evidently felt by those who speak English as involving a necessary relation, otherwise there would be no reason to express the concept twice, in the noun and in the verb. Time also is clearly felt as a relational concept; if it were not, we should be allowed to say the farmer killed-s to correspond to the farmer kill-s. Of the four concepts inextricably interwoven in the -s suffix, all are felt as relational, two necessarily so. The distinction between a truly relational concept and one that is so felt and treated, though it need not be in the nature of things, will receive further attention in a moment.

If I drop the first the and also skip the suffixed -s, I end up with a completely new set of relationships. Farmer, kill the duckling means I’m now addressing the farmer directly, not just talking about him; plus, he isn’t actually killing the bird, but I’m telling him to do it. The subjective viewpoint of the first sentence has shifted to a vocative one, which is about addressing someone, and the action is understood as a command, not a statement. So, we can conclude that if I just want to talk about the farmer, the little the needs to go back in its place and the -s shouldn’t be removed. The -s clearly helps define a statement in contrast to a command. Also, if I want to refer to multiple farmers, I can’t say the farmers kills the duckling, but I must say the farmers kill the duckling. Clearly, -s carries the idea of singularity for the subject. If the noun is singular, the verb must match it; if the noun is plural, then the verb has a different corresponding form.[54] Comparing it with forms like I kill and you kill shows that the -s points exclusively to a person who is neither the speaker nor the one being spoken to. Thus, we can conclude it implies a personal relationship along with the idea of singularity. Furthermore, when comparing it to a sentence like the farmer killed the duckling, it suggests that this overloaded -s distinctly refers to the present time. Statements and personal references can be viewed as inherently relational concepts. Those who speak English clearly feel that number involves a necessary relationship; otherwise, there would be no need to express the concept in both the noun and the verb. Time is also seen as a relational concept; if it weren’t, we could say the farmer killed-s to align with the farmer kill-s. All four concepts intertwined in the -s suffix are considered relational, with two being necessarily so. The difference between a truly relational concept and one perceived that way, even if it's not inherently so, will be addressed further in a moment.

Finally, I can radically disturb the relational cut of the sentence by changing the order of its elements. If the positions of farmer and kills are interchanged, the sentence reads kills the farmer the duckling, which is most naturally interpreted as an unusual but not unintelligible mode of asking the question, does the farmer kill the duckling? In this new sentence the act is not conceived as necessarily taking place at all. It may or it may not be happening, the implication being that the speaker wishes to know the truth of the matter and that the person spoken to is expected to give him the information. The interrogative sentence possesses an entirely different “modality” from the declarative one and implies a markedly different attitude of the speaker towards his companion. An even more striking change in personal relations is effected if we interchange the farmer and the duckling. The duckling kills the farmer involves precisely the same subjects of discourse and the same type of activity as our first sentence, but the rôles of these subjects of discourse are now reversed. The duckling has turned, like the proverbial worm, or, to put it in grammatical terminology, what was “subject” is now “object,” what was object is now subject.

Finally, I can radically change the relationship of the sentence by altering the order of its elements. If I swap the positions of farmer and kills, the sentence becomes kills the farmer the duckling, which can be understood as an unusual but still clear way of asking, does the farmer kill the duckling? In this new sentence, the action isn't seen as definitely happening at all. It could be happening or it might not be, suggesting that the speaker wants to know the truth of the situation and expects the person they’re talking to for that information. The question has a completely different “modality” compared to the statement and indicates a significantly different attitude of the speaker towards their companion. An even more striking change in personal relations occurs if we switch the farmer and the duckling. The duckling kills the farmer involves the same subjects and the same type of action as our original sentence, but now the roles of these subjects have flipped. The duckling has changed, like the proverbial worm, or, to express it in grammatical terms, what was the “subject” is now the “object,” and what was the object is now the subject.

The following tabular statement analyzes the sentence from the point of view of the concepts expressed in it and of the grammatical processes employed for their expression.

The following table breaks down the sentence by looking at the concepts it conveys and the grammatical processes used to express them.

Concrete Concepts:
1. First subject of discourse: farmer
2. Second subject of discourse: duckling
3. Activity: kill
—— analyzable into:
1. Radical Concepts:
  1. Verb: (to) farm
  2. Noun: duck
  3. Verb: kill
2. Derivational Concepts:
  1. Agentive: expressed by suffix -er
  2. Diminutive: expressed by suffix -ling
Relational Concepts:
Reference:
1. Definiteness of reference to first subject of discourse: expressed by first the, which has preposed position
2. Definiteness of reference to second subject of discourse: expressed by second the, which has preposed position Mode of being:
3. Declarative: expressed by sequence of “subject” plus verb; and implied by suffixed -s Relationships:
4. Subjectivity of farmer: expressed by position of farmer before kills; and by suffixed -s
5. Objectivity of duckling: expressed by position of duckling after kills Number:
6. Singularity of first subject of discourse: expressed by lack of plural suffix in farmer; and by suffix -s in following verb
7. Singularity of second subject of discourse: expressed by lack of plural suffix in duckling Time:
8. Present: expressed by lack of preterit suffix in verb; and by suffixed -s

In this short sentence of five words there are expressed, therefore, thirteen distinct concepts, of which three are radical and concrete, two derivational, and eight relational. Perhaps the most striking result of the analysis is a renewed realization of the curious lack of accord in our language between function and form. The method of suffixing is used both for derivational and for relational elements; independent words or radical elements express both concrete ideas (objects, activities, qualities) and relational ideas (articles like the and a; words defining case relations, like of, to, for, with, by; words defining local relations, like in, on, at); the same relational concept may be expressed more than once (thus, the singularity of farmer is both negatively expressed in the noun and positively in the verb); and one element may convey a group of interwoven concepts rather than one definite concept alone (thus the -s of kills embodies no less than four logically independent relations).

In this short five-word sentence, there are thirteen distinct concepts expressed. Three of these are fundamental and concrete, two are derivational, and eight are relational. One of the most striking outcomes of the analysis is the renewed awareness of the strange inconsistency in our language between function and form. The method of using suffixes applies to both derivational and relational elements; independent words or fundamental elements convey both concrete ideas (like objects, activities, qualities) and relational ideas (articles such as the and a; words that indicate case relations, like of, to, for, with, by; words that define local relations, like in, on, at); the same relational concept can be expressed more than once (for example, the singularity of farmer is both negatively indicated in the noun and positively in the verb); and one element may express a group of interconnected concepts rather than just one specific concept (thus the -s in kills represents no less than four logically independent relations).

Our analysis may seem a bit labored, but only because we are so accustomed to our own well-worn grooves of expression that they have come to be felt as inevitable. Yet destructive analysis of the familiar is the only method of approach to an understanding of fundamentally different modes of expression. When one has learned to feel what is fortuitous or illogical or unbalanced in the structure of his own language, he is already well on the way towards a sympathetic grasp of the expression of the various classes of concepts in alien types of speech. Not everything that is “outlandish” is intrinsically illogical or far-fetched. It is often precisely the familiar that a wider perspective reveals as the curiously exceptional. From a purely logical standpoint it is obvious that there is no inherent reason why the concepts expressed in our sentence should have been singled out, treated, and grouped as they have been and not otherwise. The sentence is the outgrowth of historical and of unreasoning psychological forces rather than of a logical synthesis of elements that have been clearly grasped in their individuality. This is the case, to a greater or less degree, in all languages, though in the forms of many we find a more coherent, a more consistent, reflection than in our English forms of that unconscious analysis into individual concepts which is never entirely absent from speech, however it may be complicated with or overlaid by the more irrational factors.

Our analysis might seem a bit forced, but that's only because we're so used to our own familiar ways of expressing things that they feel unavoidable. However, taking a critical look at the familiar is the only way to understand completely different modes of expression. Once someone learns to recognize what's random, illogical, or unbalanced in their own language structure, they're already on their way to understanding how different concepts are expressed in other languages. Not everything that seems “strange” is inherently illogical or outlandish. Often, it's the familiar that a broader perspective shows to be oddly exceptional. Logically speaking, there’s no real reason why the concepts in our sentences should have been chosen, treated, and grouped the way they are instead of differently. Sentences come from historical and irrational psychological influences rather than a logical arrangement of elements that are clearly understood individually. This applies to all languages to varying degrees, though in many, we see a more coherent and consistent reflection of that unconscious breakdown into individual concepts, which is never entirely absent from speech, no matter how complicated or covered up by more irrational factors it may become.

A cursory examination of other languages, near and far, would soon show that some or all of the thirteen concepts that our sentence happens to embody may not only be expressed in different form but that they may be differently grouped among themselves; that some among them may be dispensed with; and that other concepts, not considered worth expressing in English idiom, may be treated as absolutely indispensable to the intelligible rendering of the proposition. First as to a different method of handling such concepts as we have found expressed in the English sentence. If we turn to German, we find that in the equivalent sentence (Der Bauer tötet das Entelein) the definiteness of reference expressed by the English the is unavoidably coupled with three other concepts—number (both der and das are explicitly singular), case (der is subjective; das is subjective or objective, by elimination therefore objective), and gender, a new concept of the relational order that is not in this case explicitly involved in English (der is masculine, das is neuter). Indeed, the chief burden of the expression of case, gender, and number is in the German sentence borne by the particles of reference rather than by the words that express the concrete concepts (Bauer, Entelein) to which these relational concepts ought logically to attach themselves. In the sphere of concrete concepts too it is worth noting that the German splits up the idea of “killing” into the basic concept of “dead” (tot) and the derivational one of “causing to do (or be) so and so” (by the method of vocalic change, töt-); the German töt-et (analytically tot-+vowel change+-et) “causes to be dead” is, approximately, the formal equivalent of our dead-en-s, though the idiomatic application of this latter word is different.[55]

A quick look at other languages, near and far, would quickly reveal that some or all of the thirteen concepts our sentence contains might not only be expressed differently but could also be grouped in various ways; some of them might be dropped altogether; and other concepts, which English doesn't typically express, might be seen as absolutely essential for clearly conveying the idea. First, let's discuss a different way of dealing with the concepts we find in the English sentence. If we look at German, we see that in the equivalent sentence (Der Bauer tötet das Entelein), the definiteness indicated by the English the is necessarily combined with three other concepts—number (both der and das are clearly singular), case (der is subjective; das is subjective or objective, thus objectively interpreted), and gender, a new relational concept that isn’t explicitly present in English (der is masculine, das is neuter). In fact, the main responsibility for expressing case, gender, and number in the German sentence lies with the reference particles rather than with the words representing the concrete concepts (Bauer, Entelein) that these relational concepts should logically relate to. It’s also interesting to note that German breaks down the idea of “killing” into the basic concept of “dead” (tot) and the derived concept of “causing to do (or be) so and so” (through vocalic change, töt-); the German töt-et (analytically tot-+vowel change+-et) meaning “causes to be dead” is roughly equivalent to our dead-en-s, although the idiomatic usage of the latter is different.[55]

Wandering still further afield, we may glance at the Yana method of expression. Literally translated, the equivalent Yana sentence would read something like “kill-s he farmer[56] he to duck-ling,” in which “he” and “to” are rather awkward English renderings of a general third personal pronoun (he, she, it, or they) and an objective particle which indicates that the following noun is connected with the verb otherwise than as subject. The suffixed element in “kill-s” corresponds to the English suffix with the important exceptions that it makes no reference to the number of the subject and that the statement is known to be true, that it is vouched for by the speaker. Number is only indirectly expressed in the sentence in so far as there is no specific verb suffix indicating plurality of the subject nor specific plural elements in the two nouns. Had the statement been made on another’s authority, a totally different “tense-modal” suffix would have had to be used. The pronouns of reference (“he”) imply nothing by themselves as to number, gender, or case. Gender, indeed, is completely absent in Yana as a relational category.

Wandering even further, we can take a look at the Yana way of expressing things. Literally translated, a Yana sentence would sound something like “kill-s he farmer[56] he to duck-ling,” where “he” and “to” are somewhat clumsy English versions of a general third-person pronoun (he, she, it, or they) and an objective particle that shows the following noun is related to the verb in a way other than being the subject. The added part in “kill-s” corresponds to the English suffix with key differences: it doesn’t indicate the subject's number and it confirms that the statement is true, as asserted by the speaker. Number is only indirectly shown in the sentence since there are no specific verb suffixes indicating the subject is plural or specific plural elements in the two nouns. If the statement had been made based on someone else's authority, a completely different “tense-modal” suffix would need to be used. The pronouns used (“he”) don’t imply anything about number, gender, or case on their own. Gender is actually entirely absent as a relational category in Yana.

The Yana sentence has already illustrated the point that certain of our supposedly essential concepts may be ignored; both the Yana and the German sentence illustrate the further point that certain concepts may need expression for which an English-speaking person, or rather the English-speaking habit, finds no need whatever. One could go on and give endless examples of such deviations from English form, but we shall have to content ourselves with a few more indications. In the Chinese sentence “Man kill duck,” which may be looked upon as the practical equivalent of “The man kills the duck,” there is by no means present for the Chinese consciousness that childish, halting, empty feeling which we experience in the literal English translation. The three concrete concepts—two objects and an action—are each directly expressed by a monosyllabic word which is at the same time a radical element; the two relational concepts—“subject” and “object”—are expressed solely by the position of the concrete words before and after the word of action. And that is all. Definiteness or indefiniteness of reference, number, personality as an inherent aspect of the verb, tense, not to speak of gender—all these are given no expression in the Chinese sentence, which, for all that, is a perfectly adequate communication—provided, of course, there is that context, that background of mutual understanding that is essential to the complete intelligibility of all speech. Nor does this qualification impair our argument, for in the English sentence too we leave unexpressed a large number of ideas which are either taken for granted or which have been developed or are about to be developed in the course of the conversation. Nothing has been said, for example, in the English, German, Yana, or Chinese sentence as to the place relations of the farmer, the duck, the speaker, and the listener. Are the farmer and the duck both visible or is one or the other invisible from the point of view of the speaker, and are both placed within the horizon of the speaker, the listener, or of some indefinite point of reference “off yonder”? In other words, to paraphrase awkwardly certain latent “demonstrative” ideas, does this farmer (invisible to us but standing behind a door not far away from me, you being seated yonder well out of reach) kill that duckling (which belongs to you)? or does that farmer (who lives in your neighborhood and whom we see over there) kill that duckling (that belongs to him)? This type of demonstrative elaboration is foreign to our way of thinking, but it would seem very natural, indeed unavoidable, to a Kwakiutl Indian.

The Yana sentence has already shown that some of our supposedly essential concepts might be overlooked; both the Yana and the German sentence further demonstrate that there are concepts that need to be expressed which an English speaker, or more specifically, the English-speaking mindset, sees no need for. We could continue with endless examples of such variations from English form, but we'll stick to just a few more examples. In the Chinese sentence “Man kill duck,” which can be considered the practical equivalent of “The man kills the duck,” there is definitely not present for the Chinese mind that childish, awkward, empty feeling that we get from the literal English translation. The three concrete concepts—two objects and an action—are each directly conveyed by a one-syllable word that also serves as a basic element; the two relational concepts—“subject” and “object”—are indicated simply by the placement of the concrete words before and after the action word. And that's it. Concepts of definiteness or indefiniteness, number, personality as an inherent aspect of the verb, tense, not to mention gender—all of these are not expressed in the Chinese sentence, which nonetheless is a perfectly effective communication—of course, provided there is that context, that background of mutual understanding that's crucial for complete clarity in all communication. This point does not undermine our argument, as in the English sentence we also leave many ideas unexpressed that are either assumed or have been developed or will be developed during the conversation. For instance, nothing has been mentioned in the English, German, Yana, or Chinese sentence about the location of the farmer, the duck, the speaker, and the listener. Are the farmer and the duck both visible or is one of them not visible from the speaker's perspective, and are both within the speaker’s, the listener’s, or some indefinite point of reference “over there”? In other words, to awkwardly rephrase certain implicit “demonstrative” ideas, does this farmer (invisible to us but standing behind a door not far from me, while you are seated over there well out of reach) kill that duckling (which belongs to you)? Or does that farmer (who lives in your area and whom we see over there) kill that duckling (which belongs to him)? This kind of demonstrative elaboration is not our typical way of thinking, but it would seem very natural, even necessary, to a Kwakiutl Indian.

What, then, are the absolutely essential concepts in speech, the concepts that must be expressed if language is to be a satisfactory means of communication? Clearly we must have, first of all, a large stock of basic or radical concepts, the concrete wherewithal of speech. We must have objects, actions, qualities to talk about, and these must have their corresponding symbols in independent words or in radical elements. No proposition, however abstract its intent, is humanly possible without a tying on at one or more points to the concrete world of sense. In every intelligible proposition at least two of these radical ideas must be expressed, though in exceptional cases one or even both may be understood from the context. And, secondly, such relational concepts must be expressed as moor the concrete concepts to each other and construct a definite, fundamental form of proposition. In this fundamental form there must be no doubt as to the nature of the relations that obtain between the concrete concepts. We must know what concrete concept is directly or indirectly related to what other, and how. If we wish to talk of a thing and an action, we must know if they are coördinately related to each other (e.g., “He is fond of wine and gambling”); or if the thing is conceived of as the starting point, the “doer” of the action, or, as it is customary to say, the “subject” of which the action is predicated; or if, on the contrary, it is the end point, the “object” of the action. If I wish to communicate an intelligible idea about a farmer, a duckling, and the act of killing, it is not enough to state the linguistic symbols for these concrete ideas in any order, higgledy-piggledy, trusting that the hearer may construct some kind of a relational pattern out of the general probabilities of the case. The fundamental syntactic relations must be unambiguously expressed. I can afford to be silent on the subject of time and place and number and of a host of other possible types of concepts, but I can find no way of dodging the issue as to who is doing the killing. There is no known language that can or does dodge it, any more than it succeeds in saying something without the use of symbols for the concrete concepts.

What are the essential concepts in speech that need to be conveyed for language to be an effective way to communicate? First, we need a wide range of basic or fundamental concepts—the building blocks of speech. We need objects, actions, and qualities to discuss, which should have corresponding symbols in standalone words or fundamental elements. No statement, regardless of how abstract, can be made without connecting to the tangible world we experience. In every understandable statement, at least two of these fundamental ideas must be included, although in some cases one or both may be implied from the context. Additionally, we need to express relational concepts that connect the concrete ideas to one another and create a clear, basic structure for the statement. In this basic structure, the relationships between the concrete concepts must be clear. We need to understand how one concept is directly or indirectly related to another. For example, if we want to discuss a thing and an action, we must know whether they are equally related (e.g., “He enjoys wine and gambling”), or if one is seen as the starting point, the “doer” of the action (the “subject” of the action), or if, on the other hand, it is the endpoint, the “object” of the action. If I want to express a clear idea about a farmer, a duckling, and the act of killing, simply using the words for these concrete ideas in any random order isn’t enough, hoping that the listener can figure out some kind of relationship from the general context. The core syntactic relationships must be clearly defined. I can skip discussing time, place, number, and other types of concepts, but there’s no way to avoid clarifying who is doing the killing. No known language avoids this, just as none can convey meaning without using symbols for the concrete concepts.

We are thus once more reminded of the distinction between essential or unavoidable relational concepts and the dispensable type. The former are universally expressed, the latter are but sparsely developed in some languages, elaborated with a bewildering exuberance in others. But what prevents us from throwing in these “dispensable” or “secondary” relational concepts with the large, floating group of derivational, qualifying concepts that we have already discussed? Is there, after all is said and done, a fundamental difference between a qualifying concept like the negative in unhealthy and a relational one like the number concept in books? If unhealthy may be roughly paraphrased as not healthy, may not books be just as legitimately paraphrased, barring the violence to English idiom, as several book? There are, indeed, languages in which the plural, if expressed at all, is conceived of in the same sober, restricted, one might almost say casual, spirit in which we feel the negative in unhealthy. For such languages the number concept has no syntactic significance whatever, is not essentially conceived of as defining a relation, but falls into the group of derivational or even of basic concepts. In English, however, as in French, German, Latin, Greek—indeed in all the languages that we have most familiarity with—the idea of number is not merely appended to a given concept of a thing. It may have something of this merely qualifying value, but its force extends far beyond. It infects much else in the sentence, molding other concepts, even such as have no intelligible relation to number, into forms that are said to correspond to or “agree with” the basic concept to which it is attached in the first instance. If “a man falls” but “men fall” in English, it is not because of any inherent change that has taken place in the nature of the action or because the idea of plurality inherent in “men” must, in the very nature of ideas, relate itself also to the action performed by these men. What we are doing in these sentences is what most languages, in greater or less degree and in a hundred varying ways, are in the habit of doing—throwing a bold bridge between the two basically distinct types of concept, the concrete and the abstractly relational, infecting the latter, as it were, with the color and grossness of the former. By a certain violence of metaphor the material concept is forced to do duty for (or intertwine itself with) the strictly relational.

We are once again reminded of the difference between essential or unavoidable relational concepts and those that are optional. The former are expressed universally, while the latter are only slightly developed in some languages and elaborated with bewildering flair in others. But what stops us from grouping these “optional” or “secondary” relational concepts with the larger collection of derivational, qualifying concepts we've already discussed? Is there really a fundamental difference between a qualifying concept like the negative in unhealthy and a relational one like the number concept in books? If unhealthy can be roughly rephrased as not healthy, can books not just as legitimately be paraphrased, without straying from English idiom, as several book? Indeed, there are languages where the plural, if it exists, is understood in the same sober, limited, one might almost say casual manner as we understand the negative in unhealthy. In such languages, the number concept holds no syntactic significance, is not fundamentally seen as defining a relation, but falls into the group of derivational or even basic concepts. In English, however, as in French, German, Latin, Greek—indeed in all the languages we are most familiar with—the idea of number isn’t simply added to a concept of a thing. It may serve some merely qualifying function, but its impact goes far beyond that. It affects much else in the sentence, shaping other concepts, even those that have no clear relation to number, into forms that are said to correspond to or “agree with” the basic concept it initially attaches to. If “a man falls” but “men fall” in English, it isn't due to any inherent change in the nature of the action or because the idea of plurality in “men” must naturally relate to the action performed by these men. What we are doing in these sentences is what most languages, to varying degrees and in a hundred different ways, typically do—creating a bold connection between the two fundamentally distinct types of concept, the concrete and the abstractly relational, as it were coloring the latter with the qualities and substance of the former. Through a certain metaphorical force, the material concept is compelled to represent (or intertwine with) the strictly relational.

The case is even more obvious if we take gender as our text. In the two English phrases, “The white woman that comes” and “The white men that come,” we are not reminded that gender, as well as number, may be elevated into a secondary relational concept. It would seem a little far-fetched to make of masculinity and femininity, crassly material, philosophically accidental concepts that they are, a means of relating quality and person, person and action, nor would it easily occur to us, if we had not studied the classics, that it was anything but absurd to inject into two such highly attenuated relational concepts as are expressed by “the” and “that” the combined notions of number and sex. Yet all this, and more, happens in Latin. Illa alba femina quae venit and illi albi homines qui veniunt, conceptually translated, amount to this: that-one-feminine-doer[57] one-feminine-white-doer feminine-doing-one-woman which-one-feminine-doer other[58]-one-now-come; and: that-several-masculine-doer several-masculine-white-doer masculine-doing-several-man which-several-masculine-doer other-several-now-come. Each word involves no less than four concepts, a radical concept (either properly concrete—white, man, woman, come—or demonstrative—that, which) and three relational concepts, selected from the categories of case, number, gender, person, and tense. Logically, only case[59] (the relation of woman or men to a following verb, of which to its antecedent, of that and white to woman or men, and of which to come) imperatively demands expression, and that only in connection with the concepts directly affected (there is, for instance, no need to be informed that the whiteness is a doing or doer’s whiteness[60]). The other relational concepts are either merely parasitic (gender throughout; number in the demonstrative, the adjective, the relative, and the verb) or irrelevant to the essential syntactic form of the sentence (number in the noun; person; tense). An intelligent and sensitive Chinaman, accustomed as he is to cut to the very bone of linguistic form, might well say of the Latin sentence, “How pedantically imaginative!” It must be difficult for him, when first confronted by the illogical complexities of our European languages, to feel at home in an attitude that so largely confounds the subject-matter of speech with its formal pattern or, to be more accurate, that turns certain fundamentally concrete concepts to such attenuated relational uses.

The case is even clearer if we focus on gender. In the two English phrases, “The white woman that comes” and “The white men that come,” we don’t consider that gender, like number, can become a secondary relational concept. It might seem a bit far-fetched to treat masculinity and femininity—material yet philosophically trivial—as a way to connect quality with person, and person with action. Nor would it easily occur to us, if we hadn’t studied the classics, that it was anything but absurd to inject the combined ideas of number and gender into such abstract relational concepts expressed by “the” and “that.” Yet all of this, and more, happens in Latin. Illa alba femina quae venit and illi albi homines qui veniunt, when conceptually translated, amount to this: that-one-feminine-doer[57] one-feminine-white-doer feminine-doing-one-woman which-one-feminine-doer other[58]-one-now-come; and: that-several-masculine-doer several-masculine-white-doer masculine-doing-several-man which-several-masculine-doer other-several-now-come. Each word involves no less than four concepts: a basic concept (either properly concrete—white, man, woman, come—or demonstrative—that, which) and three relational concepts, chosen from case, number, gender, person, and tense. Logically, only case[59] (the relationship of woman or men to a following verb, of which to its antecedent, of that and white to woman or men, and of which to come) must be expressed, and only in relation to the concepts directly affected (there's no need to establish that whiteness is a characteristic of the doing or doer[60]). The other relational concepts are either just parasitic (gender throughout; number in the demonstrative, the adjective, the relative pronoun, and the verb) or irrelevant to the essential structure of the sentence (number in the noun; person; tense). A perceptive and thoughtful Chinese person, used to getting to the heart of linguistic structure, might very well say of the Latin sentence, “How pedantically creative!” It must be challenging for him, when first facing the illogical complexities of our European languages, to feel comfortable with an approach that confuses the subject-matter of speech with its formal structure or, to be more precise, that uses certain fundamentally concrete concepts for such abstract relational purposes.

I have exaggerated somewhat the concreteness of our subsidiary or rather non-syntactical relational concepts In order that the essential facts might come out in bold relief. It goes without saying that a Frenchman has no clear sex notion in his mind when he speaks of un arbre (“a-masculine tree”) or of une pomme (“a-feminine apple”). Nor have we, despite the grammarians, a very vivid sense of the present as contrasted with all past and all future time when we say He comes.[61] This is evident from our use of the present to indicate both future time (“He comes to-morrow”) and general activity unspecified as to time (“Whenever he comes, I am glad to see him,” where “comes” refers to past occurrences and possible future ones rather than to present activity). In both the French and English instances the primary ideas of sex and time have become diluted by form-analogy and by extensions into the relational sphere, the concepts ostensibly indicated being now so vaguely delimited that it is rather the tyranny of usage than the need of their concrete expression that sways us in the selection of this or that form. If the thinning-out process continues long enough, we may eventually be left with a system of forms on our hands from which all the color of life has vanished and which merely persist by inertia, duplicating each other’s secondary, syntactic functions with endless prodigality. Hence, in part, the complex conjugational systems of so many languages, in which differences of form are attended by no assignable differences of function. There must have been a time, for instance, though it antedates our earliest documentary evidence, when the type of tense formation represented by drove or sank differed in meaning, in however slightly nuanced a degree, from the type (killed, worked) which has now become established in English as the prevailing preterit formation, very much as we recognize a valuable distinction at present between both these types and the “perfect” (has driven, has killed) but may have ceased to do so at some point in the future.[62] Now form lives longer than its own conceptual content. Both are ceaselessly changing, but, on the whole, the form tends to linger on when the spirit has flown or changed its being. Irrational form, form for form’s sake—however we term this tendency to hold on to formal distinctions once they have come to be—is as natural to the life of language as is the retention of modes of conduct that have long outlived the meaning they once had.

I have somewhat exaggerated the clarity of our subsidiary or rather non-syntactical relational concepts so that the key facts might stand out more clearly. It’s obvious that a French speaker doesn’t have a clear concept of gender in mind when they say un arbre (“a masculine tree”) or une pomme (“a feminine apple”). Likewise, we don’t really have a strong sense of the present compared to all past and future time when we say He comes.[61] This is clear from how we use the present tense to indicate future time (“He comes tomorrow”) and general actions not restricted to a specific time (“Whenever he comes, I’m glad to see him,” where “comes” refers to past events and possible future ones rather than what’s happening right now). In both the French and English examples, the basic ideas of gender and time have been diluted by similarities in form and extensions into the relational realm, with the concepts they’re supposed to indicate being so vaguely defined that it’s more about the habits of use than the need for precise expression that guide our choice of one form over another. If this trend continues long enough, we might end up with a system of forms from which all vividness has disappeared, persisting only out of habit and replicating each other’s secondary, syntactic functions with endless abundance. This partly explains the complex conjugation systems of many languages, where differences in form show no clear differences in function. There must have been a time—though it predates our earliest records—when the tense formation types represented by drove or sank had different meanings, even if slightly nuanced, from the type (killed, worked) that has now become the common past tense in English, similar to how we currently see a valuable distinction between these types and the “perfect” (has driven, has killed), but might not perceive it the same way in the future.[62] Now, form outlasts its conceptual content. Both are continuously changing, but generally, the form tends to remain even when the meaning has shifted or disappeared. Irrational form, form for form’s sake—whatever we call this tendency to cling to formal distinctions once they exist—is as natural to the evolution of language as keeping behaviors that have long outlasted their original significance.

There is another powerful tendency which makes for a formal elaboration that does not strictly correspond to clear-cut conceptual differences. This is the tendency to construct schemes of classification into which all the concepts of language must be fitted. Once we have made up our minds that all things are either definitely good or bad or definitely black or white, it is difficult to get into the frame of mind that recognizes that any particular thing may be both good and bad (in other words, indifferent) or both black and white (in other words, gray), still more difficult to realize that the good-bad or black-white categories may not apply at all. Language is in many respects as unreasonable and stubborn about its classifications as is such a mind. It must have its perfectly exclusive pigeon-holes and will tolerate no flying vagrants. Any concept that asks for expression must submit to the classificatory rules of the game, just as there are statistical surveys in which even the most convinced atheist must perforce be labeled Catholic, Protestant, or Jew or get no hearing. In English we have made up our minds that all action must be conceived of in reference to three standard times. If, therefore, we desire to state a proposition that is as true to-morrow as it was yesterday, we have to pretend that the present moment may be elongated fore and aft so as to take in all eternity.[63] In French we know once for all that an object is masculine or feminine, whether it be living or not; just as in many American and East Asiatic languages it must be understood to belong to a certain form-category (say, ring-round, ball-round, long and slender, cylindrical, sheet-like, in mass like sugar) before it can be enumerated (e.g., “two ball-class potatoes,” “three sheet-class carpets”) or even said to “be” or “be handled in a definite way” (thus, in the Athabaskan languages and in Yana, “to carry” or “throw” a pebble is quite another thing than to carry or throw a log, linguistically no less than in terms of muscular experience). Such instances might be multiplied at will. It is almost as though at some period in the past the unconscious mind of the race had made a hasty inventory of experience, committed itself to a premature classification that allowed of no revision, and saddled the inheritors of its language with a science that they no longer quite believed in nor had the strength to overthrow. Dogma, rigidly prescribed by tradition, stiffens into formalism. Linguistic categories make up a system of surviving dogma—dogma of the unconscious. They are often but half real as concepts; their life tends ever to languish away into form for form’s sake.

There is another strong tendency that leads to a formal elaboration that doesn’t really match clear-cut conceptual differences. This is the tendency to create classification schemes that all language concepts must fit into. Once we decide that everything is either definitely good or bad, or definitely black or white, it becomes hard to accept that something can be both good and bad (in other words, indifferent) or both black and white (in other words, gray). It is even harder to recognize that the good-bad or black-white categories may not apply at all. Language is often as unreasonable and stubborn about its classifications as the mindset that created them. It requires perfectly exclusive categories and won’t tolerate anything outside of them. Any concept that wants to be expressed must follow the classification rules of the game, just like in statistical surveys where even the most committed atheist has to be labeled Catholic, Protestant, or Jew to be heard. In English, we’ve decided that all action must be considered in relation to three standard times. So, if we want to state something true tomorrow as it was yesterday, we have to act as if the present moment can stretch backwards and forwards to encompass all eternity.[63] In French, we know that an object is either masculine or feminine, regardless of whether it is living or not; just as in many American and East Asian languages, objects need to be classified into specific forms (like ring-shaped, ball-shaped, long and thin, cylindrical, sheet-like, or mass like sugar) before they can be counted (e.g., “two ball-shaped potatoes,” “three sheet-shaped carpets”) or even described as “being” or “being handled in a specific way” (for instance, in Athabaskan languages and Yana, “to carry” or “throw” a pebble is completely different from carrying or throwing a log, both linguistically and in physical experience). Such examples could go on indefinitely. It’s almost like, at some point in the past, the collective unconscious of humanity did a rushed inventory of experience, committed itself to a premature classification that couldn’t be revised, and left future generations with a framework they no longer fully believed in and lacked the power to change. Dogma, strictly enforced by tradition, hardens into formalism. Linguistic categories form a system of surviving dogma—dogma of the unconscious. They often exist as only half-real concepts; their vitality tends to fade away into mere form for the sake of form.

There is still a third cause for the rise of this non-significant form, or rather of non-significant differences of form. This is the mechanical operation of phonetic processes, which may bring about formal distinctions that have not and never had a corresponding functional distinction. Much of the irregularity and general formal complexity of our declensional and conjugational systems is due to this process. The plural of hat is hats, the plural of self is selves. In the former case we have a true -s symbolizing plurality, in the latter a z-sound coupled with a change in the radical element of the word of f to v. Here we have not a falling together of forms that originally stood for fairly distinct concepts—as we saw was presumably the case with such parallel forms as drove and worked—but a merely mechanical manifolding of the same formal element without a corresponding growth of a new concept. This type of form development, therefore, while of the greatest interest for the general history of language, does not directly concern us now in our effort to understand the nature of grammatical concepts and their tendency to degenerate into purely formal counters.

There is also a third reason for the emergence of this non-significant form, or rather, non-significant differences in form. This is the mechanical operation of phonetic processes, which can create formal distinctions that do not and never did correspond to any functional distinction. A lot of the irregularity and overall complexity in our declension and conjugation systems comes from this process. The plural of hat is hats, while the plural of self is selves. In the first case, we have a true -s indicating plurality, whereas in the second, there's a z-sound combined with a change in the root element of the word from f to v. Here, we don't see a merging of forms that originally represented fairly distinct concepts—as we noted with parallel forms like drove and worked—but rather a purely mechanical duplication of the same formal element without a corresponding development of a new concept. Thus, while this kind of form development is very interesting for the overall history of language, it doesn't directly relate to our current effort to comprehend the nature of grammatical concepts and their tendency to deteriorate into purely formal indicators.

We may now conveniently revise our first classification of concepts as expressed in language and suggest the following scheme:

We can now easily update our initial classification of concepts as expressed in language and propose the following scheme:

Basic (Concrete) Concepts (such as objects, actions, qualities): normally expressed by independent words or radical elements; involve no relation as such[64]
Derivational Concepts (less concrete, as a rule, than I, more so than III): normally expressed by affixing non-radical elements to radical elements or by inner modification of these; differ from type I in defining ideas that are irrelevant to the proposition as a whole but that give a radical element a particular increment of significance and that are thus inherently related in a specific way to concepts of type I[65]
Concrete Relational Concepts (still more abstract, yet not entirely devoid of a measure of concreteness): normally expressed by affixing non-radical elements to radical elements, but generally at a greater remove from these than is the case with elements of type II, or by inner modification of radical elements; differ fundamentally from type II in indicating or implying relations that transcend the particular word to which they are immediately attached, thus leading over to
Pure Relational Concepts (purely abstract): normally expressed by affixing non-radical elements to radical elements (in which case these concepts are frequently intertwined with those of type III) or by their inner modification, by independent words, or by position; serve to relate the concrete elements of the proposition to each other, thus giving it definite syntactic form.

The nature of these four classes of concepts as regards their concreteness or their power to express syntactic relations may be thus symbolized:

The essence of these four classes of concepts concerning their concreteness or their ability to convey syntactic relations can be represented as follows:

Material Content	{	I.	Basic Concepts
Material Content	{	II.	Derivational Concepts
Relation	{	III.	Concrete Relational Concepts
Relation	{	IV.	Pure Relational Concepts

These schemes must not be worshipped as fetiches. In the actual work of analysis difficult problems frequently arise and we may well be in doubt as to how to group a given set of concepts. This is particularly apt to be the case in exotic languages, where we may be quite sure of the analysis of the words in a sentence and yet not succeed in acquiring that inner “feel” of its structure that enables us to tell infallibly what is “material content” and what is “relation.” Concepts of class I are essential to all speech, also concepts of class IV. Concepts II and III are both common, but not essential; particularly group III, which represents, in effect, a psychological and formal confusion of types II and IV or of types I and IV, is an avoidable class of concepts. Logically there is an impassable gulf between I and IV, but the illogical, metaphorical genius of speech has wilfully spanned the gulf and set up a continuous gamut of concepts and forms that leads imperceptibly from the crudest of materialities (“house” or “John Smith”) to the most subtle of relations. It is particularly significant that the unanalyzable independent word belongs in most cases to either group I or group IV, rather less commonly to II or III. It is possible for a concrete concept, represented by a simple word, to lose its material significance entirely and pass over directly into the relational sphere without at the same time losing its independence as a word. This happens, for instance, in Chinese and Cambodgian when the verb “give” is used in an abstract sense as a mere symbol of the “indirect objective” relation (e.g., Cambodgian “We make story this give all that person who have child,” i.e., “We have made this story for all those that have children”).

These schemes shouldn't be treated like idols. In the actual work of analysis, we often encounter tough problems and might feel uncertain about how to organize a specific set of concepts. This is especially true in complex languages, where we might clearly analyze the words in a sentence but still struggle to grasp the underlying "feel" of its structure that helps us reliably distinguish between "material content" and "relation." Concepts of class I are essential to all speech, as are concepts of class IV. Concepts II and III are both common but not crucial; particularly group III, which essentially mixes types II and IV or types I and IV, is an avoidable class of concepts. Logically, there’s a significant divide between I and IV, but the illogical, metaphorical nature of speech has intentionally bridged that divide and created a seamless range of concepts and forms that quietly transitions from the most basic material ideas ("house" or "John Smith") to the most nuanced relations. It’s particularly noteworthy that the unanalyzable independent word mostly belongs to either group I or group IV, and is less frequently found in II or III. It’s possible for a concrete concept, expressed by a simple word, to completely lose its material significance and shift directly into the relational realm without losing its independent status as a word. This occurs, for example, in Chinese and Cambodian when the verb "give" is used in an abstract sense as a mere symbol of the "indirect objective" relation (e.g., in Cambodian, "We make story this give all that person who have child," meaning "We have made this story for all those that have children").

There are, of course, also not a few instances of transitions between groups I and II and I and III, as well as of the less radical one between II and III. To the first of these transitions belongs that whole class of examples in which the independent word, after passing through the preliminary stage of functioning as the secondary or qualifying element in a compound, ends up by being a derivational affix pure and simple, yet without losing the memory of its former independence. Such an element and concept is the full of teaspoonfull, which hovers psychologically between the status of an independent, radical concept (compare full) or of a subsidiary element in a compound (cf. brim-full) and that of a simple suffix (cf. dutiful) in which the primary concreteness is no longer felt. In general, the more highly synthetic our linguistic type, the more difficult and even arbitrary it becomes to distinguish groups I and II.

There are, of course, also quite a few instances of transitions between groups I and II and I and III, as well as the less radical one between II and III. The first of these transitions includes that entire class of examples where the independent word, after going through the initial stage of functioning as the secondary or qualifying element in a compound, ultimately becomes a derivational affix on its own, yet still retains a memory of its previous independence. An example of this concept is the full of teaspoonful, which psychologically hovers between being an independent, radical concept (see full) or a subsidiary element in a compound (see brim-full) and that of a simple suffix (see dutiful) in which the primary meaning is no longer felt. In general, the more synthetic our linguistic type is, the harder and even more arbitrary it becomes to distinguish groups I and II.

Not only is there a gradual loss of the concrete as we pass through from group I to group IV, there is also a constant fading away of the feeling of sensible reality within the main groups of linguistic concepts themselves. In many languages it becomes almost imperative, therefore, to make various sub-classifications, to segregate, for instance, the more concrete from the more abstract concepts of group II. Yet we must always beware of reading into such abstracter groups that purely formal, relational feeling that we can hardly help associating with certain of the abstracter concepts which, with us, fall in group III, unless, indeed, there is clear evidence to warrant such a reading in. An example or two should make clear these all-important distinctions.[66] In Nootka we have an unusually large number of derivational affixes (expressing concepts of group II). Some of these are quite material in content (e.g., “in the house,” “to dream of”), others, like an element denoting plurality and a diminutive affix, are far more abstract in content. The former type are more closely welded with the radical element than the latter, which can only be suffixed to formations that have the value of complete words. If, therefore, I wish to say “the small fires in the house”—and I can do this in one word—I must form the word “fire-in-the-house,” to which elements corresponding to “small,” our plural, and “the” are appended. The element indicating the definiteness of reference that is implied in our “the” comes at the very end of the word. So far, so good. “Fire-in-the-house-the” is an intelligible correlate of our “the house-fire.”[67] But is the Nootka correlate of “the small fires in the house” the true equivalent of an English “the house-firelets”?[68] By no means. First of all, the plural element precedes the diminutive in Nootka: “fire-in-the-house-plural-small-the,” in other words “the house-fires-let,” which at once reveals the important fact that the plural concept is not as abstractly, as relationally, felt as in English. A more adequate rendering would be “the house-fire-several-let,” in which, however, “several” is too gross a word, “-let” too choice an element (“small” again is too gross). In truth we cannot carry over into English the inherent feeling of the Nootka word, which seems to hover somewhere between “the house-firelets” and “the house-fire-several-small.” But what more than anything else cuts off all possibility of comparison between the English -s of “house-firelets” and the “-several-small” of the Nootka word is this, that in Nootka neither the plural nor the diminutive affix corresponds or refers to anything else in the sentence. In English “the house-firelets burn” (not “burns”), in Nootka neither verb, nor adjective, nor anything else in the proposition is in the least concerned with the plurality or the diminutiveness of the fire. Hence, while Nootka recognizes a cleavage between concrete and less concrete concepts within group II, the less concrete do not transcend the group and lead us into that abstracter air into which our plural -s carries us. But at any rate, the reader may object, it is something that the Nootka plural affix is set apart from the concreter group of affixes; and may not the Nootka diminutive have a slenderer, a more elusive content than our -let or -ling or the German -chen or -lein?[69]

Not only is there a gradual loss of the concrete as we move from group I to group IV, but there's also a constant fading of the sense of tangible reality within the main groups of linguistic concepts themselves. In many languages, it almost becomes necessary to create various sub-classifications, separating, for example, the more concrete concepts from the more abstract ones in group II. However, we should always be cautious about imposing a purely formal, relational feeling onto these more abstract groups, which we tend to associate with certain concepts in group III, unless there's clear evidence to support such a reading. A few examples should clarify these crucial distinctions.[66] In Nootka, we have an unusually large number of derivational affixes (expressing concepts from group II). Some of these are quite concrete in meaning (e.g., “in the house,” “to dream of”), while others, like the plural marker and the diminutive affix, are much more abstract. The former type is more closely linked with the root element than the latter, which can only attach to formations that function as complete words. Therefore, if I want to say “the small fires in the house” — which I can express in one word — I must create the term “fire-in-the-house,” to which elements corresponding to “small,” our plural, and “the” are added. The element indicating the definiteness that corresponds with our “the” comes at the very end of the word. So far, so good. “Fire-in-the-house-the” is a clear counterpart to our “the house-fire.”[67] But is the Nootka counterpart of “the small fires in the house” truly equivalent to the English “the house-firelets”?[68] Not at all. First of all, the plural element comes before the diminutive in Nootka: “fire-in-the-house-plural-small-the,” or in other words “the house-fires-let,” which immediately indicates that the concept of plurality is not felt as abstractly or relationally as it is in English. A more accurate translation would be “the house-fire-several-let,” but here “several” feels too broad, “-let” is too refined (“small” is again too broad). In truth, we can't fully convey the inherent feeling of the Nootka word in English, which seems to fall somewhere between “the house-firelets” and “the house-fire-several-small.” But what primarily prevents any comparison between the English -s in “house-firelets” and the “-several-small” in the Nootka word is that in Nootka, neither the plural nor the diminutive affix has any connection to other elements in the sentence. In English, “the house-firelets burn” (not “burns”), while in Nootka, neither the verb, nor the adjective, nor anything else in the statement addresses the plurality or diminutiveness of the fire. Thus, while Nootka recognizes a distinction between concrete and less concrete concepts within group II, the less concrete do not exceed the group and lead us into the more abstract space our plural -s takes us. However, the reader may object that it’s significant that the Nootka plural affix is distinguishable from the more concrete group of affixes; and might the Nootka diminutive have a subtler, more elusive meaning than our -let or -ling or the German -chen or -lein?[69]

Can such a concept as that of plurality ever be classified with the more material concepts of group II? Indeed it can be. In Yana the third person of the verb makes no formal distinction between singular and plural. Nevertheless the plural concept can be, and nearly always is, expressed by the suffixing of an element (-ba-) to the radical element of the verb. “It burns in the east” is rendered by the verb ya-hau-si “burn-east-s.”[70] “They burn in the east” is ya-ba-hau-si. Note that the plural affix immediately follows the radical element (ya-), disconnecting it from the local element (-hau-). It needs no labored argument to prove that the concept of plurality is here hardly less concrete than that of location “in the east,” and that the Yana form corresponds in feeling not so much to our “They burn in the east” (ardunt oriente) as to a “Burn-several-east-s, it plurally burns in the east,” an expression which we cannot adequately assimilate for lack of the necessary form-grooves into which to run it.

Can a concept like plurality ever be classified with the more tangible concepts of group II? Absolutely. In Yana, the third person of the verb doesn't formally differentiate between singular and plural. However, the plural concept can be, and usually is, expressed by adding an element (-ba-) to the root of the verb. “It burns in the east” is expressed as ya-hau-si “burn-east-s.”[70] “They burn in the east” becomes ya-ba-hau-si. Notice that the plural affix immediately follows the root element (ya-), separating it from the local element (-hau-). It doesn't take much effort to show that the concept of plurality here is just as concrete as the idea of location “in the east,” and that the Yana form corresponds more in feeling to “Burn-several-east-s, it plurally burns in the east,” an expression which we can't fully understand due to the absence of the necessary structural forms to accommodate it.

But can we go a step farther and dispose of the category of plurality as an utterly material idea, one that would make of “books” a “plural book,” in which the “plural,” like the “white” of “white book,” falls contentedly into group I? Our “many books” and “several books” are obviously not cases in point. Even if we could say “many book” and “several book” (as we can say “many a book” and “each book”), the plural concept would still not emerge as clearly as it should for our argument; “many” and “several” are contaminated by certain notions of quantity or scale that are not essential to the idea of plurality itself. We must turn to central and eastern Asia for the type of expression we are seeking. In Tibetan, for instance, nga-s mi mthong[71] “I-by man see, by me a man is seen, I see a man” may just as well be understood to mean “I see men,” if there happens to be no reason to emphasize the fact of plurality.[72] If the fact is worth expressing, however, I can say nga-s mi rnams mthong “by me man plural see,” where rnams is the perfect conceptual analogue of -s in books, divested of all relational strings. Rnams follows its noun as would any other attributive word—“man plural” (whether two or a million) like “man white.” No need to bother about his plurality any more than about his whiteness unless we insist on the point.

But can we take it a step further and eliminate the idea of plurality as just a material concept, one that would turn “books” into a “plural book,” where the “plural,” like the “white” in “white book,” easily fits into group I? Our “many books” and “several books” clearly don’t fit the bill. Even if we could say “many book” and “several book” (just like we can say “many a book” and “each book”), the plural idea still wouldn't come through as clearly as it should for our argument; “many” and “several” are mixed with certain ideas of quantity or scale that aren't essential to the concept of plurality itself. We should look to Central and Eastern Asia for the type of expression we’re after. In Tibetan, for example, nga-s mi mthong[71] “I-by man see, by me a man is seen, I see a man” can just as easily be interpreted as “I see men,” if there’s no need to highlight that it’s plural.[72] However, if we want to express the fact, I can say nga-s mi rnams mthong “by me man plural see,” where rnams is a perfect conceptual equivalent of -s in books, stripped of all relational ties. Rnams follows its noun like any other descriptive word—“man plural” (whether two or a million) like “man white.” No need to worry about his plurality any more than about his whiteness unless we specifically want to point it out.

What is true of the idea of plurality is naturally just as true of a great many other concepts. They do not necessarily belong where we who speak English are in the habit of putting them. They may be shifted towards I or towards IV, the two poles of linguistic expression. Nor dare we look down on the Nootka Indian and the Tibetan for their material attitude towards a concept which to us is abstract and relational, lest we invite the reproaches of the Frenchman who feels a subtlety of relation in femme blanche and homme blanc that he misses in the coarser-grained white woman and white man. But the Bantu Negro, were he a philosopher, might go further and find it strange that we put in group II a category, the diminutive, which he strongly feels to belong to group III and which he uses, along with a number of other classificatory concepts,[73] to relate his subjects and objects, attributes and predicates, as a Russian or a German handles his genders and, if possible, with an even greater finesse.

What’s true about the idea of plurality is also true for many other concepts. They don’t necessarily belong where we English speakers usually place them. They can be shifted towards I or towards IV, the two ends of linguistic expression. We shouldn’t look down on the Nootka Indian or the Tibetan for their material perspective on a concept that seems abstract and relational to us, or we risk facing the critiques of the Frenchman who perceives a subtlety in femme blanche and homme blanc that he doesn’t find in the simpler white woman and white man. But the Bantu Negro, if he were a philosopher, might go further and find it odd that we categorize the diminutive in group II when he deeply feels it belongs in group III, which he uses along with several other classificatory concepts,[73] to relate his subjects and objects, attributes and predicates, similar to how a Russian or a German navigates their genders, and possibly with even greater finesse.

It is because our conceptual scheme is a sliding scale rather than a philosophical analysis of experience that we cannot say in advance just where to put a given concept. We must dispense, in other words, with a well-ordered classification of categories. What boots it to put tense and mode here or number there when the next language one handles puts tense a peg “lower down” (towards I), mode and number a peg “higher up” (towards IV)? Nor is there much to be gained in a summary work of this kind from a general inventory of the types of concepts generally found in groups II, III, and IV. There are too many possibilities. It would be interesting to show what are the most typical noun-forming and verb-forming elements of group II; how variously nouns may be classified (by gender; personal and non-personal; animate and inanimate; by form; common and proper); how the concept of number is elaborated (singular and plural; singular, dual, and plural; singular, dual, trial, and plural; single, distributive, and collective); what tense distinctions may be made in verb or noun (the “past,” for instance, may be an indefinite past, immediate, remote, mythical, completed, prior); how delicately certain languages have developed the idea of “aspect”[74] (momentaneous, durative, continuative, inceptive, cessative, durative-inceptive, iterative, momentaneous-iterative, durative-iterative, resultative, and still others); what modalities may be recognized (indicative, imperative, potential, dubitative, optative, negative, and a host of others[75]); what distinctions of person are possible (is “we,” for instance, conceived of as a plurality of “I” or is it as distinct from “I” as either is from “you” or “he”?—both attitudes are illustrated in language; moreover, does “we” include you to whom I speak or not?—“inclusive” and “exclusive” forms); what may be the general scheme of orientation, the so-called demonstrative categories (“this” and “that” in an endless procession of nuances);[76] how frequently the form expresses the source or nature of the speaker’s knowledge (known by actual experience, by hearsay,[77] by inference); how the syntactic relations may be expressed in the noun (subjective and objective; agentive, instrumental, and person affected;[78] various types of “genitive” and indirect relations) and, correspondingly, in the verb (active and passive; active and static; transitive and intransitive; impersonal, reflexive, reciprocal, indefinite as to object, and many other special limitations on the starting-point and end-point of the flow of activity). These details, important as many of them are to an understanding of the “inner form” of language, yield in general significance to the more radical group-distinctions that we have set up. It is enough for the general reader to feel that language struggles towards two poles of linguistic expression—material content and relation—and that these poles tend to be connected by a long series of transitional concepts.

It’s because our way of understanding is more of a range than a detailed analysis of experiences that we can’t determine in advance exactly where to place a specific concept. In other words, we have to move away from a neatly organized classification of categories. What’s the point of putting tense and mood here or number there when the next language you tackle places tense a step “lower” (towards I) and mood and number a step “higher” (towards IV)? Also, there isn’t much benefit in a summary like this from a general listing of the types of concepts usually found in groups II, III, and IV. There are just too many options. It would be interesting to show what the most typical noun-forming and verb-forming elements of group II are; how nouns can be classified in various ways (by gender; personal and non-personal; animate and inanimate; by form; common and proper); how the idea of number is elaborated (singular and plural; singular, dual, and plural; singular, dual, trial, and plural; single, distributive, and collective); what tense distinctions can be made in verbs or nouns (for instance, “past” can refer to an indefinite past, immediate, remote, mythical, completed, prior); how delicately certain languages have developed the idea of “aspect”[74] (momentaneous, durative, continuative, inceptive, cessative, durative-inceptive, iterative, momentaneous-iterative, durative-iterative, resultative, and more); what modalities can be recognized (indicative, imperative, potential, dubitative, optative, negative, and a lot more[75]); what distinctions of person are possible (is “we,” for example, viewed as a group of “I”s or is it as distinct from “I” as either is from “you” or “he”?—both views are represented in language; additionally, does “we” include you, the person I’m speaking to, or not?—“inclusive” and “exclusive” forms); what might be the overall orientation framework, the so-called demonstrative categories (“this” and “that” in a continuous flow of nuances);[76] how often the form indicates the source or nature of the speaker’s knowledge (known from actual experience, from hearsay,[77] by inference); how syntactic relationships can be expressed in the noun (subjective and objective; agentive, instrumental, and person affected;[78] various types of “genitive” and indirect relations) and, similarly, in the verb (active and passive; active and static; transitive and intransitive; impersonal, reflexive, reciprocal, indefinite regarding the object, and many other specific limitations on the starting-point and endpoint of the flow of activity). These details, many of which are crucial for understanding the “inner form” of language, generally take a backseat to the broader group distinctions we’ve established. It’s sufficient for the average reader to understand that language is striving towards two extremes of expression—material content and relation—and that these extremes tend to be linked by a long series of transitional concepts.

In dealing with words and their varying forms we have had to anticipate much that concerns the sentence as a whole. Every language has its special method or methods of binding words into a larger unity. The importance of these methods is apt to vary with the complexity of the individual word. The more synthetic the language, in other words, the more clearly the status of each word in the sentence is indicated by its own resources, the less need is there for looking beyond the word to the sentence as a whole. The Latin agit “(he) acts” needs no outside help to establish its place in a proposition. Whether I say agit dominus “the master acts” or sic femina agit “thus the woman acts,” the net result as to the syntactic feel of the agit is practically the same. It can only be a verb, the predicate of a proposition, and it can only be conceived as a statement of activity carried out by a person (or thing) other than you or me. It is not so with such a word as the English act. Act is a syntactic waif until we have defined its status in a proposition—one thing in “they act abominably,” quite another in “that was a kindly act.” The Latin sentence speaks with the assurance of its individual members, the English word needs the prompting of its fellows. Roughly speaking, to be sure. And yet to say that a sufficiently elaborate word-structure compensates for external syntactic methods is perilously close to begging the question. The elements of the word are related to each other in a specific way and follow each other in a rigorously determined sequence. This is tantamount to saying that a word which consists of more than a radical element is a crystallization of a sentence or of some portion of a sentence, that a form like agit is roughly the psychological[79] equivalent of a form like age is “act he.” Breaking down, then, the wall that separates word and sentence, we may ask: What, at last analysis, are the fundamental methods of relating word to word and element to element, in short, of passing from the isolated notions symbolized by each word and by each element to the unified proposition that corresponds to a thought?

In working with words and their different forms, we have had to consider a lot about the sentence as a whole. Every language has its own way of connecting words into a larger unit. The significance of these connections tends to change based on the complexity of each word. The more synthetic a language is, meaning the more clearly the role of each word in the sentence is defined by its own structure, the less we need to look beyond the word to grasp the sentence as a whole. The Latin word agit “(he) acts” doesn’t require any external context to determine its place in a statement. Whether I say agit dominus “the master acts” or sic femina agit “thus the woman acts,” the overall effect on the syntax of agit remains almost the same. It is clearly just a verb, the predicate of a sentence, and it can only be understood as an action performed by someone or something other than you or me. This isn’t the case with the English word act. Act lacks a clear syntactic role until we define its place in a sentence—it's one thing in “they act abominably” and quite another in “that was a kindly act.” The Latin sentence confidently conveys meaning through its individual components, whereas the English word needs help from its peers. Generally speaking, that is. However, claiming that a sufficiently complex word structure makes up for the lack of external syntactic connections is dangerously close to avoiding the issue. The parts of the word are related in a specific manner and follow a strictly defined order. This suggests that a word made up of more than just a root element is essentially a crystallization of a sentence or part of a sentence, making a form like agit roughly the psychological equivalent of a form like age is “act he.” By breaking down the barrier between words and sentences, we can ask: What, at the core, are the fundamental ways to connect word to word and element to element, essentially, how do we go from the individual ideas represented by each word and element to the unified statement that reflects a thought?

The answer is simple and is implied in the preceding remarks. The most fundamental and the most powerful of all relating methods is the method of order. Let us think of some more or less concrete idea, say a color, and set down its symbol—red; of another concrete idea, say a person or object, setting down its symbol—dog; finally, of a third concrete idea, say an action, setting down its symbol—run. It is hardly possible to set down these three symbols—red dog run—without relating them in some way, for example (the) red dog run(s). I am far from wishing to state that the proposition has always grown up in this analytic manner, merely that the very process of juxtaposing concept to concept, symbol to symbol, forces some kind of relational “feeling,” if nothing else, upon us. To certain syntactic adhesions we are very sensitive, for example, to the attributive relation of quality (red dog) or the subjective relation (dog run) or the objective relation (kill dog), to others we are more indifferent, for example, to the attributive relation of circumstance (to-day red dog run or red dog to-day run or red dog run to-day, all of which are equivalent propositions or propositions in embryo). Words and elements, then, once they are listed in a certain order, tend not only to establish some kind of relation among themselves but are attracted to each other in greater or in less degree. It is presumably this very greater or less that ultimately leads to those firmly solidified groups of elements (radical element or elements plus one or more grammatical elements) that we have studied as complex words. They are in all likelihood nothing but sequences that have shrunk together and away from other sequences or isolated elements in the flow of speech. While they are fully alive, in other words, while they are functional at every point, they can keep themselves at a psychological distance from their neighbors. As they gradually lose much of their life, they fall back into the embrace of the sentence as a whole and the sequence of independent words regains the importance it had in part transferred to the crystallized groups of elements. Speech is thus constantly tightening and loosening its sequences. In its highly integrated forms (Latin, Eskimo) the “energy” of sequence is largely locked up in complex word formations, it becomes transformed into a kind of potential energy that may not be released for millennia. In its more analytic forms (Chinese, English) this energy is mobile, ready to hand for such service as we demand of it.

The answer is straightforward and is hinted at in the earlier comments. The most basic and strongest way of relating ideas is through order. Let’s take a concrete idea, like a color, and note its symbol—red; then another concrete idea, like a person or object, and note its symbol—dog; finally, a third concrete idea, such as an action, and write its symbol—run. It’s almost impossible to put these three symbols—red dog run—together without relating them somehow, for example, (the) red dog run(s). I’m not claiming that every statement has developed this way, just that the act of placing one concept next to another, one symbol next to another, compels some kind of relational “feeling,” at the very least. We are quite sensitive to certain syntactic connections, like the descriptive relationship of quality (red dog) or the subjective relationship (dog run) or the objective relationship (kill dog); however, we're less concerned about others, such as the situational relationship (to-day red dog run or red dog to-day run or red dog run to-day, all of which are equivalent or embryonic propositions). Words and elements, once arranged in a specific order, tend to not only create some sort of relationship among themselves but also are drawn to each other to varying degrees. It’s likely that this variation leads to those solid groups of elements (one or more core elements plus grammatical components) we’ve examined as complex words. They probably are nothing more than sequences that have combined and separated from other sequences or isolated elements in speech. While they are lively—meaning, while they function effectively— they can maintain a psychological distance from their surroundings. As they gradually lose vitality, they merge back into the larger sentence, and the sequence of independent words regains the significance it had previously shared with the crystallized groups of elements. Speech is thus continually tightening and loosening its sequences. In its highly integrated forms (like Latin or Eskimo), the “energy” of sequence is largely contained in complex word formations, becoming a kind of potential energy that might not be tapped for thousands of years. In more analytical forms (like Chinese or English), this energy is fluid and ready for the tasks we require of it.

There can be little doubt that stress has frequently played a controlling influence in the formation of element-groups or complex words out of certain sequences in the sentence. Such an English word as withstand is merely an old sequence with stand, i.e., “against[80] stand,” in which the unstressed adverb was permanently drawn to the following verb and lost its independence as a significant element. In the same way French futures of the type irai “(I) shall go” are but the resultants of a coalescence of originally independent words: ir[81] a’i “to-go I-have,” under the influence of a unifying accent. But stress has done more than articulate or unify sequences that in their own right imply a syntactic relation. Stress is the most natural means at our disposal to emphasize a linguistic contrast, to indicate the major element in a sequence. Hence we need not be surprised to find that accent too, no less than sequence, may serve as the unaided symbol of certain relations. Such a contrast as that of go' between (“one who goes between”) and to go between' may be of quite secondary origin in English, but there is every reason to believe that analogous distinctions have prevailed at all times in linguistic history. A sequence like see' man might imply some type of relation in which see qualifies the following word, hence “a seeing man” or “a seen (or visible) man,” or is its predication, hence “the man sees” or “the man is seen,” while a sequence like see man' might indicate that the accented word in some way limits the application of the first, say as direct object, hence “to see a man” or “(he) sees the man.” Such alternations of relation, as symbolized by varying stresses, are important and frequent in a number of languages.[82]

There’s no doubt that stress often plays a key role in forming groups of elements or complex words from certain sequences in a sentence. An English word like withstand is simply an old sequence with stand, meaning “against[80] stand,” where the unstressed adverb became permanently attached to the following verb and lost its independence as a significant element. Similarly, French future verbs like irai “(I) shall go” are just the result of formerly independent words merging: ir[81] a’i “to-go I-have,” influenced by a unifying accent. But stress does more than clarify or unify sequences that imply a syntactic relation on their own. Stress is the most natural tool we have to highlight a linguistic contrast, indicating the main element in a sequence. So, it’s not surprising that accent, just like sequence, can also act as a clear symbol of certain relationships. A contrast like go' between (“one who goes between”) and to go between' may have a fairly secondary origin in English, but there's every reason to think that similar distinctions have existed throughout linguistic history. A sequence like see' man could imply some kind of relationship in which see modifies the following word, thus meaning “a seeing man” or “a seen (or visible) man,” or it serves as the predicate, meaning “the man sees” or “the man is seen,” while a sequence like see man' might indicate that the stressed word limits the application of the first, such as in “to see a man” or “(he) sees the man.” These shifts in relation, represented by different stresses, are significant and common in many languages.[82]

It is a somewhat venturesome and yet not an altogether unreasonable speculation that sees in word order and stress the primary methods for the expression of all syntactic relations and looks upon the present relational value of specific words and elements as but a secondary condition due to a transfer of values. Thus, we may surmise that the Latin -m of words like feminam, dominum, and civem did not originally[83] denote that “woman,” “master,” and “citizen” were objectively related to the verb of the proposition but indicated something far more concrete,[84] that the objective relation was merely implied by the position or accent of the word (radical element) immediately preceding the -m, and that gradually, as its more concrete significance faded away, it took over a syntactic function that did not originally belong to it. This sort of evolution by transfer is traceable in many instances. Thus, the of in an English phrase like “the law of the land” is now as colorless in content, as purely a relational indicator as the “genitive” suffix -is in the Latin lex urbis “the law of the city.” We know, however, that it was originally an adverb of considerable concreteness of meaning,[85] “away, moving from,” and that the syntactic relation was originally expressed by the case form[86] of the second noun. As the case form lost its vitality, the adverb took over its function. If we are actually justified in assuming that the expression of all syntactic relations is ultimately traceable to these two unavoidable, dynamic features of speech—sequence and stress[87]—an interesting thesis results:—All of the actual content of speech, its clusters of vocalic and consonantal sounds, is in origin limited to the concrete; relations were originally not expressed in outward form but were merely implied and articulated with the help of order and rhythm. In other words, relations were intuitively felt and could only “leak out” with the help of dynamic factors that themselves move on an intuitional plane.

It’s a bit of a bold idea, but not completely crazy, to think that word order and stress are the main ways we show all the relationships in sentences. The current meaning of specific words seems to be just a secondary situation that arises from how values have shifted. So, we might guess that the Latin -m in words like feminam, dominum, and civem didn’t originally mean that “woman,” “master,” and “citizen” were directly related to the verb in the sentence, but suggested something much more concrete, and that the actual relationship was just implied by the position or emphasis of the word (the root element) right before the -m. Over time, as its original, clearer meaning disappeared, it took on a grammatical role that it didn’t have at first. This kind of change through shifting is seen in many cases. For example, the of in the English phrase “the law of the land” is now as neutral in meaning, just a relational marker, as the “genitive” suffix -is in the Latin lex urbis “the law of the city.” We know that it originally meant something more concrete, like “away, moving from,” and that the grammatical relationship was originally shown through the case form of the second noun. As the case form lost its power, the adverb took over that role. If we can justifiably assume that all syntactical relationships come down to these two essential, dynamic elements of speech—sequence and stress—then we arrive at an intriguing argument: All actual content in speech, the combinations of vowel and consonant sounds, originally referred to the concrete; relationships were not shown in physical form but were only suggested and expressed through order and rhythm. In other words, relationships were felt instinctively and could only “leak out” through dynamic factors that operate on an intuitive level.

There is a special method for the expression of relations that has been so often evolved in the history of language that we must glance at it for a moment. This is the method of “concord” or of like signaling. It is based on the same principle as the password or label. All persons or objects that answer to the same counter-sign or that bear the same imprint are thereby stamped as somehow related. It makes little difference, once they are so stamped, where they are to be found or how they behave themselves. They are known to belong together. We are familiar with the principle of concord in Latin and Greek. Many of us have been struck by such relentless rhymes as vidi ilium bonum dominum “I saw that good master” or quarum dearum saevarum “of which stern goddesses.” Not that sound-echo, whether in the form of rhyme or of alliteration[88] is necessary to concord, though in its most typical and original forms concord is nearly always accompanied by sound repetition. The essence of the principle is simply this, that words (elements) that belong together, particularly if they are syntactic equivalents or are related in like fashion to another word or element, are outwardly marked by the same or functionally equivalent affixes. The application of the principle varies considerably according to the genius of the particular language. In Latin and Greek, for instance, there is concord between noun and qualifying word (adjective or demonstrative) as regards gender, number, and case, between verb and subject only as regards number, and no concord between verb and object.

There’s a unique way to express relationships that has been developed so frequently throughout the history of language that we should take a moment to look at it. This is the method of “concord” or similar signaling. It is based on the same idea as a password or label. All people or things that respond to the same counter-sign or that have the same mark are, therefore, identified as somehow related. Once they’re identified in this way, it doesn’t really matter where they are or how they act. They are recognized as belonging together. We see the principle of concord in Latin and Greek. Many of us have noticed relentless rhymes like vidi ilium bonum dominum “I saw that good master” or quarum dearum saevarum “of which stern goddesses.” However, sound-echo, whether in the form of rhyme or alliteration[88], isn’t essential to concord, although in its most typical and original forms, concord is almost always accompanied by sound repetition. The core of the principle is simply that words (elements) that belong together, especially if they are syntactic equivalents or relate similarly to another word or element, are outwardly marked by the same or functionally equivalent affixes. The application of this principle varies greatly depending on the characteristics of each language. In Latin and Greek, for example, there is concord between nouns and qualifying words (adjectives or demonstratives) regarding gender, number, and case, between verbs and subjects concerning number only, and no concord between verbs and objects.

In Chinook there is a more far-reaching concord between noun, whether subject or object, and verb. Every noun is classified according to five categories—masculine, feminine, neuter,[89] dual, and plural. “Woman” is feminine, “sand” is neuter, “table” is masculine. If, therefore, I wish to say “The woman put the sand on the table,” I must place in the verb certain class or gender prefixes that accord with corresponding noun prefixes. The sentence reads then, “The (fem.)-woman she (fem.)-it (neut.)-it (masc.)-on-put the (neut.)-sand the (masc.)-table.” If “sand” is qualified as “much” and “table” as “large,” these new ideas are expressed as abstract nouns, each with its inherent class-prefix (“much” is neuter or feminine, “large” is masculine) and with a possessive prefix referring to the qualified noun. Adjective thus calls to noun, noun to verb. “The woman put much sand on the large table,” therefore, takes the form: “The (fem.)-woman she (fem.)-it (neut.)-it (masc.)-on-put the (fem.)-thereof (neut.)-quantity the (neut.)-sand the (masc.)-thereof (masc.)-largeness the (masc.)-table.” The classification of “table” as masculine is thus three times insisted on—in the noun, in the adjective, and in the verb. In the Bantu languages,[90] the principle of concord works very much as in Chinook. In them also nouns are classified into a number of categories and are brought into relation with adjectives, demonstratives, relative pronouns, and verbs by means of prefixed elements that call off the class and make up a complex system of concordances. In such a sentence as “That fierce lion who came here is dead,” the class of “lion,” which we may call the animal class, would be referred to by concording prefixes no less than six times,—with the demonstrative (“that”), the qualifying adjective, the noun itself, the relative pronoun, the subjective prefix to the verb of the relative clause, and the subjective prefix to the verb of the main clause (“is dead”). We recognize in this insistence on external clarity of reference the same spirit as moves in the more familiar illum bonum dominum.

In Chinook, there’s a stronger agreement between nouns, whether they are subjects or objects, and verbs. Every noun falls into five categories—masculine, feminine, neuter, dual, and plural. “Woman” is feminine, “sand” is neuter, and “table” is masculine. So, if I want to say “The woman put the sand on the table,” I have to use specific class or gender prefixes in the verb that match the prefixes of the corresponding nouns. The sentence becomes: “The (fem.)-woman she (fem.)-it (neut.)-it (masc.)-on-put the (neut.)-sand the (masc.)-table.” If “sand” is described as “much” and “table” as “large,” these new ideas turn into abstract nouns, each with its class-prefix (“much” is neuter or feminine, “large” is masculine) and a possessive prefix linked to the noun being qualified. So, “The woman put much sand on the large table” turns into: “The (fem.)-woman she (fem.)-it (neut.)-it (masc.)-on-put the (fem.)-thereof (neut.)-quantity the (neut.)-sand the (masc.)-thereof (masc.)-largeness the (masc.)-table.” The classification of “table” as masculine is emphasized three times—in the noun, the adjective, and the verb. In the Bantu languages, the principle of concord functions much like in Chinook. In these languages, nouns are also categorized and related to adjectives, demonstratives, relative pronouns, and verbs through prefixed elements that indicate the class and create a complex system of agreements. In a sentence like “That fierce lion who came here is dead,” the class of “lion,” which we might call the animal class, would be referenced by concording prefixes six times— with the demonstrative (“that”), the descriptive adjective, the noun itself, the relative pronoun, the subject prefix of the verb in the relative clause, and the subject prefix of the verb in the main clause (“is dead”). This emphasis on clear external references reflects the same spirit found in the more familiar *illum bonum dominum*.

Psychologically the methods of sequence and accent lie at the opposite pole to that of concord. Where they are all for implication, for subtlety of feeling, concord is impatient of the least ambiguity but must have its well-certificated tags at every turn. Concord tends to dispense with order. In Latin and Chinook the independent words are free in position, less so in Bantu. In both Chinook and Bantu, however, the methods of concord and order are equally important for the differentiation of subject and object, as the classifying verb prefixes refer to subject, object, or indirect object according to the relative position they occupy. These examples again bring home to us the significant fact that at some point or other order asserts itself in every language as the most fundamental of relating principles.

Psychologically, the methods of sequence and emphasis are completely different from those of agreement. While they focus on suggestion and subtle emotions, agreement can’t stand any ambiguity and requires clear indicators at every step. Agreement often ignores the need for order. In Latin and Chinook, independent words can appear in any order, while in Bantu, this is less so. However, in both Chinook and Bantu, the methods of agreement and order are crucial for distinguishing between the subject and object, as the verb prefixes indicate whether something is the subject, object, or indirect object based on their position. These examples highlight an important fact: at some point, order becomes the most fundamental principle of relation in every language.

The observant reader has probably been surprised that all this time we have had so little to say of the time-honored “parts of speech.” The reason for this is not far to seek. Our conventional classification of words into parts of speech is only a vague, wavering approximation to a consistently worked out inventory of experience. We imagine, to begin with, that all “verbs” are inherently concerned with action as such, that a “noun” is the name of some definite object or personality that can be pictured by the mind, that all qualities are necessarily expressed by a definite group of words to which we may appropriately apply the term “adjective.” As soon as we test our vocabulary, we discover that the parts of speech are far from corresponding to so simple an analysis of reality. We say “it is red” and define “red” as a quality-word or adjective. We should consider it strange to think of an equivalent of “is red” in which the whole predication (adjective and verb of being) is conceived of as a verb in precisely the same way in which we think of “extends” or “lies” or “sleeps” as a verb. Yet as soon as we give the “durative” notion of being red an inceptive or transitional turn, we can avoid the parallel form “it becomes red, it turns red” and say “it reddens.” No one denies that “reddens” is as good a verb as “sleeps” or even “walks.” Yet “it is red” is related to “it reddens” very much as is “he stands” to “he stands up” or “he rises.” It is merely a matter of English or of general Indo-European idiom that we cannot say “it reds” in the sense of “it is red.” There are hundreds of languages that can. Indeed there are many that can express what we should call an adjective only by making a participle out of a verb. “Red” in such languages is merely a derivative “being red,” as our “sleeping” or “walking” are derivatives of primary verbs.

The attentive reader might be surprised that we've had so little to say about the traditional "parts of speech." The reason for this is straightforward. Our standard classification of words into parts of speech is only a rough approximation of a well-defined inventory of experience. To start, we think that all “verbs” are inherently related to action, that a “noun” names a specific object or person that we can picture in our minds, and that all qualities are expressed by a specific set of words we can call “adjectives.” However, when we examine our vocabulary, we find that the parts of speech don't neatly match such a simple analysis of reality. We say “it is red” and identify “red” as a quality word or adjective. It would seem odd to consider an equivalent of “is red” where the entire statement (adjective and verb of being) is viewed as a verb, similar to how we see “extends,” “lies,” or “sleeps” as verbs. Yet, as soon as we give the concept of being red a beginning or transitional aspect, we can skip the parallel form “it becomes red, it turns red” and just say “it reddens.” No one argues that “reddens” is just as valid a verb as “sleeps” or even “walks.” Yet “it is red” relates to “it reddens” much like “he stands” relates to “he stands up” or “he rises.” It’s simply a matter of English or general Indo-European usage that we can’t say “it reds” in the sense of “it is red.” Many languages can. In fact, there are numerous languages that can express what we refer to as an adjective solely by turning a verb into a participle. In those languages, “red” is just a derivative of “being red,” similar to how our “sleeping” or “walking” are derivatives of primary verbs.

Just as we can verbify the idea of a quality in such cases as “reddens,” so we can represent a quality or an action to ourselves as a thing. We speak of “the height of a building” or “the fall of an apple” quite as though these ideas were parallel to “the roof of a building” or “the skin of an apple,” forgetting that the nouns (height, fall) have not ceased to indicate a quality and an act when we have made them speak with the accent of mere objects. And just as there are languages that make verbs of the great mass of adjectives, so there are others that make nouns of them. In Chinook, as we have seen, “the big table” is “the-table its-bigness”; in Tibetan the same idea may be expressed by “the table of bigness,” very much as we may say “a man of wealth” instead of “a rich man.”

Just like we can turn an adjective into a verb in phrases like “reddens,” we can also think of a quality or an action as a thing. We say “the height of a building” or “the fall of an apple” as if these concepts are similar to “the roof of a building” or “the skin of an apple,” overlooking the fact that the nouns (height, fall) still represent a quality and an action even when we treat them like regular objects. Just as some languages turn many adjectives into verbs, others turn them into nouns. In Chinook, for example, “the big table” becomes “the-table its-bigness”; in Tibetan, that idea is expressed as “the table of bigness,” similar to how we might say “a man of wealth” instead of “a rich man.”

But are there not certain ideas that it is impossible to render except by way of such and such parts of speech? What can be done with the “to” of “he came to the house”? Well, we can say “he reached the house” and dodge the preposition altogether, giving the verb a nuance that absorbs the idea of local relation carried by the “to.” But let us insist on giving independence to this idea of local relation. Must we not then hold to the preposition? No, we can make a noun of it. We can say something like “he reached the proximity of the house” or “he reached the house-locality.” Instead of saying “he looked into the glass” we may say “he scrutinized the glass-interior.” Such expressions are stilted in English because they do not easily fit into our formal grooves, but in language after language we find that local relations are expressed in just this way. The local relation is nominalized. And so we might go on examining the various parts of speech and showing how they not merely grade into each other but are to an astonishing degree actually convertible into each other. The upshot of such an examination would be to feel convinced that the “part of speech” reflects not so much our intuitive analysis of reality as our ability to compose that reality into a variety of formal patterns. A part of speech outside of the limitations of syntactic form is but a will o’ the wisp. For this reason no logical scheme of the parts of speech—their number, nature, and necessary confines—is of the slightest interest to the linguist. Each language has its own scheme. Everything depends on the formal demarcations which it recognizes.

But aren’t there certain ideas that we can only express using specific parts of speech? What do we do with the “to” in “he came to the house”? We could say “he reached the house” and skip the preposition entirely, giving the verb a meaning that includes the local relationship suggested by “to.” But if we want to keep that idea of local relation independent, do we have to stick with the preposition? No, we can turn it into a noun. We could say something like “he reached the proximity of the house” or “he reached the house-locality.” Instead of saying “he looked into the glass,” we might say “he scrutinized the glass-interior.” Such phrases sound awkward in English because they don’t fit neatly into our usual structures, but in many languages, local relations are expressed this way. The local relation is nominalized. We could keep examining different parts of speech and demonstrate how they not only blend into one another but are surprisingly convertible into each other. The conclusion of such an examination would lead us to believe that “part of speech” reflects more our structured way of understanding reality than a clear analysis of it. A part of speech that goes beyond syntactic boundaries is just an illusion. That’s why no logical classification of parts of speech—their number, nature, and necessary boundaries—interests linguists. Each language has its own system. Everything depends on the formal distinctions it recognizes.

Yet we must not be too destructive. It is well to remember that speech consists of a series of propositions. There must be something to talk about and something must be said about this subject of discourse once it is selected. This distinction is of such fundamental importance that the vast majority of languages have emphasized it by creating some sort of formal barrier between the two terms of the proposition. The subject of discourse is a noun. As the most common subject of discourse is either a person or a thing, the noun clusters about concrete concepts of that order. As the thing predicated of a subject is generally an activity in the widest sense of the word, a passage from one moment of existence to another, the form which has been set aside for the business of predicating, in other words, the verb, clusters about concepts of activity. No language wholly fails to distinguish noun and verb, though in particular cases the nature of the distinction may be an elusive one. It is different with the other parts of speech. Not one of them is imperatively required for the life of language.[91]

Yet we must not be overly destructive. It's important to remember that language consists of a series of statements. There needs to be something to discuss, and something must be said about that topic once it’s chosen. This distinction is so crucial that most languages have created some kind of formal barrier between the two parts of a statement. The topic of discussion is a noun. Since the most common topics are either people or things, nouns are focused on concrete concepts in that regard. As the action described by a subject is generally an activity in the broadest sense, a transition from one state of existence to another, the form reserved for expressing this action, namely the verb, relates to concepts of activity. No language entirely fails to distinguish between nouns and verbs, although the nature of that distinction can sometimes be subtle. This is not the case for the other parts of speech. None of them is absolutely necessary for language to function.[91]

VI

Types of Linguistic Structure

So far, in dealing with linguistic form, we have been concerned only with single words and with the relations of words in sentences. We have not envisaged whole languages as conforming to this or that general type. Incidentally we have observed that one language runs to tight-knit synthesis where another contents itself with a more analytic, piece-meal handling of its elements, or that in one language syntactic relations appear pure which in another are combined with certain other notions that have something concrete about them, however abstract they may be felt to be in practice. In this way we may have obtained some inkling of what is meant when we speak of the general form of a language. For it must be obvious to any one who has thought about the question at all or who has felt something of the spirit of a foreign language that there is such a thing as a basic plan, a certain cut, to each language. This type or plan or structural “genius” of the language is something much more fundamental, much more pervasive, than any single feature of it that we can mention, nor can we gain an adequate idea of its nature by a mere recital of the sundry facts that make up the grammar of the language. When we pass from Latin to Russian, we feel that it is approximately the same horizon that bounds our view, even though the near, familiar landmarks have changed. When we come to English, we seem to notice that the hills have dipped down a little, yet we recognize the general lay of the land. And when we have arrived at Chinese, it is an utterly different sky that is looking down upon us. We can translate these metaphors and say that all languages differ from one another but that certain ones differ far more than others. This is tantamount to saying that it is possible to group them into morphological types.

So far, when discussing linguistic form, we've only focused on individual words and the relationships of words within sentences. We haven't considered entire languages as fitting into specific general types. Along the way, we've noticed that one language tends to have tightly connected structures while another has a more analytical, fragmented approach to its elements, or that in one language, syntactic relationships are clear, whereas in another, they are mixed with other concepts that, despite being abstract, have a tangible quality. Through this, we've gained some understanding of what we mean when we refer to the overall structure of a language. It should be clear to anyone who has thought about this topic or experienced the essence of a foreign language that there is a fundamental design, a unique structure to each language. This type or design, or the structural “genius” of the language, is much more essential and far-reaching than any single feature we can point out, and we cannot truly grasp its nature simply by listing the various facts that make up the language's grammar. When we move from Latin to Russian, we feel as though the same general landscape is surrounding us, even though the close, familiar landmarks have changed. When we get to English, it feels like the hills have lowered a bit, yet we still recognize the general layout. However, when we arrive at Chinese, it's a completely different atmosphere that greets us. We can translate these metaphors to say that all languages are different from each other, but some differ much more than others. This means that it's possible to categorize them into morphological types.

Strictly speaking, we know in advance that it is impossible to set up a limited number of types that would do full justice to the peculiarities of the thousands of languages and dialects spoken on the surface of the earth. Like all human institutions, speech is too variable and too elusive to be quite safely ticketed. Even if we operate with a minutely subdivided scale of types, we may be quite certain that many of our languages will need trimming before they fit. To get them into the scheme at all it will be necessary to overestimate the significance of this or that feature or to ignore, for the time being, certain contradictions in their mechanism. Does the difficulty of classification prove the uselessness of the task? I do not think so. It would be too easy to relieve ourselves of the burden of constructive thinking and to take the standpoint that each language has its unique history, therefore its unique structure. Such a standpoint expresses only a half truth. Just as similar social, economic, and religious institutions have grown up in different parts of the world from distinct historical antecedents, so also languages, traveling along different roads, have tended to converge toward similar forms. Moreover, the historical study of language has proven to us beyond all doubt that a language changes not only gradually but consistently, that it moves unconsciously from one type towards another, and that analogous trends are observable in remote quarters of the globe. From this it follows that broadly similar morphologies must have been reached by unrelated languages, independently and frequently. In assuming the existence of comparable types, therefore, we are not gainsaying the individuality of all historical processes; we are merely affirming that back of the face of history are powerful drifts that move language, like other social products, to balanced patterns, in other words, to types. As linguists we shall be content to realize that there are these types and that certain processes in the life of language tend to modify them. Why similar types should be formed, just what is the nature of the forces that make them and dissolve them—these questions are more easily asked than answered. Perhaps the psychologists of the future will be able to give us the ultimate reasons for the formation of linguistic types.

Strictly speaking, we know in advance that it's impossible to create a limited number of categories that would fully capture the uniqueness of the thousands of languages and dialects spoken around the world. Like all human institutions, language is too variable and too elusive to be neatly classified. Even if we use a highly detailed classification system, it's clear that many languages will require adjustments to fit in. To include them in the system at all, we will likely need to overrate the importance of certain features or temporarily overlook specific contradictions in their structure. Does the challenge of classification make the effort pointless? I don’t think so. It would be too easy to escape the responsibility of constructive thinking by claiming each language has its own unique history and, therefore, its own unique structure. This viewpoint represents only part of the truth. Just as similar social, economic, and religious institutions have developed in various parts of the world from different historical backgrounds, languages have also tended to converge toward similar forms despite their different evolutions. Furthermore, historical studies of language have shown us, without a doubt, that a language changes not only gradually but systematically, moving unconsciously from one type to another, with similar trends observable in distant parts of the globe. This suggests that broadly similar structures must have been achieved by unrelated languages independently and often. By acknowledging the existence of comparable types, we are not denying the individuality of all historical processes; we are simply recognizing that there are strong forces behind the scenes that guide language, like other social phenomena, toward balanced patterns, in other words, toward types. As linguists, we should accept that these types exist and recognize that certain processes in the life of language tend to modify them. The reasons for why similar types emerge, as well as the nature of the forces that create and change them—these questions are easier to ask than to answer. Perhaps future psychologists will be able to provide us with the ultimate explanations for the formation of linguistic types.

When it comes to the actual task of classification, we find that we have no easy road to travel. Various classifications have been suggested, and they all contain elements of value. Yet none proves satisfactory. They do not so much enfold the known languages in their embrace as force them down into narrow, straight-backed seats. The difficulties have been of various kinds. First and foremost, it has been difficult to choose a point of view. On what basis shall we classify? A language shows us so many facets that we may well be puzzled. And is one point of view sufficient? Secondly, it is dangerous to generalize from a small number of selected languages. To take, as the sum total of our material, Latin, Arabic, Turkish, Chinese, and perhaps Eskimo or Sioux as an afterthought, is to court disaster. We have no right to assume that a sprinkling of exotic types will do to supplement the few languages nearer home that we are more immediately interested in. Thirdly, the strong craving for a simple formula[92] has been the undoing of linguists. There is something irresistible about a method of classification that starts with two poles, exemplified, say, by Chinese and Latin, clusters what it conveniently can about these poles, and throws everything else into a “transitional type.” Hence has arisen the still popular classification of languages into an “isolating” group, an “agglutinative” group, and an “inflective” group. Sometimes the languages of the American Indians are made to straggle along as an uncomfortable “polysynthetic” rear-guard to the agglutinative languages. There is justification for the use of all of these terms, though not perhaps in quite the spirit in which they are commonly employed. In any case it is very difficult to assign all known languages to one or other of these groups, the more so as they are not mutually exclusive. A language may be both agglutinative and inflective, or inflective and polysynthetic, or even polysynthetic and isolating, as we shall see a little later on.

When it comes to actually classifying languages, we have no easy path ahead. Various classifications have been proposed, and they all have their merits. However, none are completely satisfactory. They don’t really encompass all known languages; instead, they force them into narrow, rigid categories. The challenges are numerous. First and foremost, deciding on a point of view has been tough. On what basis should we classify? A language has so many aspects that it can be confusing. Is one perspective enough? Secondly, generalizing from a small selection of chosen languages is risky. If we take only Latin, Arabic, Turkish, Chinese, and maybe include Eskimo or Sioux as an afterthought, we open ourselves up to problems. We can’t assume that a handful of exotic languages will adequately supplement the few we’re more interested in locally. Thirdly, the strong desire for a straightforward formula has led linguists astray. There’s something appealing about a classification method that starts with two extremes, like Chinese and Latin, groups what it can around these extremes, and categorizes everything else as a “transitional type.” This has led to the still-common classification of languages into “isolating,” “agglutinative,” and “inflective” groups. Sometimes, the languages of Native Americans are awkwardly tagged on as an uncomfortable “polysynthetic” trailing group to the agglutinative languages. While there's a basis for using all these terms, they might not be employed in the way that accurately reflects their spirit. In any case, it's challenging to fit all known languages into one of these categories, especially since they aren't mutually exclusive. A language can be both agglutinative and inflective, or inflective and polysynthetic, or even polysynthetic and isolating, as we will discuss later.

There is a fourth reason why the classification of languages has generally proved a fruitless undertaking. It is probably the most powerful deterrent of all to clear thinking. This is the evolutionary prejudice which instilled itself into the social sciences towards the middle of the last century and which is only now beginning to abate its tyrannical hold on our mind. Intermingled with this scientific prejudice and largely anticipating it was another, a more human one. The vast majority of linguistic theorists themselves spoke languages of a certain type, of which the most fully developed varieties were the Latin and Greek that they had learned in their childhood. It was not difficult for them to be persuaded that these familiar languages represented the “highest” development that speech had yet attained and that all other types were but steps on the way to this beloved “inflective” type. Whatever conformed to the pattern of Sanskrit and Greek and Latin and German was accepted as expressive of the “highest,” whatever departed from it was frowned upon as a shortcoming or was at best an interesting aberration.[93] Now any classification that starts with preconceived values or that works up to sentimental satisfactions is self-condemned as unscientific. A linguist that insists on talking about the Latin type of morphology as though it were necessarily the high-water mark of linguistic development is like the zoölogist that sees in the organic world a huge conspiracy to evolve the race-horse or the Jersey cow. Language in its fundamental forms is the symbolic expression of human intuitions. These may shape themselves in a hundred ways, regardless of the material advancement or backwardness of the people that handle the forms, of which, it need hardly be said, they are in the main unconscious. If, therefore, we wish to understand language in its true inwardness we must disabuse our minds of preferred “values”[94] and accustom ourselves to look upon English and Hottentot with the same cool, yet interested, detachment.

There is a fourth reason why classifying languages has generally been a fruitless effort. It’s probably the strongest barrier to clear thinking. This is the evolutionary bias that took hold of the social sciences around the middle of the last century and is only now starting to loosen its grip on our minds. Alongside this scientific bias, which largely anticipated it, was another, more personal one. Most linguistic theorists spoke languages of a certain kind, the most developed versions being the Latin and Greek they learned as children. It wasn’t hard for them to believe that these familiar languages represented the “highest” level that speech had reached, and that all other types were just steps toward this cherished “inflective” type. Anything that fit the mold of Sanskrit, Greek, Latin, or German was seen as the “highest” expression, while anything that diverged from it was viewed as a flaw or, at best, a curious anomaly. Now, any classification that begins with preconceived values or aims for sentimental satisfaction is automatically deemed unscientific. A linguist who insists on treating the Latin type of morphology as if it were the pinnacle of linguistic evolution is like a zoologist who perceives a vast conspiracy in the organic world to evolve the racehorse or the Jersey cow. Language, in its basic forms, is the symbolic representation of human intuitions. These can take shape in countless ways, regardless of the material progress or setbacks of the people using these forms, which, it should be noted, they are mostly unaware of. Therefore, if we want to truly understand language, we must clear our minds of any preferred “values” and train ourselves to view English and Hottentot with the same cool, yet interested, detachment.

We come back to our first difficulty. What point of view shall we adopt for our classification? After all that we have said about grammatical form in the preceding chapter, it is clear that we cannot now make the distinction between form languages and formless languages that used to appeal to some of the older writers. Every language can and must express the fundamental syntactic relations even though there is not a single affix to be found in its vocabulary. We conclude that every language is a form language. Aside from the expression of pure relation a language may, of course, be “formless”—formless, that is, in the mechanical and rather superficial sense that it is not encumbered by the use of non-radical elements. The attempt has sometimes been made to formulate a distinction on the basis of “inner form.” Chinese, for instance, has no formal elements pure and simple, no “outer form,” but it evidences a keen sense of relations, of the difference between subject and object, attribute and predicate, and so on. In other words, it has an “inner form” in the same sense in which Latin possesses it, though it is outwardly “formless” where Latin is outwardly “formal.” On the other hand, there are supposed to be languages[95] which have no true grasp of the fundamental relations but content themselves with the more or less minute expression of material ideas, sometimes with an exuberant display of “outer form,” leaving the pure relations to be merely inferred from the context. I am strongly inclined to believe that this supposed “inner formlessness” of certain languages is an illusion. It may well be that in these languages the relations are not expressed in as immaterial a way as in Chinese or even as in Latin,[96] or that the principle of order is subject to greater fluctuations than in Chinese, or that a tendency to complex derivations relieves the language of the necessity of expressing certain relations as explicitly as a more analytic language would have them expressed.[97] All this does not mean that the languages in question have not a true feeling for the fundamental relations. We shall therefore not be able to use the notion of “inner formlessness,” except in the greatly modified sense that syntactic relations may be fused with notions of another order. To this criterion of classification we shall have to return a little later.

We return to our initial challenge. What perspective should we adopt for our classification? Based on everything we discussed about grammatical form in the previous chapter, it's clear that we can no longer draw the distinction between form languages and formless languages that was favored by some older writers. Every language can and must express the essential syntactic relationships, even if there isn't a single affix in its vocabulary. Therefore, we conclude that every language is a form language. Besides expressing pure relationships, a language might be considered "formless"—meaning it lacks the mechanical and somewhat superficial characteristic of being burdened by the use of non-radical elements. Some have tried to establish a distinction based on "inner form." For example, Chinese has no simple formal elements or "outer form," but it demonstrates a strong awareness of relationships, like the difference between subject and object, attribute and predicate, and so forth. In other words, it possesses an "inner form" in the same way that Latin does, even though it appears "formless" externally while Latin appears "formal." Conversely, there are supposed languages[95] that lack a true understanding of fundamental relationships but only manage a more or less detailed expression of concrete ideas, sometimes with an extravagant representation of "outer form," leaving the pure relationships to be inferred from context. I strongly think that this supposed "inner formlessness" of certain languages is an illusion. It may be that in these languages, the relationships aren't expressed as abstractly as in Chinese or even as in Latin,[96] or that their order is more variable than in Chinese, or that a tendency for complex derivations frees the language from the need to explicitly express certain relationships as a more analytic language would. [97] However, this doesn't mean that these languages don't have a genuine sense of fundamental relationships. Thus, we won’t be able to use the idea of “inner formlessness,” except in a significantly revised sense where syntactic relations may overlap with ideas of another kind. We will revisit this classification criterion later.

More justifiable would be a classification according to the formal processes[98] most typically developed in the language. Those languages that always identify the word with the radical element would be set off as an “isolating” group against such as either affix modifying elements (affixing languages) or possess the power to change the significance of the radical element by internal changes (reduplication; vocalic and consonantal change; changes in quantity, stress, and pitch). The latter type might be not inaptly termed “symbolic” languages.[99] The affixing languages would naturally subdivide themselves into such as are prevailingly prefixing, like Bantu or Tlingit, and such as are mainly or entirely suffixing, like Eskimo or Algonkin or Latin. There are two serious difficulties with this fourfold classification (isolating, prefixing, suffixing, symbolic). In the first place, most languages fall into more than one of these groups. The Semitic languages, for instance, are prefixing, suffixing, and symbolic at one and the same time. In the second place, the classification in its bare form is superficial. It would throw together languages that differ utterly in spirit merely because of a certain external formal resemblance. There is clearly a world of difference between a prefixing language like Cambodgian, which limits itself, so far as its prefixes (and infixes) are concerned, to the expression of derivational concepts, and the Bantu languages, in which the prefixed elements have a far-reaching significance as symbols of syntactic relations. The classification has much greater value if it is taken to refer to the expression of relational concepts[100] alone. In this modified form we shall return to it as a subsidiary criterion. We shall find that the terms “isolating,” “affixing,” and “symbolic” have a real value. But instead of distinguishing between prefixing and suffixing languages, we shall find that it is of superior interest to make another distinction, one that is based on the relative firmness with which the affixed elements are united with the core of the word.[101]

A more justified way to classify languages might be based on the formal processes[98] typically found in the language. Languages that always link the word with the root element would be grouped as an “isolating” category, compared to those that use affixes to modify elements (affixing languages) or can alter the meaning of the root element through internal changes (such as reduplication, vocalic and consonantal changes, or changes in length, stress, and pitch). The latter group could aptly be called “symbolic” languages.[99] The affixing languages would naturally split into those that predominantly use prefixes, like Bantu or Tlingit, and those that are mainly or entirely suffixing, such as Eskimo, Algonkin, or Latin. There are two major issues with this four-part classification (isolating, prefixing, suffixing, symbolic). First, most languages fit into more than one category. For example, Semitic languages are simultaneously prefixing, suffixing, and symbolic. Second, the classification is superficial in its basic form. It would group languages that are fundamentally different in character simply because they have some external formal similarities. There’s a significant difference between a prefixing language like Cambodian, which limits itself to expressing derivational concepts with its prefixes (and infixes), and Bantu languages, where the prefixed elements play a crucial role as symbols of syntactic relationships. The classification is far more useful if it refers solely to the expression of relational concepts[100]. In this adjusted form, we will revisit it as a secondary criterion. We will discover that the terms “isolating,” “affixing,” and “symbolic” hold real significance. However, rather than distinguishing between prefixing and suffixing languages, it will be more interesting to make a different distinction based on how firmly the affixed elements are connected to the core of the word.[101]

There is another very useful set of distinctions that can be made, but these too must not be applied exclusively, or our classification will again be superficial. I refer to the notions of “analytic,” “synthetic,” and “polysynthetic.” The terms explain themselves. An analytic language is one that either does not combine concepts into single words at all (Chinese) or does so economically (English, French). In an analytic language the sentence is always of prime importance, the word is of minor interest. In a synthetic language (Latin, Arabic, Finnish) the concepts cluster more thickly, the words are more richly chambered, but there is a tendency, on the whole, to keep the range of concrete significance in the single word down to a moderate compass. A polysynthetic language, as its name implies, is more than ordinarily synthetic. The elaboration of the word is extreme. Concepts which we should never dream of treating in a subordinate fashion are symbolized by derivational affixes or “symbolic” changes in the radical element, while the more abstract notions, including the syntactic relations, may also be conveyed by the word. A polysynthetic language illustrates no principles that are not already exemplified in the more familiar synthetic languages. It is related to them very much as a synthetic language is related to our own analytic English.[102] The three terms are purely quantitative—and relative, that is, a language may be “analytic” from one standpoint, “synthetic” from another. I believe the terms are more useful in defining certain drifts than as absolute counters. It is often illuminating to point out that a language has been becoming more and more analytic in the course of its history or that it shows signs of having crystallized from a simple analytic base into a highly synthetic form.[103]

There is another very useful set of distinctions to make, but these should not be applied exclusively, or our classification will again be superficial. I’m talking about the concepts of “analytic,” “synthetic,” and “polysynthetic.” The terms are self-explanatory. An analytic language either doesn’t combine concepts into single words at all (like Chinese) or does so in a straightforward way (like English or French). In an analytic language, the sentence is always the most important part, while the individual word is less significant. In a synthetic language (like Latin, Arabic, or Finnish), concepts are packed more densely, and the words are more complex, but there’s a general tendency to keep the meaning of each word fairly limited. A polysynthetic language, as the name suggests, is even more synthetic. The breakdown of words is extensive. Concepts that we wouldn’t normally consider subordinate are represented by derivational affixes or “symbolic” changes in the root, while more abstract ideas, including grammatical relationships, can also be expressed through the word. A polysynthetic language doesn’t illustrate any principles that aren’t already shown in more familiar synthetic languages. It relates to them similarly to how a synthetic language relates to our own analytic English. The three terms are purely quantitative and relative, meaning a language may be “analytic” from one perspective and “synthetic” from another. I believe these terms are more useful for defining certain trends rather than being absolute categories. It can be quite revealing to note that a language has been gradually becoming more analytic over time or that it shows signs of having evolved from a simple analytic basis into a highly synthetic form.

We now come to the difference between an “inflective” and an “agglutinative” language. As I have already remarked, the distinction is a useful, even a necessary, one, but it has been generally obscured by a number of irrelevancies and by the unavailing effort to make the terms cover all languages that are not, like Chinese, of a definitely isolating cast. The meaning that we had best assign to the term “inflective” can be gained by considering very briefly what are some of the basic features of Latin and Greek that have been looked upon as peculiar to the inflective languages. First of all, they are synthetic rather than analytic. This does not help us much. Relatively to many another language that resembles them in broad structural respects, Latin and Greek are not notably synthetic; on the other hand, their modern descendants, Italian and Modern Greek, while far more analytic[104] than they, have not departed so widely in structural outlines as to warrant their being put in a distinct major group. An inflective language, we must insist, may be analytic, synthetic, or polysynthetic.

We now come to the difference between an “inflective” language and an “agglutinative” language. As I mentioned earlier, this distinction is useful, even necessary, but it has been generally overshadowed by a number of irrelevant points and the futile attempt to make the terms apply to all languages that are not, like Chinese, distinctly isolating. The best way to understand what we mean by “inflective” is to briefly consider some of the key features of Latin and Greek that are seen as unique to inflective languages. First of all, they are synthetic rather than analytic. This doesn’t provide much clarity. Compared to many other languages that share broad structural similarities, Latin and Greek are not particularly synthetic; on the other hand, their modern descendants, Italian and Modern Greek, while much more analytic than they are, haven't strayed so far in structural terms as to be placed in a completely different major group. An inflective language, we must insist, can be analytic, synthetic, or polysynthetic.

Latin and Greek are mainly affixing in their method, with the emphasis heavily on suffixing. The agglutinative languages are just as typically affixing as they, some among them favoring prefixes, others running to the use of suffixes. Affixing alone does not define inflection. Possibly everything depends on just what kind of affixing we have to deal with. If we compare our English words farmer and goodness with such words as height and depth, we cannot fail to be struck by a notable difference in the affixing technique of the two sets. The -er and -ness are affixed quite mechanically to radical elements which are at the same time independent words (farm, good). They are in no sense independently significant elements, but they convey their meaning (agentive, abstract quality) with unfailing directness. Their use is simple and regular and we should have no difficulty in appending them to any verb or to any adjective, however recent in origin. From a verb to camouflage we may form the noun camouflager “one who camouflages,” from an adjective jazzy proceeds with perfect case the noun jazziness. It is different with height and depth. Functionally they are related to high and deep precisely as is goodness to good, but the degree of coalescence between radical element and affix is greater. Radical element and affix, while measurably distinct, cannot be torn apart quite so readily as could the good and -ness of goodness. The -t of height is not the typical form of the affix (compare strength, length, filth, breadth, youth), while dep- is not identical with deep. We may designate the two types of affixing as “fusing” and “juxtaposing.” The juxtaposing technique we may call an “agglutinative” one, if we like.

Latin and Greek mainly use affixing, especially suffixes. Agglutinative languages also typically use affixing, with some favoring prefixes and others leaning more toward suffixes. Affixing alone doesn’t define inflection. It largely depends on the type of affixing involved. When we compare the English words farmer and goodness with words like height and depth, we notice a significant difference in how the affixing works in these two sets. The -er and -ness are added to root words that are also independent words (farm, good). They don’t carry meaning on their own, but they convey their meanings (agentive, abstract quality) very clearly. Their usage is straightforward and regular, so we can easily attach them to any verb or adjective, no matter how recent. From the verb to camouflage, we can create the noun camouflager meaning “one who camouflages,” and from the adjective jazzy, we can perfectly form the noun jazziness. In contrast, height and depth are functionally related to high and deep just like goodness is to good, but the connection between the root and the affix is stronger. The root and affix, while still distinct, are not as easily separated as good and -ness in goodness. The -t in height isn’t the typical form of that affix (compare strength, length, filth, breadth, youth), and dep- isn’t the same as deep. We can classify these two types of affixing as “fusing” and “juxtaposing.” We might call the juxtaposing method an “agglutinative” one if we wish.

Is the fusing technique thereby set off as the essence of inflection? I am afraid that we have not yet reached our goal. If our language were crammed full of coalescences of the type of depth, but if, on the other hand, it used the plural independently of verb concord (e.g., the books falls like the book falls, or the book fall like the books fall), the personal endings independently of tense (e.g., the book fells like the book falls, or the book fall like the book fell), and the pronouns independently of case (e.g., I see he like he sees me, or him see the man like the man sees him), we should hesitate to describe it as inflective. The mere fact of fusion does not seem to satisfy us as a clear indication of the inflective process. There are, indeed, a large number of languages that fuse radical element and affix in as complete and intricate a fashion as one could hope to find anywhere without thereby giving signs of that particular kind of formalism that marks off such languages as Latin and Greek as inflective.

Is the fusing technique considered the core of inflection? I'm afraid we haven't achieved our goal yet. If our language were packed with combinations like depth, but also used the plural without matching the verb (for example, the books falls like the book falls, or the book fall like the books fall), personal endings regardless of tense (like the book fells instead of the book falls, or the book fall instead of the book fell), and pronouns without regard to case (like I see he instead of he sees me, or him see the man like the man sees him), we should be careful to label it as inflective. The simple presence of fusion doesn't seem to clearly indicate the inflective process. In fact, many languages fuse root elements and affixes in as complete and intricate a way as one could hope to find anywhere, without showing the specific formalism that characterizes languages like Latin and Greek as inflective.

What is true of fusion is equally true of the “symbolic” processes.[105] There are linguists that speak of alternations like drink and drank as though they represented the high-water mark of inflection, a kind of spiritualized essence of pure inflective form. In such Greek forms, nevertheless, as pepomph-a “I have sent,” as contrasted with pemp-o “I send,” with its trebly symbolic change of the radical element (reduplicating pe-, change of e to o, change of p to ph), it is rather the peculiar alternation of the first person singular -a of the perfect with the -o of the present that gives them their inflective cast. Nothing could be more erroneous than to imagine that symbolic changes of the radical element, even for the expression of such abstract concepts as those of number and tense, is always associated with the syntactic peculiarities of an inflective language. If by an “agglutinative” language we mean one that affixes according to the juxtaposing technique, then we can only say that there are hundreds of fusing and symbolic languages—non-agglutinative by definition—that are, for all that, quite alien in spirit to the inflective type of Latin and Greek. We can call such languages inflective, if we like, but we must then be prepared to revise radically our notion of inflective form.

What is true for fusion is also true for the “symbolic” processes.[105] There are linguists who talk about alternations like drink and drank as if they represent the ultimate example of inflection, a sort of pure essence of inflective form. However, in certain Greek forms, such as pepomph-a “I have sent,” compared to pemp-o “I send,” with its three symbolic changes to the radical element (the reduplication of pe-, changing e to o, and changing p to ph), it is actually the unique alternation of the first-person singular -a of the perfect with the -o of the present that gives them their inflective quality. It is completely mistaken to think that symbolic changes to the radical element, even for abstract concepts like number and tense, are always linked to the syntactic features of an inflective language. If we define an “agglutinative” language as one that uses a juxtaposing technique to attach elements, then we can only say that there are hundreds of fusing and symbolic languages—non-agglutinative by definition—that, nonetheless, are quite different in nature from the inflective types like Latin and Greek. We can refer to these languages as inflective if we want, but we must be ready to completely redefine our idea of inflective form.

It is necessary to understand that fusion of the radical element and the affix may be taken in a broader psychological sense than I have yet indicated. If every noun plural in English were of the type of book: books, if there were not such conflicting patterns as deer: deer, ox: oxen, goose: geese to complicate the general form picture of plurality, there is little doubt that the fusion of the elements book and -s into the unified word books would be felt as a little less complete than it actually is. One reasons, or feels, unconsciously about the matter somewhat as follows:—If the form pattern represented by the word books is identical, as far as use is concerned, with that of the word oxen, the pluralizing elements -s and -en cannot have quite so definite, quite so autonomous, a value as we might at first be inclined to suppose. They are plural elements only in so far as plurality is predicated of certain selected concepts. The words books and oxen are therefore a little other than mechanical combinations of the symbol of a thing (book, ox) and a clear symbol of plurality. There is a slight psychological uncertainty or haze about the juncture in book-s and ox-en. A little of the force of -s and -en is anticipated by, or appropriated by, the words book and ox themselves, just as the conceptual force of -th in dep-th is appreciably weaker than that of -ness in good-ness in spite of the functional parallelism between depth and goodness. Where there is uncertainty about the juncture, where the affixed element cannot rightly claim to possess its full share of significance, the unity of the complete word is more strongly emphasized. The mind must rest on something. If it cannot linger on the constituent elements, it hastens all the more eagerly to the acceptance of the word as a whole. A word like goodness illustrates “agglutination,” books “regular fusion,” depth “irregular fusion,” geese “symbolic fusion” or “symbolism.”[106]

It’s important to realize that the blending of the root and the suffix can be understood in a broader psychological way than I’ve mentioned so far. If every plural noun in English were like book: books, and there weren't any differing forms like deer: deer, ox: oxen, and goose: geese to complicate the general notion of plurality, it’s quite likely that the combination of the elements book and -s into the unified word books would feel a bit less complete than it actually is. One thinks, or feels, unconsciously about it somewhat like this:—If the pattern represented by the word books is the same in usage as that of the word oxen, then the plural elements -s and -en might not have as clear or independent a significance as we might initially think. They are plural elements only insofar as plurality is assigned to certain chosen concepts. The words books and oxen are, therefore, a bit more than just mechanical combinations of the sign of a thing (book, ox) and a clear representation of plurality. There’s a slight psychological uncertainty or ambiguity around the junction in book-s and ox-en. A bit of the force of -s and -en is anticipated by, or taken on by, the words book and ox themselves, just like the conceptual force of -th in dep-th is noticeably weaker than that of -ness in good-ness, despite the functional similarities between depth and goodness. Where there’s uncertainty about the junction, where the affixed element can’t really claim its full significance, the unity of the complete word is emphasized more strongly. The mind has to focus on something. If it can’t dwell on the individual elements, it quickly moves to accept the word as a whole. A word like goodness shows “agglutination,” books shows “regular fusion,” depth shows “irregular fusion,” and geese shows “symbolic fusion” or “symbolism.”[106]

The psychological distinctness of the affixed elements in an agglutinative term may be even more marked than in the -ness of goodness. To be strictly accurate, the significance of the -ness is not quite as inherently determined, as autonomous, as it might be. It is at the mercy of the preceding radical element to this extent, that it requires to be preceded by a particular type of such element, an adjective. Its own power is thus, in a manner, checked in advance. The fusion here, however, is so vague and elementary, so much a matter of course in the great majority of all cases of affixing, that it is natural to overlook its reality and to emphasize rather the juxtaposing or agglutinative nature of the affixing process. If the -ness could be affixed as an abstractive element to each and every type of radical element, if we could say fightness (“the act or quality of fighting”) or waterness (“the quality or state of water”) or awayness (“the state of being away”) as we can say goodness (“the state of being good”), we should have moved appreciably nearer the agglutinative pole. A language that runs to synthesis of this loose-jointed sort may be looked upon as an example of the ideal agglutinative type, particularly if the concepts expressed by the agglutinated elements are relational or, at the least, belong to the abstracter class of derivational ideas.

The psychological difference of the attached elements in an agglutinative term can be even more pronounced than in the -ness of goodness. To be precise, the meaning of -ness isn’t entirely determined or independent as it could be. It depends on the preceding root element to the point that it needs to be preceded by a specific type of element, an adjective. Its own power is somewhat limited from the start. However, the combination here is so vague and basic, so typical in the majority of cases involving affixing, that it’s easy to overlook its reality and focus more on the juxtaposing or agglutinative nature of the affixing process. If -ness could be added as an abstract element to all types of root elements, if we could say fightness (“the act or quality of fighting”), waterness (“the quality or state of water”), or awayness (“the state of being away”) just like we can say goodness (“the state of being good”), we would be significantly closer to the agglutinative end of the spectrum. A language that tends towards this loose and flexible type of synthesis can be seen as an example of the ideal agglutinative type, especially if the concepts expressed by the agglutinated elements are relational or at least belong to a more abstract category of derivational ideas.

Instructive forms may be cited from Nootka. We shall return to our “fire in the house.”[107] The Nootka word inikw-ihl “fire in the house” is not as definitely formalized a word as its translation, suggests. The radical element inikw- “fire” is really as much of a verbal as of a nominal term; it may be rendered now by “fire,” now by “burn,” according to the syntactic exigencies of the sentence. The derivational element -ihl “in the house” does not mitigate this vagueness or generality; inikw-ihl is still “fire in the house” or “burn in the house.” It may be definitely nominalized or verbalized by the affixing of elements that are exclusively nominal or verbal in force. For example, inikw-ihl-’i, with its suffixed article, is a clear-cut nominal form: “the burning in the house, the fire in the house”; inikw-ihl-ma, with its indicative suffix, is just as clearly verbal: “it burns in the house.” How weak must be the degree of fusion between “fire in the house” and the nominalizing or verbalizing suffix is apparent from the fact that the formally indifferent inikwihl is not an abstraction gained by analysis but a full-fledged word, ready for use in the sentence. The nominalizing -’i and the indicative -ma are not fused form-affixes, they are simply additions of formal import. But we can continue to hold the verbal or nominal nature of inikwihl in abeyance long before we reach the -’i or -ma. We can pluralize it: inikw-ihl-’minih; it is still either “fires in the house” or “burn plurally in the house.” We can diminutivize this plural: inikw-ihl-’minih-’is, “little fires in the house” or “burn plurally and slightly in the house.” What if we add the preterit tense suffix -it? Is not inikw-ihl-’minih-’is-it necessarily a verb: “several small fires were burning in the house”? It is not. It may still be nominalized; inikwihl’minih’isit-’i means “the former small fires in the house, the little fires that were once burning in the house.” It is not an unambiguous verb until it is given a form that excludes every other possibility, as in the indicative inikwihl-minih’isit-a “several small fires were burning in the house.” We recognize at once that the elements -ihl, -’minih, -’is, and -it, quite aside from the relatively concrete or abstract nature of their content and aside, further, from the degree of their outer (phonetic) cohesion with the elements that precede them, have a psychological independence that our own affixes never have. They are typically agglutinated elements, though they have no greater external independence, are no more capable of living apart from the radical element to which they are suffixed, than the -ness and goodness or the -s of books. It does not follow that an agglutinative language may not make use of the principle of fusion, both external and psychological, or even of symbolism to a considerable extent. It is a question of tendency. Is the formative slant clearly towards the agglutinative method? Then the language is “agglutinative.” As such, it may be prefixing or suffixing, analytic, synthetic, or polysynthetic.

Instructive examples can be drawn from Nootka. We’ll return to our “fire in the house.”[107] The Nootka term inikw-ihl “fire in the house” isn’t as strictly defined as its translation suggests. The root inikw- “fire” functions as both a verb and a noun; it can mean “fire” or “burn,” depending on how the sentence is structured. The additional component -ihl “in the house” doesn’t clear up this ambiguity; inikw-ihl can still mean “fire in the house” or “burn in the house.” It can be clearly turned into a noun or a verb by adding elements that are purely nominal or verbal. For instance, inikw-ihl-’i, with its added article, is a distinct noun form: “the burning in the house, the fire in the house”; inikw-ihl-ma, with its indicative suffix, is clearly a verb: “it burns in the house.” The weak connection between “fire in the house” and the nominalizing or verbalizing suffix is evident since the neutral inikwihl is a fully functional word, not an abstract concept gained through analysis. The nominalizing -’i and the indicative -ma aren’t fused form-affixes; they are simply formal additions. However, we can keep the verbal or nominal quality of inikwihl in mind even before introducing -’i or -ma. We can pluralize it: inikw-ihl-’minih; it still represents either “fires in the house” or “burn plurally in the house.” We can also make this plural diminutive: inikw-ihl-’minih-’is, meaning “little fires in the house” or “burn slightly in the house.” What if we add the past tense suffix -it? Does inikw-ihl-’minih-’is-it not have to be a verb: “several small fires were burning in the house”? It isn’t. It can still be a noun; inikwihl’minih’isit-’i translates to “the former small fires in the house, the little fires that were once burning in the house.” It doesn’t become an unambiguous verb until it is given a form that eliminates all other possibilities, such as the indicative inikwihl-minih’isit-a “several small fires were burning in the house.” We can see right away that the elements -ihl, -’minih, -’is, and -it, regardless of their concrete or abstract nature and their outer (phonetic) connection to the preceding elements, maintain a psychological independence that our own suffixes rarely have. They are typically agglutinated elements, though they share no greater external independence and are no more able to exist separately from the root element they are attached to than -ness or goodness or the -s in books. This doesn’t mean that an agglutinative language can’t utilize both external and psychological fusion or even significant symbolism. It’s a matter of tendency. If the structure leans clearly toward the agglutinative style, then the language is labeled “agglutinative.” In this capacity, it may employ prefixes or suffixes, be analytic, synthetic, or polysynthetic.

To return to inflection. An inflective language like Latin or Greek uses the method of fusion, and this fusion has an inner psychological as well as an outer phonetic meaning. But it is not enough that the fusion operate merely in the sphere of derivational concepts (group II),[108] it must involve the syntactic relations, which may either be expressed in unalloyed form (group IV) or, as in Latin and Greek, as “concrete relational concepts” (group III).[109] As far as Latin and Greek are concerned, their inflection consists essentially of the fusing of elements that express logically impure relational concepts with radical elements and with elements expressing derivational concepts. Both fusion as a general method and the expression of relational concepts in the word are necessary to the notion of “inflection.”

To go back to inflection. An inflected language like Latin or Greek uses the method of fusion, which has both an internal psychological meaning and an external phonetic meaning. However, it's not enough for the fusion to only work within the realm of derivational concepts (group II),[108] it must also involve syntactic relations, which can either be expressed in their pure form (group IV) or, like in Latin and Greek, as “concrete relational concepts” (group III).[109] When it comes to Latin and Greek , their inflection fundamentally consists of fusing elements that express logically mixed relational concepts with radical elements and with elements that express derivational concepts. Both the general method of fusion and the expression of relational concepts in words are essential to the idea of “inflection.”

But to have thus defined inflection is to doubt the value of the term as descriptive of a major class. Why emphasize both a technique and a particular content at one and the same time? Surely we should be clear in our minds as to whether we set more store by one or the other. “Fusional” and “symbolic” contrast with “agglutinative,” which is not on a par with “inflective” at all. What are we to do with the fusional and symbolic languages that do not express relational concepts in the word but leave them to the sentence? And are we not to distinguish between agglutinative languages that express these same concepts in the word—in so far inflective-like—and those that do not? We dismissed the scale: analytic, synthetic, polysynthetic, as too merely quantitative for our purpose. Isolating, affixing, symbolic—this also seemed insufficient for the reason that it laid too much stress on technical externals. Isolating, agglutinative, fusional, and symbolic is a preferable scheme, but still skirts the external. We shall do best, it seems to me, to hold to “inflective” as a valuable suggestion for a broader and more consistently developed scheme, as a hint for a classification based on the nature of the concepts expressed by the language. The other two classifications, the first based on degree of synthesis, the second on degree of fusion, may be retained as intercrossing schemes that give us the opportunity to subdivide our main conceptual types.

But defining inflection this way raises doubts about the usefulness of the term as a description of a major category. Why focus on both a technique and a specific content at the same time? We need to clarify whether we prioritize one over the other. “Fusional” and “symbolic” contrast with “agglutinative,” which isn’t on the same level as “inflective” at all. What do we do with fusional and symbolic languages that don’t express relational concepts in the word but leave them to the sentence? And shouldn’t we distinguish between agglutinative languages that represent these same concepts in the word—somewhat inflective—and those that don’t? We dismissed the scale: analytic, synthetic, polysynthetic, as too focused on quantitative measures for our purpose. Isolating, affixing, symbolic—this also didn’t seem adequate because it emphasized technical externals too much. Isolating, agglutinative, fusional, and symbolic is a more suitable scheme, yet still avoids the external aspect. It seems best to me to stick with “inflective” as a useful suggestion for a more comprehensive and well-developed scheme, hinting at a classification based on the nature of the concepts expressed by the language. The other two classifications, the first based on the degree of synthesis, the second on the degree of fusion, can be kept as intersecting schemes that allow us to further break down our main conceptual types.

It is well to recall that all languages must needs express radical concepts (group I) and relational ideas (group IV). Of the two other large groups of concepts—derivational (group II) and mixed relational (group III)—both may be absent, both present, or only one present. This gives us at once a simple, incisive, and absolutely inclusive method of classifying all known languages. They are:

It’s important to remember that all languages need to express fundamental concepts (group I) and relational ideas (group IV). As for the other two large groups of concepts—derivational (group II) and mixed relational (group III)—both can be missing, both can be present, or just one can be present. This gives us a straightforward, clear, and completely inclusive way to classify all known languages. They are:

Such as express only concepts of groups I and IV; in other words, languages that keep the syntactic relations pure and that do not possess the power to modify the significance of their radical elements by means of affixes or internal changes.[110] We may call these Pure-relational non-deriving languages or, more tersely, Simple Pure-relational languages. These are the languages that cut most to the bone of linguistic expression.
Such as express concepts of groups I, II, and IV; in other words, languages that keep the syntactic relations pure and that also possess the power to modify the significance of their radical elements by means of affixes or internal changes. These are the Pure-relational deriving languages or Complex Pure-relational languages.
Such as express concepts of groups I and III;[111] in other words, languages in which the syntactic relations are expressed in necessary connection with concepts that are not utterly devoid of concrete significance but that do not, apart from such mixture, possess the power to modify the significance of their radical elements by means of affixes or internal changes.[112] These are the Mixed-relational non-deriving languages or Simple Mixed-relational languages.
Such as express concepts of groups I, II, and III; in other words, languages in which the syntactic relations are expressed in mixed form, as in C, and that also possess the power to modify the significance of their radical elements by means of affixes or internal changes. These are the Mixed-relational deriving languages or Complex Mixed-relational languages. Here belong the “inflective” languages that we are most familiar with as well as a great many “agglutinative” languages, some “polysynthetic,” others merely synthetic.

This conceptual classification of languages, I must repeat, does not attempt to take account of the technical externals of language. It answers, in effect, two fundamental questions concerning the translation of concepts into linguistic symbols. Does the language, in the first place, keep its radical concepts pure or does it build up its concrete ideas by an aggregation of inseparable elements (types A and C versus types B and D)? And, in the second place, does it keep the basic relational concepts, such as are absolutely unavoidable in the ordering of a proposition, free of an admixture of the concrete or not (types A and B versus types C and D)? The second question, it seems to me, is the more fundamental of the two. We can therefore simplify our classification and present it in the following form:

This classification of languages, I want to emphasize, doesn’t consider the technical aspects of language. It essentially addresses two key questions about translating concepts into language symbols. First, does the language maintain its core concepts as pure, or does it create its concrete ideas by combining inseparable elements (types A and C compared to types B and D)? Second, does it keep the basic relational concepts, which are essential for structuring a proposition, free from mixing in concrete elements (types A and B compared to types C and D)? I believe the second question is the more fundamental one. Therefore, we can simplify our classification and present it in the following way:

I. Pure-relational Languages	{	A.	Simple
I. Pure-relational Languages	{	B.	Complex
II. Mixed-relational Languages	{	C.	Simple
II. Mixed-relational Languages	{	D.	Complex

The classification is too sweeping and too broad for an easy, descriptive survey of the many varieties of human speech. It needs to be amplified. Each of the types A, B, C, D may be subdivided into an agglutinative, a fusional, and a symbolic sub-type, according to the prevailing method of modification of the radical element. In type A we distinguish in addition an isolating sub-type, characterized by the absence of all affixes and modifications of the radical element. In the isolating languages the syntactic relations are expressed by the position of the words in the sentence. This is also true of many languages of type B, the terms “agglutinative,” “fusional,” and “symbolic” applying in their case merely to the treatment of the derivational, not the relational, concepts. Such languages could be termed “agglutinative-isolating,” “fusional-isolating” and “symbolic-isolating.”

The classification is too broad and sweeping for an easy, descriptive overview of the many types of human speech. It needs to be expanded. Each of the types A, B, C, and D can be further divided into agglutinative, fusional, and symbolic sub-types, based on how the main element is modified. For type A, we also recognize an isolating sub-type, which is marked by the lack of affixes and modifications of the main element. In isolating languages, syntactic relationships are shown by the position of words in the sentence. This is also true for many languages of type B, where the terms “agglutinative,” “fusional,” and “symbolic” refer only to how derivational concepts are treated, not relational ones. Such languages could be called “agglutinative-isolating,” “fusional-isolating,” and “symbolic-isolating.”

This brings up the important general consideration that the method of handling one group of concepts need not in the least be identical with that used for another. Compound terms could be used to indicate this difference, if desired, the first element of the compound referring to the treatment of the concepts of group II, the second to that of the concepts of groups III and IV. An “agglutinative” language would normally be taken to mean one that agglutinates all of its affixed elements or that does so to a preponderating extent. In an “agglutinative-fusional” language the derivational elements are agglutinated, perhaps in the form of prefixes, while the relational elements (pure or mixed) are fused with the radical element, possibly as another set of prefixes following the first set or in the form of suffixes or as part prefixes and part suffixes. By a “fusional-agglutinative” language we would understand one that fuses its derivational elements but allows a greater independence to those that indicate relations. All these and similar distinctions are not merely theoretical possibilities, they can be abundantly illustrated from the descriptive facts of linguistic morphology. Further, should it prove desirable to insist on the degree of elaboration of the word, the terms “analytic,” “synthetic,” and “polysynthetic” can be added as descriptive terms. It goes without saying that languages of type A are necessarily analytic and that languages of type C also are prevailingly analytic and are not likely to develop beyond the synthetic stage.

This raises the important general idea that the way we handle one set of concepts doesn’t have to be the same as the way we handle another. We could use compound terms to show this difference if we want, with the first part of the compound referring to the concepts of group II and the second to those of groups III and IV. An “agglutinative” language typically means one that combines all of its affixed elements or does so to a large extent. In an “agglutinative-fusional” language, the derivational elements are combined, often as prefixes, while the relational elements (either pure or mixed) are fused with the base element, possibly as another set of prefixes after the first set, as suffixes, or as a mix of prefixes and suffixes. By a “fusional-agglutinative” language, we mean one that fuses its derivational elements but allows more independence to those that indicate relationships. All these distinctions and similar ones are not just theoretical possibilities; they can be well illustrated through the descriptive facts of linguistic morphology. Furthermore, if it’s important to emphasize the level of detail in the word construction, the terms “analytic,” “synthetic,” and “polysynthetic” can be used as descriptive labels. It’s obvious that languages of type A are necessarily analytic and that languages of type C are mostly analytic too and are unlikely to progress beyond the synthetic stage.

But we must not make too much of terminology. Much depends on the relative emphasis laid on this or that feature or point of view. The method of classifying languages here developed has this great advantage, that it can be refined or simplified according to the needs of a particular discussion. The degree of synthesis may be entirely ignored; “fusion” and “symbolism” may often be combined with advantage under the head of “fusion”; even the difference between agglutination and fusion may, if desired, be set aside as either too difficult to draw or as irrelevant to the issue. Languages, after all, are exceedingly complex historical structures. It is of less importance to put each language in a neat pigeon-hole than to have evolved a flexible method which enables us to place it, from two or three independent standpoints, relatively to another language. All this is not to deny that certain linguistic types are more stable and frequently represented than others that are just as possible from a theoretical standpoint. But we are too ill-informed as yet of the structural spirit of great numbers of languages to have the right to frame a classification that is other than flexible and experimental.

But we shouldn’t overthink the terminology. Much depends on how much emphasis is put on certain features or perspectives. The way we classify languages here has the significant advantage of being able to be refined or simplified based on the needs of a particular discussion. The level of synthesis can be completely ignored; “fusion” and “symbolism” can often be effectively grouped under “fusion”; even the distinction between agglutination and fusion can be overlooked if it’s seen as too complicated to draw or not relevant to the topic. Languages are, after all, incredibly complex historical constructs. It’s less important to categorize each language neatly than to develop a flexible method that allows us to compare it, from two or three independent perspectives, to another language. None of this denies that some linguistic types are more stable and commonly represented than others that are theoretically possible. However, we still don’t have enough understanding of the structural nature of many languages to create a classification that isn’t flexible and experimental.

The reader will gain a somewhat livelier idea of the possibilities of linguistic morphology by glancing down the subjoined analytical table of selected types. The columns II, III, IV refer to the groups of concepts so numbered in the preceding chapter. The letters a, b, c, d refer respectively to the processes of isolation (position in the sentence), agglutination, fusion, and symbolism. Where more than one technique is employed, they are put in the order of their importance.[113]

The reader will get a more dynamic sense of the possibilities of linguistic morphology by looking at the analytical table of selected types below. Columns II, III, and IV correspond to the groups of concepts numbered in the previous chapter. The letters a, b, c, d represent the processes of isolation (position in the sentence), agglutination, fusion, and symbolism, respectively. When more than one technique is used, they are listed in order of significance.[113]

Fundamental Type	I	II	III	Technique	Synthesis	Examples
A (Simple Pure-relational)	—	—	a	Isolating	Analytic	Chinese; Annamite
	(d)	—	a, b	Isolating (weakly agglutinative)	Analytic	Ewe (Guinea Coast)
	(b)	—	a, b, c	Agglutinative (mildly agglutinative-fusional)	Analytic	Modern Tibetan
B (Complex Pure-relational)	b, (d)	—	a	Agglutinative-isolating	Analytic	Polynesian
	b	—	a, (b)	Agglutinative-isolating	Polysynthetic	Haida
	c	—	a	Fusional-isolating	Analytic	Cambodgian
	b	—	b	Agglutinative	Synthetic	Turkish
	b, d	(b)	b	Agglutinative (symbolic tinge)	Polysynthetic	Yana (N. California)
	c, d, (b)	—	a, b	Fusional-agglutinative (symbolic tinge)	Synthetic (mildly)	Classical Tibetan
	b	—	c	Agglutinative-fusional	Synthetic (mildly polysynthetic)	Sioux
	c	—	c	Fusional	Synthetic	Salinan (S.W. California)
	d, c	(d)	d, c, a	Symbolic	Analytic	Shilluk (Upper Nile)
C (Simple Mixed-relational)	(b)	b	—	Agglutinative	Synthetic	Bantu
C (Simple Mixed-relational)	(c)	c, (d)	a	Fusional	Analytic (mildly synthetic)	French[114]
D (Complex Mixed-relational)	b, c, d	b	b	Agglutinative (symbolic tinge)	Polysynthetic	Nootka (Vancouver Island)[115]
	c, (d)	b	—	Fusional-agglutinative	Polysynthetic (mildly)	Chinook (lower Columbia R.)
	c, (d)	c, (d), (b)	—	Fusional	Polysynthetic	Algonkin
	c	c, d	a	Fusional	Analytic	English
	c, d	c, d	—	Fusional (symbolic tinge)	Synthetic	Latin, Greek, Sanskrit
	c, b, d	c, d	(a)	Fusional (strongly symbolic)	Synthetic	Takelma (S.W. Oregon)
	d, c	c, d	(a)	Symbolic-fusional	Synthetic	Semitic (Arabic, Hebrew)

I need hardly point out that these examples are far from exhausting the possibilities of linguistic structure. Nor that the fact that two languages are similarly classified does not necessarily mean that they present a great similarity on the surface. We are here concerned with the most fundamental and generalized features of the spirit, the technique, and the degree of elaboration of a given language. Nevertheless, in numerous instances we may observe this highly suggestive and remarkable fact, that languages that fall into the same class have a way of paralleling each other in many details or in structural features not envisaged by the scheme of classification. Thus, a most interesting parallel could be drawn on structural lines between Takelma and Greek,[116] languages that are as geographically remote from each other and as unconnected in a historical sense as two languages selected at random can well be. Their similarity goes beyond the generalized facts registered in the table. It would almost seem that linguistic features that are easily thinkable apart from each other, that seem to have no necessary connection in theory, have nevertheless a tendency to cluster or to follow together in the wake of some deep, controlling impulse to form that dominates their drift. If, therefore, we can only be sure of the intuitive similarity of two given languages, of their possession of the same submerged form-feeling, we need not be too much surprised to find that they seek and avoid certain linguistic developments in common. We are at present very far from able to define just what these fundamental form intuitions are. We can only feel them rather vaguely at best and must content ourselves for the most part with noting their symptoms. These symptoms are being garnered in our descriptive and historical grammars of diverse languages. Some day, it may be, we shall be able to read from them the great underlying ground-plans.

I hardly need to point out that these examples do not cover all the possibilities of linguistic structure. Also, just because two languages are in the same category doesn’t mean they’re very similar on the surface. We're focused on the most basic and general features of the spirit, technique, and complexity of a given language. Still, in many cases, we can see the intriguing and notable fact that languages in the same category often mirror each other in various details or structural features not considered in the classification scheme. For instance, an interesting structural parallel can be drawn between Takelma and Greek,[116] languages that are as far apart geographically and historically unconnected as any two randomly chosen languages. Their similarity goes beyond the generalized facts listed in the table. It almost seems that linguistic features that can easily be thought of separately, which theoretically don’t seem connected, actually have a tendency to cluster or align due to some deep, guiding impulse that shapes their direction. Therefore, if we can recognize the intuitive similarity of two languages, and that they share the same underlying form-feeling, we shouldn’t be too surprised to find that they tend to pursue and avoid certain linguistic developments in similar ways. We are currently far from being able to define exactly what these fundamental form intuitions are. At best, we can only sense them vaguely and mostly need to settle for observing their signs. These signs are being collected in our descriptive and historical grammars of various languages. One day, perhaps, we will be able to interpret them and uncover the great underlying frameworks.

Such a purely technical classification of languages as the current one into “isolating,” “agglutinative,” and “inflective” (read “fusional”) cannot claim to have great value as an entering wedge into the discovery of the intuitional forms of language. I do not know whether the suggested classification into four conceptual groups is likely to drive deeper or not. My own feeling is that it does, but classifications, neat constructions of the speculative mind, are slippery things. They have to be tested at every possible opportunity before they have the right to cry for acceptance. Meanwhile we may take some encouragement from the application of a rather curious, yet simple, historical test. Languages are in constant process of change, but it is only reasonable to suppose that they tend to preserve longest what is most fundamental in their structure. Now if we take great groups of genetically related languages,[117] we find that as we pass from one to another or trace the course of their development we frequently encounter a gradual change of morphological type. This is not surprising, for there is no reason why a language should remain permanently true to its original form. It is interesting, however, to note that of the three intercrossing classifications represented in our table (conceptual type, technique, and degree of synthesis), it is the degree of synthesis that seems to change most readily, that the technique is modifiable but far less readily so, and that the conceptual type tends to persist the longest of all.

A purely technical classification of languages, like the current one that divides them into “isolating,” “agglutinative,” and “inflective” (or “fusional”), doesn’t hold much value as a means to discover the intuitive forms of language. I’m not sure if the proposed classification into four conceptual groups will provide deeper insights or not. Personally, I think it might, but classifications, neat constructs of speculative thought, can be tricky. They need to be tested thoroughly before they deserve acceptance. In the meantime, we can find some encouragement in a curious yet straightforward historical test. Languages are constantly changing, but it makes sense to think they tend to preserve what is most fundamental in their structure for the longest time. If we look at large groups of genetically related languages,[117] we see that as we move from one to another or trace their development, we often notice a gradual shift in morphological type. This isn’t surprising because there’s no reason for a language to stay permanently true to its original form. However, it’s interesting to observe that among the three interrelated classifications in our table (conceptual type, technique, and degree of synthesis), the degree of synthesis seems to change the most easily, the technique is changeable but less so, and the conceptual type tends to last the longest.

The illustrative material gathered in the table is far too scanty to serve as a real basis of proof, but it is highly suggestive as far as it goes. The only changes of conceptual type within groups of related languages that are to be gleaned from the table are of B to A (Shilluk as contrasted with Ewe;[118] Classical Tibetan as contrasted with Modern Tibetan and Chinese) and of D to C (French as contrasted with Latin[119]). But types A : B and C : D are respectively related to each other as a simple and a complex form of a still more fundamental type (pure-relational, mixed-relational). Of a passage from a pure-relational to a mixed-relational type or vice versa I can give no convincing examples.

The examples included in the table are too limited to provide solid proof, but they are quite suggestive within their scope. The only conceptual changes between groups of related languages that can be derived from the table are from B to A (Shilluk compared to Ewe; [118] Classical Tibetan compared to Modern Tibetan and Chinese) and from D to C (French compared to Latin[119]). However, types A : B and C : D are related to each other as simpler and more complex forms of a more fundamental type (pure-relational, mixed-relational). I can't provide any convincing examples of a transition from a pure-relational type to a mixed-relational type or vice versa.

The table shows clearly enough how little relative permanence there is in the technical features of language. That highly synthetic languages (Latin; Sanskrit) have frequently broken down into analytic forms (French; Bengali) or that agglutinative languages (Finnish) have in many instances gradually taken on “inflective” features are well-known facts, but the natural inference does not seem to have been often drawn that possibly the contrast between synthetic and analytic or agglutinative and “inflective” (fusional) is not so fundamental after all. Turning to the Indo-Chinese languages, we find that Chinese is as near to being a perfectly isolating language as any example we are likely to find, while Classical Tibetan has not only fusional but strong symbolic features (e.g., g-tong-ba “to give,” past b-tang, future gtang, imperative thong); but both are pure-relational languages. Ewe is either isolating or only barely agglutinative, while Shilluk, though soberly analytic, is one of the most definitely symbolic languages I know; both of these Soudanese languages are pure-relational. The relationship between Polynesian and Cambodgian is remote, though practically certain; while the latter has more markedly fusional features than the former,[120] both conform to the complex pure-relational type. Yana and Salinan are superficially very dissimilar languages. Yana is highly polysynthetic and quite typically agglutinative, Salinan is no more synthetic than and as irregularly and compactly fusional (“inflective”) as Latin; both are pure-relational, Chinook and Takelma, remotely related languages of Oregon, have diverged very far from each other, not only as regards technique and synthesis in general but in almost all the details of their structure; both are complex mixed-relational languages, though in very different ways. Facts such as these seem to lend color to the suspicion that in the contrast of pure-relational and mixed-relational (or concrete-relational) we are confronted by something deeper, more far-reaching, than the contrast of isolating, agglutinative, and fusional.[121]

The table clearly shows how little long-term stability there is in the technical features of language. It’s well-known that highly synthetic languages like Latin and Sanskrit have often evolved into more analytic forms such as French and Bengali, or that agglutinative languages like Finnish have gradually adopted “inflective” features. However, it seems this leads to the natural inference that the difference between synthetic and analytic or agglutinative and “inflective” (fusional) might not be as fundamental as we think. Looking at the Indo-Chinese languages, we see that Chinese is as close to being a perfectly isolating language as we are likely to find, while Classical Tibetan exhibits both fusional and strong symbolic features (e.g., g-tong-ba “to give,” past b-tang, future gtang, imperative thong); both are pure-relational languages. Ewe is either isolating or just barely agglutinative, while Shilluk, though mainly analytic, is one of the most distinctly symbolic languages I know; both of these Soudanese languages are pure-relational. The connection between Polynesian and Cambodian is distant but practically certain; while Cambodian has more distinctly fusional features than Polynesian, [120] both align with the complex pure-relational type. Yana and Salinan may seem very different at first glance. Yana is highly polysynthetic and typically agglutinative, whereas Salinan is no more synthetic and has a degree of irregular and compact fusion (“inflection”) similar to Latin; both are pure-relational. Chinook and Takelma, which are distantly related languages from Oregon, have diverged significantly from each other, not only in overall technique and synthesis but in nearly all structural details; both are complex mixed-relational languages, albeit in very distinct ways. Facts like these suggest that the contrast between pure-relational and mixed-relational (or concrete-relational) reflects something deeper, more far-reaching than the differences between isolating, agglutinative, and fusional languages.[121]

VII

Language as a Historical Product: Drift

Every one knows that language is variable. Two individuals of the same generation and locality, speaking precisely the same dialect and moving in the same social circles, are never absolutely at one in their speech habits. A minute investigation of the speech of each individual would reveal countless differences of detail—in choice of words, in sentence structure, in the relative frequency with which particular forms or combinations of words are used, in the pronunciation of particular vowels and consonants and of combinations of vowels and consonants, in all those features, such as speed, stress, and tone, that give life to spoken language. In a sense they speak slightly divergent dialects of the same language rather than identically the same language.

Everyone knows that language changes. Two people from the same generation and area, speaking the same dialect and moving in the same social circles, never completely match in their speech patterns. A close look at how each person speaks would show countless differences in details—like word choices, sentence structure, how often specific words or phrases are used, the way certain vowels and consonants are pronounced, and all those features, such as speed, stress, and tone, that bring spoken language to life. In a way, they speak slightly different dialects of the same language instead of speaking the exact same language.

There is an important difference, however, between individual and dialectic variations. If we take two closely related dialects, say English as spoken by the “middle classes” of London and English as spoken by the average New Yorker, we observe that, however much the individual speakers in each city differ from each other, the body of Londoners forms a compact, relatively unified group in contrast to the body of New Yorkers. The individual variations are swamped in or absorbed by certain major agreements—say of pronunciation and vocabulary—which stand out very strongly when the language of the group as a whole is contrasted with that of the other group. This means that there is something like an ideal linguistic entity dominating the speech habits of the members of each group, that the sense of almost unlimited freedom which each individual feels in the use of his language is held in leash by a tacitly directing norm. One individual plays on the norm in a way peculiar to himself, the next individual is nearer the dead average in that particular respect in which the first speaker most characteristically departs from it but in turn diverges from the average in a way peculiar to himself, and so on. What keeps the individual’s variations from rising to dialectic importance is not merely the fact that they are in any event of small moment—there are well-marked dialectic variations that are of no greater magnitude than individual variations within a dialect—it is chiefly that they are silently “corrected” or canceled by the consensus of usage. If all the speakers of a given dialect were arranged in order in accordance with the degree of their conformity to average usage, there is little doubt that they would constitute a very finely intergrading series clustered about a well-defined center or norm. The differences between any two neighboring speakers of the series[122] would be negligible for any but the most microscopic linguistic research. The differences between the outer-most members of the series are sure to be considerable, in all likelihood considerable enough to measure up to a true dialectic variation. What prevents us from saying that these untypical individuals speak distinct dialects is that their peculiarities, as a unified whole, are not referable to another norm than the norm of their own series.

There’s a key difference between individual and dialect variations. If we compare two similar dialects, like English spoken by the “middle classes” in London and English spoken by the average New Yorker, we see that, despite the individual differences among speakers in each city, Londoners form a more cohesive, relatively unified group compared to New Yorkers. The individual variations get overshadowed or absorbed by certain major agreements—such as pronunciation and vocabulary—which become very noticeable when we contrast the language of one group with that of the other. This indicates that there’s an ideal linguistic standard influencing the speech habits of each group, while the almost limitless freedom that individual speakers feel in using their language is kept in check by an unspoken norm. One person might creatively express themselves using the norm in a unique way, while the next person may conform more closely to the average and then diverge from it in their own way, and so forth. The reason individual variations don’t become important dialect differences is not just that they are generally minor—there are clear dialect differences that are just as small as individual variations within a dialect—it’s mainly that they are quietly “corrected” by the collective usage. If all the speakers of a specific dialect were lined up based on how closely they conform to the average usage, they would likely form a well-defined series centered around a specific norm. The differences between any two neighboring speakers in this series would be minimal for any but the most detailed linguistic study. However, the differences between the furthest members of the series would be significant enough to be classified as genuine dialect variations. What stops us from claiming that these atypical individuals speak distinct dialects is that their unique features collectively align with the norm of their own series.

If the speech of any member of the series could actually be made to fit into another dialect series,[123] we should have no true barriers between dialects (and languages) at all. We should merely have a continuous series of individual variations extending over the whole range of a historically unified linguistic area, and the cutting up of this large area (in some cases embracing parts of several continents) into distinct dialects and languages would be an essentially arbitrary proceeding with no warrant save that of practical convenience. But such a conception of the nature of dialectic variation does not correspond to the facts as we know them. Isolated individuals may be found who speak a compromise between two dialects of a language, and if their number and importance increases they may even end by creating a new dialectic norm of their own, a dialect in which the extreme peculiarities of the parent dialects are ironed out. In course of time the compromise dialect may absorb the parents, though more frequently these will tend to linger indefinitely as marginal forms of the enlarged dialect area. But such phenomena—and they are common enough in the history of language—are evidently quite secondary. They are closely linked with such social developments as the rise of nationality, the formation of literatures that aim to have more than a local appeal, the movement of rural populations into the cities, and all those other tendencies that break up the intense localism that unsophisticated man has always found natural.

If any member of the dialect series could actually be fit into another dialect series,[123] we wouldn’t have true barriers between dialects (and languages) at all. We would just have a continuous series of individual variations stretching across a historically unified linguistic area, and dividing this large area (in some cases spanning parts of several continents) into distinct dialects and languages would be essentially arbitrary, only justified by practical convenience. However, this idea of dialectal variation doesn’t match the facts as we understand them. You can find isolated individuals who speak a mix of two dialects of a language, and if enough of them exist and are significant, they may end up creating a new dialect norm of their own, a dialect that smooths out the extreme peculiarities of the parent dialects. Over time, the compromise dialect may absorb the parents, though usually, these will tend to persist as marginal forms of the broader dialect area. But such phenomena—and they’re quite common in the history of language—are clearly secondary. They’re closely tied to social developments like the rise of national identity, the creation of literatures that aim for broader appeal, the migration of rural populations to cities, and all the other trends that break up the strong localism that unsophisticated people have always found natural.

The explanation of primary dialectic differences is still to seek. It is evidently not enough to say that if a dialect or language is spoken in two distinct localities or by two distinct social strata it naturally takes on distinctive forms, which in time come to be divergent enough to deserve the name of dialects. This is certainly true as far as it goes. Dialects do belong, in the first instance, to very definitely circumscribed social groups, homogeneous enough to secure the common feeling and purpose needed to create a norm. But the embarrassing question immediately arises, If all the individual variations within a dialect are being constantly leveled out to the dialectic norm, if there is no appreciable tendency for the individual’s peculiarities to initiate a dialectic schism, why should we have dialectic variations at all? Ought not the norm, wherever and whenever threatened, automatically to reassert itself? Ought not the individual variations of each locality, even in the absence of intercourse between them, to cancel out to the same accepted speech average?

We still need to explore the main differences in dialects. It’s clearly not enough to say that if a dialect or language is spoken in two different places or by two different social groups, it will naturally develop unique forms, which over time become different enough to be called dialects. This is certainly true to some extent. Dialects are initially tied to specific social groups that are similar enough to foster a shared understanding and purpose needed to establish a standard. However, this raises a tricky question: if all individual variations within a dialect are constantly being adjusted to fit the dialect's standard, and if there’s no significant tendency for individual traits to create a dialect split, why do dialect variations exist at all? Shouldn’t the standard automatically reaffirm itself whenever it's challenged? Shouldn’t the individual variations from each region, even without interaction between them, average out to the same accepted form of speech?

If individual variations “on a flat” were the only kind of variability in language, I believe we should be at a loss to explain why and how dialects arise, why it is that a linguistic prototype gradually breaks up into a number of mutually unintelligible languages. But language is not merely something that is spread out in space, as it were—a series of reflections in individual minds of one and the same timeless picture. Language moves down time in a current of its own making. It has a drift. If there were no breaking up of a language into dialects, if each language continued as a firm, self-contained unity, it would still be constantly moving away from any assignable norm, developing new features unceasingly and gradually transforming itself into a language so different from its starting point as to be in effect a new language. Now dialects arise not because of the mere fact of individual variation but because two or more groups of individuals have become sufficiently disconnected to drift apart, or independently, instead of together. So long as they keep strictly together, no amount of individual variation would lead to the formation of dialects. In practice, of course, no language can be spread over a vast territory or even over a considerable area without showing dialectic variations, for it is impossible to keep a large population from segregating itself into local groups, the language of each of which tends to drift independently. Under cultural conditions such as apparently prevail to-day, conditions that fight localism at every turn, the tendency to dialectic cleavage is being constantly counteracted and in part “corrected” by the uniformizing factors already referred to. Yet even in so young a country as America the dialectic differences are not inconsiderable.

If individual variations “on a flat” were the only type of variability in language, I think we would struggle to explain why and how dialects develop, and why a linguistic prototype gradually divides into several mutually unintelligible languages. But language isn't just a spread-out concept; it’s not just a series of reflections in individual minds of the same timeless idea. Language flows through time on its own trajectory. It has its own direction. If a language didn’t break into dialects, and each language remained a solid, self-contained unit, it would still be constantly moving away from any fixed standard, continuously developing new features and gradually transforming into a language so different from its origin that it would essentially be a new language. Dialects emerge not simply because of individual variation but because two or more groups of people become disconnected enough to drift apart instead of evolving together. As long as they stay closely connected, even significant individual variation wouldn’t lead to the creation of dialects. In reality, no language can cover a large territory or even a considerable area without showing dialectical variations because it's impossible to prevent a large population from splitting into local groups, each of which tends to evolve independently. Given the cultural conditions that seem to prevail today, conditions that push back against localism at every turn, the tendency towards dialectical separation is constantly being countered and partially “corrected” by the uniform factors mentioned earlier. Yet even in a relatively new country like America, the dialectical differences are quite noticeable.

Under primitive conditions the political groups are small, the tendency to localism exceedingly strong. It is natural, therefore, that the languages of primitive folk or of non-urban populations in general are differentiated into a great number of dialects. There are parts of the globe where almost every village has its own dialect. The life of the geographically limited community is narrow and intense; its speech is correspondingly peculiar to itself. It is exceedingly doubtful if a language will ever be spoken over a wide area without multiplying itself dialectically. No sooner are the old dialects ironed out by compromises or ousted by the spread and influence of the one dialect which is culturally predominant when a new crop of dialects arises to undo the leveling work of the past. This is precisely what happened in Greece, for instance. In classical antiquity there were spoken a large number of local dialects, several of which are represented in the literature. As the cultural supremacy of Athens grew, its dialect, the Attic, spread at the expense of the rest, until, in the so-called Hellenistic period following the Macedonian conquest, the Attic dialect, in the vulgarized form known as the “Koine,” became the standard speech of all Greece. But this linguistic uniformity[124] did not long continue. During the two millennia that separate the Greek of to-day from its classical prototype the Koine gradually split up into a number of dialects. Now Greece is as richly diversified in speech as in the time of Homer, though the present local dialects, aside from those of Attica itself, are not the lineal descendants of the old dialects of pre-Alexandrian days.[125] The experience of Greece is not exceptional. Old dialects are being continually wiped out only to make room for new ones. Languages can change at so many points of phonetics, morphology, and vocabulary that it is not surprising that once the linguistic community is broken it should slip off in different directions. It would be too much to expect a locally diversified language to develop along strictly parallel lines. If once the speech of a locality has begun to drift on its own account, it is practically certain to move further and further away from its linguistic fellows. Failing the retarding effect of dialectic interinfluences, which I have already touched upon, a group of dialects is bound to diverge on the whole, each from all of the others.

Under primitive conditions, political groups are small, and there's a strong tendency toward localism. It makes sense, then, that the languages of primitive people or non-urban populations, in general, have developed into a large number of dialects. In some parts of the world, nearly every village has its own dialect. Life in these geographically limited communities is narrow and intense, and their speech is uniquely their own. It's very unlikely that a language will be spoken widely without developing numerous dialects. As soon as the older dialects are smoothed over by compromises or replaced by the spread and influence of a dominant dialect, a new wave of dialects emerges to reverse the leveling effects of the past. This is exactly what happened in Greece. In classical times, many local dialects were spoken, some of which are represented in literature. As Athens gained cultural dominance, its dialect, the Attic, spread at the expense of others until, in the Hellenistic period after the Macedonian conquest, the Attic dialect, in its popular form known as “Koine,” became the standard speech of all Greece. However, this linguistic uniformity did not last long. Over the two millennia that separate today's Greek from its classical roots, the Koine gradually split into various dialects. Now Greece is as linguistically diverse as it was in Homer's time, although the current local dialects, except for those in Attica, are not direct descendants of the old dialects from before Alexander's era. Greece's experience is not unique. Old dialects are continually disappearing, only to be replaced by new ones. Languages can change in many areas of phonetics, morphology, and vocabulary, so it’s not surprising that once a linguistic community is fragmented, it will drift in different directions. It would be unrealistic to expect a locally diverse language to evolve in strictly parallel ways. Once the speech of a locality starts to evolve independently, it will almost certainly continue to diverge from its linguistic counterparts. Without the slowing effects of dialectical influences, which I've already mentioned, a group of dialects is bound to diverge overall, each from the others.

In course of time each dialect itself splits up into sub-dialects, which gradually take on the dignity of dialects proper while the primary dialects develop into mutually unintelligible languages. And so the budding process continues, until the divergences become so great that none but a linguistic student, armed with his documentary evidence and with his comparative or reconstructive method, would infer that the languages in question were genealogically related, represented independent lines of development, in other words, from a remote and common starting point. Yet it is as certain as any historical fact can be that languages so little resembling each other as Modern Irish, English, Italian, Greek, Russian, Armenian, Persian, and Bengali are but end-points in the present of drifts that converge to a meeting-point in the dim past. There is naturally no reason to believe that this earliest “Indo-European” (or “Aryan”) prototype which we can in part reconstruct, in part but dimly guess at, is itself other than a single “dialect” of a group that has either become largely extinct or is now further represented by languages too divergent for us, with our limited means, to recognize as clear kin.[126]

Over time, each dialect breaks down into sub-dialects, which slowly gain status as proper dialects while the main dialects evolve into languages that speakers can no longer understand each other. This process keeps happening until the differences become so significant that only a linguist, equipped with their research and methods, could deduce that these languages are related, representing separate lines of development that share a distant common origin. However, it's as certain as any historical fact can be that languages as different as Modern Irish, English, Italian, Greek, Russian, Armenian, Persian, and Bengali are simply the end points of historical changes that trace back to a common point in the distant past. There’s really no reason to think that this early "Indo-European" (or "Aryan") prototype, which we can partially reconstruct and partially guess at, is anything other than a single "dialect" within a group that has mostly vanished or is now represented by languages too varied for us to recognize as clearly related, given our limited resources. [126]

All languages that are known to be genetically related, i.e., to be divergent forms of a single prototype, may be considered as constituting a “linguistic stock.” There is nothing final about a linguistic stock. When we set it up, we merely say, in effect, that thus far we can go and no farther. At any point in the progress of our researches an unexpected ray of light may reveal the “stock” as but a “dialect” of a larger group. The terms dialect, language, branch, stock—it goes without saying—are purely relative terms. They are convertible as our perspective widens or contracts.[127] It would be vain to speculate as to whether or not we shall ever be able to demonstrate that all languages stem from a common source. Of late years linguists have been able to make larger historical syntheses than were at one time deemed feasible, just as students of culture have been able to show historical connections between culture areas or institutions that were at one time believed to be totally isolated from each other. The human world is contracting not only prospectively but to the backward-probing eye of culture-history. Nevertheless we are as yet far from able to reduce the riot of spoken languages to a small number of “stocks.” We must still operate with a quite considerable number of these stocks. Some of them, like Indo-European or Indo-Chinese, are spoken over tremendous reaches; others, like Basque,[128] have a curiously restricted range and are in all likelihood but dwindling remnants of groups that were at one time more widely distributed. As for the single or multiple origin of speech, it is likely enough that language as a human institution (or, if one prefers, as a human “faculty”) developed but once in the history of the race, that all the complex history of language is a unique cultural event. Such a theory constructed “on general principles” is of no real interest, however, to linguistic science. What lies beyond the demonstrable must be left to the philosopher or the romancer.

All languages that are known to be genetically related, meaning they are different forms of a single origin, can be considered a “linguistic stock.” There’s nothing definitive about a linguistic stock. When we establish one, we’re essentially saying that this is as far as we can go for now. At any point in our research, a surprising discovery might reveal that what we call a “stock” is actually just a “dialect” of a larger group. The terms dialect, language, branch, and stock are purely relative. They can change as our perspective expands or narrows.[127] It would be pointless to speculate on whether we will ever prove that all languages come from a common source. In recent years, linguists have managed to make broader historical connections than were once thought possible, just as cultural studies have established historical links between areas or institutions that were believed to be completely isolated from each other. The human world is shrinking, both in future prospects and through examining cultural history. However, we are still far from simplifying the multitude of spoken languages into a few “stocks.” We must continue to work with a relatively large number of these stocks. Some, like Indo-European or Indo-Chinese, are spoken over vast areas; others, like Basque,[128] have a surprisingly limited range and are likely just remnants of groups that were once more widely spread. Regarding the single or multiple origins of language, it’s likely that language as a human institution (or, if you prefer, as a human “ability”) developed only once in human history, making the entire complex history of language a unique cultural event. However, such a theory built “on general principles” is not particularly interesting to linguistic science. What lies beyond what can be proven should be left to philosophers or dreamers.

We must return to the conception of “drift” in language. If the historical changes that take place in a language, if the vast accumulation of minute modifications which in time results in the complete remodeling of the language, are not in essence identical with the individual variations that we note on every hand about us, if these variations are born only to die without a trace, while the equally minute, or even minuter, changes that make up the drift are forever imprinted on the history of the language, are we not imputing to this history a certain mystical quality? Are we not giving language a power to change of its own accord over and above the involuntary tendency of individuals to vary the norm? And if this drift of language is not merely the familiar set of individual variations seen in vertical perspective, that is historically, instead of horizontally, that is in daily experience, what is it? Language exists only in so far as it is actually used—spoken and heard, written and read. What significant changes take place in it must exist, to begin with, as individual variations. This is perfectly true, and yet it by no means follows that the general drift of language can be understood[129] from an exhaustive descriptive study of these variations alone. They themselves are random phenomena,[130] like the waves of the sea, moving backward and forward in purposeless flux. The linguistic drift has direction. In other words, only those individual variations embody it or carry it which move in a certain direction, just as only certain wave movements in the bay outline the tide. The drift of a language is constituted by the unconscious selection on the part of its speakers of those individual variations that are cumulative in some special direction. This direction may be inferred, in the main, from the past history of the language. In the long run any new feature of the drift becomes part and parcel of the common, accepted speech, but for a long time it may exist as a mere tendency in the speech of a few, perhaps of a despised few. As we look about us and observe current usage, it is not likely to occur to us that our language has a “slope,” that the changes of the next few centuries are in a sense prefigured in certain obscure tendencies of the present and that these changes, when consummated, will be seen to be but continuations of changes that have been already effected. We feel rather that our language is practically a fixed system and that what slight changes are destined to take place in it are as likely to move in one direction as another. The feeling is fallacious. Our very uncertainty as to the impending details of change makes the eventual consistency of their direction all the more impressive.

We need to revisit the idea of “drift” in language. If the historical shifts that occur in a language, and the gradual accumulation of small changes that eventually lead to a complete transformation of that language, aren’t fundamentally the same as the individual variations we observe around us, and if these variations are born only to fade away without leaving a trace, while the equally small, or even smaller, changes that contribute to the drift are forever marked in the language’s history, are we not attributing a mystical quality to this history? Are we not suggesting that language has an innate ability to change on its own, beyond the unintentional tendency of individuals to deviate from the norm? Furthermore, if this language drift isn’t just the familiar set of individual variations viewed historically instead of in our daily lives, what is it? Language only exists to the extent that it is actually used—spoken and heard, written and read. Any significant changes must start as individual variations. This is absolutely true, but that doesn’t mean we can fully understand the overall drift of language just by looking at these variations alone. They are themselves random occurrences, like the movement of ocean waves, shifting back and forth without purpose. The linguistic drift has a direction. In other words, only those individual variations that progress in a certain direction contribute to it, just as only specific wave patterns in the bay illustrate the tide. The drift of a language is shaped by the unconscious choices of its speakers, who select those individual variations that build cumulatively in a particular way. This direction can mainly be inferred from the language's past history. Over time, any new aspect of the drift becomes part of common, accepted speech, but for a long time, it might only appear as a tendency among a small, possibly marginalized group. As we observe current usage, it’s unlikely to strike us that our language has a “slope,” where the changes of the coming centuries are implicitly suggested by certain obscure trends present now, and that these changes, when fully realized, will be seen as continuations of changes that have already occurred. Instead, we tend to feel that our language is a mostly fixed system, with any minor changes likely to move in one direction as easily as another. This feeling is misleading. Our very uncertainty regarding the specifics of upcoming changes makes the eventual consistency of their direction all the more striking.

Sometimes we can feel where the drift is taking us even while we struggle against it. Probably the majority of those who read these words feel that it is quite “incorrect” to say “Who did you see?” We readers of many books are still very careful to say “Whom did you see?” but we feel a little uncomfortable (uncomfortably proud, it may be) in the process. We are likely to avoid the locution altogether and to say “Who was it you saw?” conserving literary tradition (the “whom”) with the dignity of silence.[131] The folk makes no apology. “Whom did you see?” might do for an epitaph, but “Who did you see?” is the natural form for an eager inquiry. It is of course the uncontrolled speech of the folk to which we must look for advance information as to the general linguistic movement. It is safe to prophesy that within a couple of hundred years from to-day not even the most learned jurist will be saying “Whom did you see?” By that time the “whom” will be as delightfully archaic as the Elizabethan “his” for “its.”[132] No logical or historical argument will avail to save this hapless “whom.” The demonstration “I: me = he: him = who: whom” will be convincing in theory and will go unheeded in practice.

Sometimes we can sense where the current is taking us even while we fight against it. Most people who read this probably think it's pretty “incorrect” to say “Who did you see?” While those of us who read a lot are still careful to say “Whom did you see?” we feel a bit uncomfortable (perhaps uncomfortably proud) while doing so. We often choose to avoid the expression entirely and say “Who was it you saw?" preserving literary tradition (the “whom”) with the dignity of silence.[131] The common people make no apologies. “Whom did you see?” might work for an epitaph, but “Who did you see?” is the natural way to ask out of curiosity. Of course, it's the casual speech of the people that we need to look to for insight into the overall linguistic trends. It’s safe to predict that in a couple of hundred years, not even the most educated lawyer will be saying “Whom did you see?” By then, “whom” will be as charmingly outdated as the Elizabethan “his” for “its.”[132] No logical or historical argument will be enough to save this unfortunate “whom.” The reasoning “I: me = he: him = who: whom” will sound convincing in theory but will be ignored in practice.

Even now we may go so far as to say that the majority of us are secretly wishing they could say “Who did you see?” It would be a weight off their unconscious minds if some divine authority, overruling the lifted finger of the pedagogue, gave them carte blanche. But we cannot too frankly anticipate the drift and maintain caste. We must affect ignorance of whither we are going and rest content with our mental conflict—uncomfortable conscious acceptance of the “whom,” unconscious desire for the “who.”[133] Meanwhile we indulge our sneaking desire for the forbidden locution by the use of the “who” in certain twilight cases in which we can cover up our fault by a bit of unconscious special pleading. Imagine that some one drops the remark when you are not listening attentively, “John Smith is coming to-night.” You have not caught the name and ask, not “Whom did you say?” but “Who did you say?” There is likely to be a little hesitation in the choice of the form, but the precedent of usages like “Whom did you see?” will probably not seem quite strong enough to induce a “Whom did you say?” Not quite relevant enough, the grammarian may remark, for a sentence like “Who did you say?” is not strictly analogous to “Whom did you see?” or “Whom did you mean?” It is rather an abbreviated form of some such sentence as “Who, did you say, is coming to-night?” This is the special pleading that I have referred to, and it has a certain logic on its side. Yet the case is more hollow than the grammarian thinks it to be, for in reply to such a query as “You’re a good hand at bridge, John, aren’t you?” John, a little taken aback, might mutter “Did you say me?” hardly “Did you say I?” Yet the logic for the latter (“Did you say I was a good hand at bridge?”) is evident. The real point is that there is not enough vitality in the “whom” to carry it over such little difficulties as a “me” can compass without a thought. The proportion “I : me = he : him = who : whom” is logically and historically sound, but psychologically shaky. “Whom did you see?” is correct, but there is something false about its correctness.

Even now, we can say that most of us secretly wish we could ask, “Who did you see?” It would ease our minds if some higher authority, overriding the pointed finger of the teacher, gave us carte blanche. But we can't too openly predict the direction we’re headed while maintaining our status. We must pretend we don’t know where we’re going and accept our mental struggle—uncomfortable awareness of the “whom,” and a hidden desire for the “who.”[133] Meanwhile, we secretly indulge our desire for the forbidden word by using “who” in certain borderline cases where we can justify our mistake with a bit of unconscious reasoning. Imagine someone casually mentions, while you're not paying full attention, “John Smith is coming tonight.” You miss the name and instead of asking, “Whom did you say?” you ask, “Who did you say?” There might be a brief pause in choosing the phrasing, but the precedent of “Whom did you see?” probably won't feel strong enough to prompt a “Whom did you say?” A grammarian might argue it’s not quite relevant enough, since a sentence like “Who did you say?” isn’t exactly the same as “Whom did you see?” or “Whom did you mean?” It’s more like a shortened version of something like “Who, did you say, is coming tonight?” This is the justifying reasoning I mentioned, and it does have some logic to it. Yet, the argument isn't as solid as the grammarian thinks. For example, in response to a question like, “You’re good at bridge, John, right?” John, slightly surprised, might mumble, “Did you say me?” rather than “Did you say I?” Although the logic for the latter (“Did you say I was good at bridge?”) is clear. The main point is that there isn’t enough strength in “whom” to handle such minor challenges that “me” can easily navigate without a second thought. The proportion “I : me = he : him = who : whom” is logically and historically correct, but psychologically fragile. “Whom did you see?” is technically accurate, but there’s something off about its accuracy.

It is worth looking into the reason for our curious reluctance to use locutions involving the word “whom” particularly in its interrogative sense. The only distinctively objective forms which we still possess in English are me, him, her (a little blurred because of its identity with the possessive her), us, them, and whom. In all other cases the objective has come to be identical with the subjective—that is, in outer form, for we are not now taking account of position in the sentence. We observe immediately in looking through the list of objective forms that whom is psychologically isolated. Me, him, her, us, and them form a solid, well-integrated group of objective personal pronouns parallel to the subjective series I, he, she, we, they. The forms who and whom are technically “pronouns” but they are not felt to be in the same box as the personal pronouns. Whom has clearly a weak position, an exposed flank, for words of a feather tend to flock together, and if one strays behind, it is likely to incur danger of life. Now the other interrogative and relative pronouns (which, what, that), with which whom should properly flock, do not distinguish the subjective and objective forms. It is psychologically unsound to draw the line of form cleavage between whom and the personal pronouns on the one side, the remaining interrogative and relative pronouns on the other. The form groups should be symmetrically related to, if not identical with, the function groups. Had which, what, and that objective forms parallel to whom, the position of this last would be more secure. As it is, there is something unesthetic about the word. It suggests a form pattern which is not filled out by its fellows. The only way to remedy the irregularity of form distribution is to abandon the whom altogether for we have lost the power to create new objective forms and cannot remodel our which-what-that group so as to make it parallel with the smaller group who-whom. Once this is done, who joins its flock and our unconscious desire for form symmetry is satisfied. We do not secretly chafe at “Whom did you see?” without reason.[134]

It’s interesting to explore the reason for our strange reluctance to use phrases with the word “whom,” especially in questions. The only distinctively objective forms we still have in English are me, him, her (which is a bit blurred because it’s the same as the possessive her), us, them, and whom. In all other cases, the objective form has become the same as the subjective one—in terms of outer form, not counting its position in the sentence. When we look at the list of objective forms, it’s clear that whom feels isolated. Me, him, her, us, and them make a solid, well-integrated group of objective personal pronouns that matches the subjective series I, he, she, we, they. The terms who and whom are technically “pronouns,” but they don’t feel like a part of the personal pronoun group. Whom clearly has a weaker position, an exposed side, because words that are similar tend to group together, and if one falls behind, it risks being overlooked. The other interrogative and relative pronouns (which, what, that), which whom should naturally be grouped with, don’t differentiate between subjective and objective forms. It’s not logically sound to separate whom and the personal pronouns on one side and the other interrogative and relative pronouns on the other. The form groups should be related symmetrically to, if not identical with, the function groups. If which, what, and that had objective forms that matched whom, whom would have a more secure position. As it stands, there’s something aesthetically unpleasing about the word. It suggests a pattern that isn’t completed by its companions. The only way to fix this irregular distribution of forms is to eliminate whom altogether, as we’ve lost the ability to create new objective forms and can’t reform our which-what-that group to align with the smaller who-whom group. Once that’s done, who fits in, and our subconscious craving for symmetrical form is satisfied. We don’t irrationally dislike “Whom did you see?” without a reason.[134]

But the drift away from whom has still other determinants. The words who and whom in their interrogative sense are psychologically related not merely to the pronouns which and what, but to a group of interrogative adverbs—where, when, how—all of which are invariable and generally emphatic. I believe it is safe to infer that there is a rather strong feeling in English that the interrogative pronoun or adverb, typically an emphatic element in the sentence, should be invariable. The inflective -m of whom is felt as a drag upon the rhetorical effectiveness of the word. It needs to be eliminated if the interrogative pronoun is to receive all its latent power. There is still a third, and a very powerful, reason for the avoidance of whom. The contrast between the subjective and objective series of personal pronouns (I, he, she, we, they: me, him, her, us, them) is in English associated with a difference of position. We say I see the man but the man sees me; he told him, never him he told or him told he. Such usages as the last two are distinctly poetic and archaic; they are opposed to the present drift of the language. Even in the interrogative one does not say Him did you see? It is only in sentences of the type Whom did you see? that an inflected objective before the verb is now used at all. On the other hand, the order in Whom did you see? is imperative because of its interrogative form; the interrogative pronoun or adverb normally comes first in the sentence (What are you doing? When did he go? Where are you from?). In the “whom” of Whom did you see? there is concealed, therefore, a conflict between the order proper to a sentence containing an inflected objective and the order natural to a sentence with an interrogative pronoun or adverb. The solution Did you see whom? or You saw whom?[135] is too contrary to the idiomatic drift of our language to receive acceptance. The more radical solution Who did you see? is the one the language is gradually making for.

But the shift away from whom has other factors influencing it. The words who and whom in their questioning form are psychologically connected not just to the pronouns which and what, but also to a set of questioning adverbs—where, when, how—which are all consistent and generally emphasize the point. I think it’s safe to say that there’s a strong sentiment in English that the questioning pronoun or adverb, which usually is an emphasized element in the sentence, should be consistent. The inflection -m in whom feels like a hindrance to the rhetorical power of the word. It should be removed if the questioning pronoun is to unleash all its potential. There’s also a third, very compelling reason for avoiding whom. The difference between the subjective and objective forms of personal pronouns (I, he, she, we, they: me, him, her, us, them) in English is often linked to a difference in position. We say I see the man but the man sees me; he told him, never him he told or him told he. Such usages as the last two are distinctly poetic and old-fashioned; they go against the current trend of the language. Even in questions, we don’t say Him did you see? It's only in questions like Whom did you see? that an inflected objective before the verb is now used at all. On the flip side, the order in Whom did you see? is necessary because of its question form; the questioning pronoun or adverb typically comes first in a sentence (What are you doing? When did he go? Where are you from?). In the “whom” of Whom did you see?, there’s a hidden conflict between the structure suited for a sentence with an inflected objective and the structure typical of a sentence with a questioning pronoun or adverb. The alternative Did you see whom? or You saw whom?[135] goes against the natural trend of our language and is unlikely to be accepted. The more straightforward option Who did you see? is the one the language is gradually moving towards.

These three conflicts—on the score of form grouping, of rhetorical emphasis, and of order—are supplemented by a fourth difficulty. The emphatic whom, with its heavy build (half-long vowel followed by labial consonant), should contrast with a lightly tripping syllable immediately following. In whom did, however, we have an involuntary retardation that makes the locution sound “clumsy.” This clumsiness is a phonetic verdict, quite apart from the dissatisfaction due to the grammatical factors which we have analyzed. The same prosodic objection does not apply to such parallel locutions as what did and when did. The vowels of what and when are shorter and their final consonants melt easily into the following d, which is pronounced in the same tongue position as t and n. Our instinct for appropriate rhythms makes it as difficult for us to feel content with whom did as for a poet to use words like dreamed and hummed in a rapid line. Neither common feeling nor the poet’s choice need be at all conscious. It may be that not all are equally sensitive to the rhythmic flow of speech, but it is probable that rhythm is an unconscious linguistic determinant even with those who set little store by its artistic use. In any event the poet’s rhythms can only be a more sensitive and stylicized application of rhythmic tendencies that are characteristic of the daily speech of his people.

These three conflicts—regarding form grouping, rhetorical emphasis, and order—are complicated by a fourth issue. The emphatic whom, with its heavy sound (a long vowel followed by a labial consonant), should contrast with a light, quick syllable right after it. However, in whom did, there’s an involuntary pause that makes it sound “clumsy.” This awkwardness is a phonetic issue, separate from the dissatisfaction caused by the grammatical factors we’ve analyzed. The same prosodic issue doesn’t apply to similar phrases like what did and when did. The vowels in what and when are shorter, and their final consonants blend smoothly into the following d, which is pronounced in the same position as t and n. Our instinct for the right rhythms makes it just as hard for us to feel satisfied with whom did as it is for a poet to use words like dreamed and hummed in a quick line. This sense of rhythm doesn’t need to be a conscious choice for either regular speakers or poets. Not everyone may be equally attuned to the rhythmic flow of speech, but it’s likely that rhythm plays an unconscious role in language even for those who don’t value its artistic use. In any case, a poet’s rhythms are just a more sensitive and stylized version of the rhythmic patterns found in the everyday speech of their community.

We have discovered no less than four factors which enter into our subtle disinclination to say “Whom did you see?” The uneducated folk that says “Who did you see?” with no twinge of conscience has a more acute flair for the genuine drift of the language than its students. Naturally the four restraining factors do not operate independently. Their separate energies, if we may make bold to use a mechanical concept, are “canalized” into a single force. This force or minute embodiment of the general drift of the language is psychologically registered as a slight hesitation in using the word whom. The hesitation is likely to be quite unconscious, though it may be readily acknowledged when attention is called to it. The analysis is certain to be unconscious, or rather unknown, to the normal speaker.[136] How, then, can we be certain in such an analysis as we have undertaken that all of the assigned determinants are really operative and not merely some one of them? Certainly they are not equally powerful in all cases. Their values are variable, rising and falling according to the individual and the locution.[137] But that they really exist, each in its own right, may sometimes be tested by the method of elimination. If one or other of the factors is missing and we observe a slight diminution in the corresponding psychological reaction (“hesitation” in our case), we may conclude that the factor is in other uses genuinely positive. The second of our four factors applies only to the interrogative use of whom, the fourth factor applies with more force to the interrogative than to the relative. We can therefore understand why a sentence like Is he the man whom you referred to? though not as idiomatic as Is he the man (that) you referred to? (remember that it sins against counts one and three), is still not as difficult to reconcile with our innate feeling for English expression as Whom did you see? If we eliminate the fourth factor from the interrogative usage,[138] say in Whom are you looking at? where the vowel following whom relieves this word of its phonetic weight, we can observe, if I am not mistaken, a lesser reluctance to use the whom. Who are you looking at? might even sound slightly offensive to ears that welcome Who did you see?

We have identified four factors that contribute to our subtle reluctance to say “Whom did you see?” The uneducated people who say “Who did you see?” without feeling guilty have a sharper understanding of the real direction of the language than those who study it. Naturally, the four restraining factors do not work independently. Their individual influences, if we may be bold enough to borrow a mechanical term, get “channeled” into a single force. This force, or small representation of the overall trend of the language, is psychologically recognized as a slight hesitation when using the word whom. This hesitation is likely to be unconscious, although it can be easily acknowledged when it’s pointed out. The analysis is likely to be unconscious or, more accurately, unknown to the typical speaker.[136] How, then, can we be sure in such an analysis we’ve undertaken that all the attributed factors are genuinely at play and not just one of them? Certainly, they aren’t equally compelling in every case. Their significance can vary, increasing or decreasing depending on the individual and the phrasing.[137] However, we can sometimes test their genuine existence, each in its own way, by using the method of elimination. If one of the factors is absent and we observe a slight decrease in the corresponding psychological response (“hesitation,” in our case), we can conclude that the factor is indeed positively influential in other contexts. The second of our four factors pertains only to the questioning use of whom, and the fourth factor has a stronger impact on questions than on relatives. Thus, we can see why a sentence like Is he the man whom you referred to?, while not as idiomatic as Is he the man (that) you referred to? (keeping in mind that it violates counts one and three), is still not as difficult to reconcile with our natural sense of English expression as Whom did you see? If we remove the fourth factor from the questioning usage,[138] for instance in Whom are you looking at?, where the vowel following whom lightens its phonetic burden, we can observe, if I’m not mistaken, a decreased reluctance to use whom. Who are you looking at? might even sound slightly rude to those who accept Who did you see?

We may set up a scale of “hesitation values” somewhat after this fashion:

We can create a scale of "hesitation values" like this:

Value 1: factors 1, 3. "The man I mentioned."
Value 2: factors 1, 3, 4. "The man they were talking about."
Value 3: factors 1, 2, 3. "Who are you looking at?"
Value 4: factors 1, 2, 3, 4. “Who did you see?”

We may venture to surmise that while whom will ultimately disappear from English speech, locutions of the type Whom did you see? will be obsolete when phrases like The man whom I referred to are still in lingering use. It is impossible to be certain, however, for we can never tell if we have isolated all the determinants of a drift. In our particular case we have ignored what may well prove to be a controlling factor in the history of who and whom in the relative sense. This is the unconscious desire to leave these words to their interrogative function and to concentrate on that or mere word order as expressions of the relative (e.g., The man that I referred to or The man I referred to). This drift, which does not directly concern the use of whom as such (merely of whom as a form of who), may have made the relative who obsolete before the other factors affecting relative whom have run their course. A consideration like this is instructive because it indicates that knowledge of the general drift of a language is insufficient to enable us to see clearly what the drift is heading for. We need to know something of the relative potencies and speeds of the components of the drift.

We can guess that while whom will eventually fade from English, phrases like Whom did you see? will become outdated even as constructions like The man whom I referred to continue to be used. However, it's impossible to be sure, as we can never know if we’ve identified all the factors influencing this change. In this case, we’ve overlooked what might be a significant element in the history of who and whom in their relative sense. This is the unconscious tendency to reserve these words for questions and to focus on that or simple word order as ways to express the relative (e.g., The man that I referred to or The man I referred to). This change, which doesn’t directly impact the use of whom itself (but rather whom as a variation of who), may lead to the relative who becoming outdated before other factors affecting relative whom have fully played out. Thoughts like this are enlightening because they show that understanding the overall trend of a language isn’t enough to clearly see where that trend is going. We need to understand the relative strengths and speeds of the different elements of this change.

It is hardly necessary to say that the particular drifts involved in the use of whom are of interest to us not for their own sake but as symptoms of larger tendencies at work in the language. At least three drifts of major importance are discernible. Each of these has operated for centuries, each is at work in other parts of our linguistic mechanism, each is almost certain to continue for centuries, possibly millennia. The first is the familiar tendency to level the distinction between the subjective and the objective, itself but a late chapter in the steady reduction of the old Indo-European system of syntactic cases. This system, which is at present best preserved in Lithuanian,[139] was already considerably reduced in the old Germanic language of which English, Dutch, German, Danish, and Swedish are modern dialectic forms. The seven Indo-European cases (nominative genitive, dative, accusative, ablative, locative, instrumental) had been already reduced to four (nominative genitive, dative, accusative). We know this from a careful comparison of and reconstruction based on the oldest Germanic dialects of which we still have records (Gothic, Old Icelandic, Old High German, Anglo-Saxon). In the group of West Germanic dialects, for the study of which Old High German, Anglo-Saxon, Old Frisian, and Old Saxon are our oldest and most valuable sources, we still have these four cases, but the phonetic form of the case syllables is already greatly reduced and in certain paradigms particular cases have coalesced. The case system is practically intact but it is evidently moving towards further disintegration. Within the Anglo-Saxon and early Middle English period there took place further changes in the same direction. The phonetic form of the case syllables became still further reduced and the distinction between the accusative and the dative finally disappeared. The new “objective” is really an amalgam of old accusative and dative forms; thus, him, the old dative (we still say I give him the book, not “abbreviated” from I give to him; compare Gothic imma, modern German ihm), took over the functions of the old accusative (Anglo-Saxon hine; compare Gothic ina, Modern German ihn) and dative. The distinction between the nominative and accusative was nibbled away by phonetic processes and morphological levelings until only certain pronouns retained distinctive subjective and objective forms.

It’s hardly necessary to say that the specific changes related to the use of whom interest us not for their own sake but as indicators of broader trends happening in the language. At least three major shifts are noticeable. Each of these has been happening for centuries, is active in other parts of our language system, and is likely to continue for centuries, possibly millennia. The first is the well-known tendency to blur the lines between subjective and objective, which is just a recent development in the ongoing simplification of the old Indo-European system of syntactic cases. This system, which is currently best preserved in Lithuanian,[139] was already significantly reduced in the old Germanic languages, which include the modern dialects of English, Dutch, German, Danish, and Swedish. The seven Indo-European cases (nominative, genitive, dative, accusative, ablative, locative, instrumental) were already condensed to four (nominative, genitive, dative, accusative). We know this from careful comparisons and reconstructions based on the oldest Germanic dialects for which we still have records (Gothic, Old Icelandic, Old High German, Anglo-Saxon). In the group of West Germanic dialects, for which Old High German, Anglo-Saxon, Old Frisian, and Old Saxon are our oldest and most valuable sources, we still have these four cases, but the phonetic forms of the case syllables have already been greatly simplified, and in some patterns, particular cases have merged. The case system is mostly intact, but it is clearly heading toward further breakdown. During the Anglo-Saxon and early Middle English periods, more changes in the same direction occurred. The phonetic form of the case syllables became even more reduced, and the distinction between accusative and dative eventually vanished. The new “objective” is really a blend of old accusative and dative forms; therefore, him, the old dative (we still say I give him the book, not “abbreviated” from I give to him; compare Gothic imma, modern German ihm), took on the roles of the old accusative (Anglo-Saxon hine; compare Gothic ina, Modern German ihn) and dative. The differences between nominative and accusative were diminished through phonetic changes and morphological leveling until only certain pronouns kept their distinctive subjective and objective forms.

In later medieval and in modern times there have been comparatively few apparent changes in our case system apart from the gradual replacement of thou—thee (singular) and subjective ye—objective you (plural) by a single undifferentiated form you. All the while, however, the case system, such as it is (subjective-objective, really absolutive, and possessive in nouns; subjective, objective, and possessive in certain pronouns) has been steadily weakening in psychological respects. At present it is more seriously undermined than most of us realize. The possessive has little vitality except in the pronoun and in animate nouns. Theoretically we can still say the moon’s phases or a newspaper’s vogue; practically we limit ourselves pretty much to analytic locutions like the phases of the moon and the vogue of a newspaper. The drift is clearly toward the limitation, of possessive forms to animate nouns. All the possessive pronominal forms except its and, in part, their and theirs, are also animate. It is significant that theirs is hardly ever used in reference to inanimate nouns, that there is some reluctance to so use their, and that its also is beginning to give way to of it. The appearance of it or the looks of it is more in the current of the language than its appearance. It is curiously significant that its young (referring to an animal’s cubs) is idiomatically preferable to the young of it. The form is only ostensibly neuter, in feeling it is animate; psychologically it belongs with his children, not with the pieces of it. Can it be that so common a word as its is actually beginning to be difficult? Is it too doomed to disappear? It would be rash to say that it shows signs of approaching obsolescence, but that it is steadily weakening is fairly clear.[140] In any event, it is not too much to say that there is a strong drift towards the restriction of the inflected possessive forms to animate nouns and pronouns.

In the later medieval and modern periods, there have been relatively few noticeable changes in our case system aside from the gradual replacement of thou—thee (singular) and the subjective ye—objective you (plural) with a single form, you. Throughout this time, however, the case system—such as it is, with subjective-objective, really absolutive, and possessive in nouns; and subjective, objective, and possessive in certain pronouns—has been gradually weakening psychologically. Right now, it is more undermined than most of us realize. The possessive form has little strength except in pronouns and in animate nouns. Theoretically, we can still say the moon’s phases or a newspaper’s vogue; practically, we mostly stick to phrases like the phases of the moon and the vogue of a newspaper. The trend is clearly towards limiting possessive forms to animate nouns. All possessive pronouns except its and, to some extent, their and theirs are also animate. It's notable that theirs is hardly ever used for inanimate nouns, that there is some hesitation in using their in that context, and that its is also starting to be replaced by of it. Phrases like the appearance of it or the looks of it are more common in current usage than its appearance. Strangely enough, its young (referring to an animal's cubs) is idiomatically preferred over the young of it. The form is only seemingly neuter; in feeling, it is animate and psychologically aligns more with his children than with the pieces of it. Could it be that such a common word as its is actually becoming difficult? Is it destined to disappear? It would be reckless to claim that it shows signs of becoming obsolete, but it's clear that it is steadily weakening. In any case, it's fair to say that there is a strong trend towards limiting inflected possessive forms to animate nouns and pronouns.

How is it with the alternation of subjective and objective in the pronoun? Granted that whom is a weak sister, that the two cases have been leveled in you (in it, that, and what they were never distinct, so far as we can tell[141]), and that her as an objective is a trifle weak because of its formal identity with the possessive her, is there any reason to doubt the vitality of such alternations as I see the man and the man sees me? Surely the distinction between subjective I and objective me, between subjective he and objective him, and correspondingly for other personal pronouns, belongs to the very core of the language. We can throw whom to the dogs, somehow make shift to do without an its, but to level I and me to a single case—would that not be to un-English our language beyond recognition? There is no drift toward such horrors as Me see him or I see he. True, the phonetic disparity between I and me, he and him, we and us, has been too great for any serious possibility of form leveling. It does not follow that the case distinction as such is still vital. One of the most insidious peculiarities of a linguistic drift is that where it cannot destroy what lies in its way it renders it innocuous by washing the old significance out of it. It turns its very enemies to its own uses. This brings us to the second of the major drifts, the tendency to fixed position in the sentence, determined by the syntactic relation of the word.

How does the use of subjective and objective pronouns work? It's true that whom is less common, that both forms have merged in you (with it, that, and what they were never distinct, as far as we can tell[141]), and that her as an objective form feels a bit weak since it looks the same as the possessive her. Is there any reason to doubt the effectiveness of using phrases like I see the man and the man sees me? The distinction between subjective I and objective me, and between subjective he and objective him, along with similar pairs in other personal pronouns, is fundamental to the language. We might give up whom and find ways to work without an its, but merging I and me into one form—wouldn’t that make our language unrecognizable? There's no trend towards using phrases like Me see him or I see he. True, the sound differences between I and me, he and him, we and us, have been too significant for any real chance of merging them. However, that doesn't necessarily mean the distinction still holds strong. One of the sneakiest aspects of language change is that when it can't eliminate something, it often diminishes it by stripping away its original meaning. It can turn its own opponents to serve its purpose. This leads us to the second major trend, the tendency towards fixed positions in sentences, based on the syntactic relationship of the words.

We need not go into the history of this all-important drift. It is enough to know that as the inflected forms of English became scantier, as the syntactic relations were more and more inadequately expressed by the forms of the words themselves, position in the sentence gradually took over functions originally foreign to it. The man in the man sees the dog is subjective; in the dog sees the man, objective. Strictly parallel to these sentences are he sees the dog and the dog sees him. Are the subjective value of he and the objective value of him entirely, or even mainly, dependent on the difference of form? I doubt it. We could hold to such a view if it were possible to say the dog sees he or him sees the dog. It was once possible to say such things, but we have lost the power. In other words, at least part of the case feeling in he and him is to be credited to their position before or after the verb. May it not be, then, that he and him, we and us, are not so much subjective and objective forms as pre-verbal and post-verbal[142] forms, very much as my and mine are now pre-nominal and post-nominal forms of the possessive (my father but father mine; it is my book but the book is mine)? That this interpretation corresponds to the actual drift of the English language is again indicated by the language of the folk. The folk says it is me, not it is I, which is “correct” but just as falsely so as the whom did you see? that we have analyzed. I’m the one, it’s me; we’re the ones, it’s us that will win out—such are the live parallelisms in English to-day. There is little doubt that it is I will one day be as impossible in English as c’est je, for c’est moi, is now in French.

We don’t need to delve into the history of this crucial shift. It’s enough to understand that as the inflected forms of English became less common, and as the grammatical relationships were more poorly expressed by the word forms themselves, the position in the sentence gradually took on functions that weren’t originally its. The man in the man sees the dog is in the subject position; in the dog sees the man, it’s in the object position. Strictly parallel to these sentences are he sees the dog and the dog sees him. Is the subjective function of he and the objective function of him completely, or even mainly, dependent on the difference in form? I doubt it. We could maintain that view if we could say the dog sees he or him sees the dog. It was once possible to say things like that, but we’ve lost that ability. In other words, part of the distinction in feeling between he and him is due to their position before or after the verb. Could it be, then, that he and him, we and us, are not so much subjective and objective forms as pre-verbal and post-verbal[142] forms, similar to how my and mine are now pre-nominal and post-nominal forms of possession (my father but father mine; it is my book but the book is mine)? This interpretation aligns with the actual evolution of the English language, as shown by everyday speech. People say it is me, not it is I, which is “correct” but just as wrongly so as whom did you see? that we’ve discussed. I’m the one, it’s me; we’re the ones, it’s us that will win—these are the living parallels in English today. It’s likely that it is I will one day be as impossible in English as c’est je is now in French, where c’est moi is commonly used.

How differently our I: me feels than in Chaucer’s day is shown by the Chaucerian it am I. Here the distinctively subjective aspect of the I was enough to influence the form of the preceding verb in spite of the introductory it; Chaucer’s locution clearly felt more like a Latin sum ego than a modern it is I or colloquial it is me. We have a curious bit of further evidence to prove that the English personal pronouns have lost some share of their original syntactic force. Were he and she subjective forms pure and simple, were they not striving, so to speak, to become caseless absolutives, like man or any other noun, we should not have been able to coin such compounds as he-goat and she-goat, words that are psychologically analogous to bull-moose and mother-bear. Again, in inquiring about a new-born baby, we ask Is it a he or a she? quite as though he and she were the equivalents of male and female or boy and girl. All in all, we may conclude that our English case system is weaker than it looks and that, in one way or another, it is destined to get itself reduced to an absolutive (caseless) form for all nouns and pronouns but those that are animate. Animate nouns and pronouns are sure to have distinctive possessive forms for an indefinitely long period.

How differently our I: me feels compared to Chaucer’s time is shown by the Chaucerian it am I. Here, the uniquely subjective aspect of the I was strong enough to affect the form of the preceding verb despite the introductory it; Chaucer’s way of saying it felt more like a Latin sum ego than a modern it is I or casual it is me. We have an interesting piece of additional evidence that shows the English personal pronouns have lost some of their original syntactic power. If he and she were pure subjective forms and not trying, so to speak, to become caseless absolutives like man or any other noun, we wouldn't be able to create compounds like he-goat and she-goat, words that are psychologically similar to bull-moose and mother-bear. Moreover, when asking about a newborn baby, we say Is it a he or a she? as if he and she were equivalent to male and female or boy and girl. Overall, we can conclude that our English case system is weaker than it seems and that, in one way or another, it's likely to become reduced to an absolutive (caseless) form for all nouns and pronouns except for those that are animate. Animate nouns and pronouns will undoubtedly have distinct possessive forms for a very long time.

Meanwhile observe that the old alignment of case forms is being invaded by two new categories—a positional category (pre-verbal, post-verbal) and a classificatory category (animate, inanimate). The facts that in the possessive animate nouns and pronouns are destined to be more and more sharply distinguished from inanimate nouns and pronouns (the man’s, but of the house; his, but of it) and that, on the whole, it is only animate pronouns that distinguish pre-verbal and post-verbal forms[143] are of the greatest theoretical interest. They show that, however the language strive for a more and more analytic form, it is by no means manifesting a drift toward the expression of “pure” relational concepts in the Indo-Chinese manner.[144] The insistence on the concreteness of the relational concepts is clearly stronger than the destructive power of the most sweeping and persistent drifts that we know of in the history and prehistory of our language.

Meanwhile, note that the old way of organizing case forms is being challenged by two new categories—a positional category (pre-verbal, post-verbal) and a classificatory category (animate, inanimate). The fact that possessive animate nouns and pronouns are becoming more clearly distinguished from inanimate nouns and pronouns (the man’s, but of the house; his, but of it), and that, generally, only animate pronouns distinguish between pre-verbal and post-verbal forms[143] is of significant theoretical interest. They indicate that, no matter how much the language tries to adopt a more analytical form, it is not showing a tendency towards expressing “pure” relational concepts in the Indo-Chinese style.[144] The emphasis on the concreteness of relational concepts is clearly stronger than the weakening effects of the most extensive and enduring changes we know of in the history and prehistory of our language.

The drift toward the abolition of most case distinctions and the correlative drift toward position as an all-important grammatical method are accompanied, in a sense dominated, by the last of the three major drifts that I have referred to. This is the drift toward the invariable word. In analyzing the “whom” sentence I pointed out that the rhetorical emphasis natural to an interrogative pronoun lost something by its form variability (who, whose, whom). This striving for a simple, unnuanced correspondence between idea and word, as invariable as may be, is very strong in English. It accounts for a number of tendencies which at first sight seem unconnected. Certain well-established forms, like the present third person singular -s of works or the plural -s of books, have resisted the drift to invariable words, possibly because they symbolize certain stronger form cravings that we do not yet fully understand. It is interesting to note that derivations that get away sufficiently from the concrete notion of the radical word to exist as independent conceptual centers are not affected by this elusive drift. As soon as the derivation runs danger of being felt as a mere nuancing of, a finicky play on, the primary concept, it tends to be absorbed by the radical word, to disappear as such. English words crave spaces between them, they do not like to huddle in clusters of slightly divergent centers of meaning, each edging a little away from the rest. Goodness, a noun of quality, almost a noun of relation, that takes its cue from the concrete idea of “good” without necessarily predicating that quality (e.g., I do not think much of his goodness) is sufficiently spaced from good itself not to need fear absorption. Similarly, unable can hold its own against able because it destroys the latter’s sphere of influence; unable is psychologically as distinct from able as is blundering or stupid. It is different with adverbs in -ly. These lean too heavily on their adjectives to have the kind of vitality that English demands of its words. Do it quickly! drags psychologically. The nuance expressed by quickly is too close to that of quick, their circles of concreteness are too nearly the same, for the two words to feel comfortable together. The adverbs in -ly are likely to go to the wall in the not too distant future for this very reason and in face of their obvious usefulness. Another instance of the sacrifice of highly useful forms to this impatience of nuancing is the group whence, whither, hence, hither, thence, thither. They could not persist in live usage because they impinged too solidly upon the circles of meaning represented by the words where, here and there. In saying whither we feel too keenly that we repeat all of where. That we add to where an important nuance of direction irritates rather than satisfies. We prefer to merge the static and the directive (Where do you live? like Where are you going?) or, if need be, to overdo a little the concept of direction (Where are you running to?).

The movement toward getting rid of most grammatical distinctions and the increasing importance of position as a key grammatical method are, in a way, overshadowed by the last of the three major trends I've mentioned. This trend is toward the unchanging word. In analyzing the “whom” sentence, I noted that the natural rhetorical emphasis of an interrogative pronoun loses some impact because it changes form (who, whose, whom). The desire for a straightforward, unambiguous link between an idea and a word, no matter how constant, is very strong in English. This explains several tendencies that may initially seem unrelated. Certain established forms, like the present third person singular -s of works or the plural -s of books, have resisted the push toward unchanging words, possibly because they represent stronger form cravings that we don’t fully grasp yet. It's noteworthy that words that deviate enough from the concrete idea of the root word to stand as independent concepts aren’t influenced by this subtle trend. As soon as a derivation risks being seen merely as a slight variation of the primary concept, it tends to cling to the root word and fade away. English words prefer gaps between them; they don’t like to crowd together in clusters of slightly different meanings, each slightly distancing itself from the others. Goodness, a quality noun, almost a relational noun, derives from the concrete idea of “good” without necessarily asserting that quality (e.g., I do not think much of his goodness), offers enough distance from good itself to avoid being absorbed. Similarly, unable can stand alone against able because it eliminates the latter’s influence; unable is psychologically as distinct from able as blundering or stupid. Adverbs ending in -ly, however, rely too heavily on their adjectives to have the kind of vitality that English demands. Do it quickly! feels awkward. The nuance expressed by quickly is too similar to that of quick, and their meanings overlap too closely for the two words to coexist comfortably. The adverbs ending in -ly are likely to fade away in the near future for this very reason, despite their clear usefulness. Another example of sacrificing highly useful forms due to this impatience with nuance is the group whence, whither, hence, hither, thence, thither. They can’t maintain their usage because they clash too strongly with the meanings of where, here, and there. When we say whither, it feels too much like we're just repeating where. While adding an important nuance of direction should be satisfying, it actually frustrates us. We prefer to blend the static and the directive (Where do you live? similar to Where are you going?), or, if necessary, to exaggerate the concept of direction a bit (Where are you running to?).

Now it is highly symptomatic of the nature of the drift away from word clusters that we do not object to nuances as such, we object to having the nuances formally earmarked for us. As a matter of fact our vocabulary is rich in near-synonyms and in groups of words that are psychologically near relatives, but these near-synonyms and these groups do not hang together by reason of etymology. We are satisfied with believe and credible just because they keep aloof from each other. Good and well go better together than quick and quickly. The English vocabulary is a rich medley because each English word wants its own castle. Has English long been peculiarly receptive to foreign words because it craves the staking out of as many word areas as possible, or, conversely, has the mechanical imposition of a flood of French and Latin loan-words, unrooted in our earlier tradition, so dulled our feeling for the possibilities of our native resources that we are allowing these to shrink by default? I suspect that both propositions are true. Each feeds on the other. I do not think it likely, however, that the borrowings in English have been as mechanical and external a process as they are generally represented to have been. There was something about the English drift as early as the period following the Norman Conquest that welcomed the new words. They were a compensation for something that was weakening within.

Now it’s a clear sign of the shift away from word clusters that we don’t mind nuances themselves; we just don’t like having them explicitly pointed out to us. In fact, our vocabulary is full of near-synonyms and groups of words that are psychologically related, but these near-synonyms and groups don’t connect due to their origins. We’re fine with believe and credible simply because they maintain their distance from each other. Good and well fit together better than quick and quickly. The English vocabulary is a rich mix because every English word wants its own unique space. Has English always been particularly open to foreign words because it seeks to carve out as many word areas as possible, or, alternatively, has the overwhelming influx of French and Latin loan-words, disconnected from our earlier tradition, dulled our appreciation for the possibilities of our native resources, causing them to decline by default? I suspect that both ideas are true. Each one influences the other. However, I don’t think the borrowing in English has been as mechanical and external as it’s usually portrayed. There was something about the English evolution, especially after the Norman Conquest, that embraced new words. They served as a remedy for something that was weakening inside.

VIII

Language as a Historical Product: Phonetic Law

I have preferred to take up in some detail the analysis of our hesitation in using a locution like “Whom did you see?” and to point to some of the English drifts, particular and general, that are implied by this hesitation than to discuss linguistic change in the abstract. What is true of the particular idiom that we started with is true of everything else in language. Nothing is perfectly static. Every word, every grammatical element, every locution, every sound and accent is a slowly changing configuration, molded by the invisible and impersonal drift that is the life of language. The evidence is overwhelming that this drift has a certain consistent direction. Its speed varies enormously according to circumstances that it is not always easy to define. We have already seen that Lithuanian is to-day nearer its Indo-European prototype than was the hypothetical Germanic mother-tongue five hundred or a thousand years before Christ. German has moved more slowly than English; in some respects it stands roughly midway between English and Anglo-Saxon, in others it has of course diverged from the Anglo-Saxon line. When I pointed out in the preceding chapter that dialects formed because a language broken up into local segments could not move along the same drift in all of these segments, I meant of course that it could not move along identically the same drift. The general drift of a language has its depths. At the surface the current is relatively fast. In certain features dialects drift apart rapidly. By that very fact these features betray themselves as less fundamental to the genius of the language than the more slowly modifiable features in which the dialects keep together long after they have grown to be mutually alien forms of speech. But this is not all. The momentum of the more fundamental, the pre-dialectic, drift is often such that languages long disconnected will pass through the same or strikingly similar phases. In many such cases it is perfectly clear that there could have been no dialectic interinfluencing.

I’ve chosen to dive deeper into the reasons behind our reluctance to use a phrase like “Whom did you see?” and to highlight some specific and general trends in English that result from this uncertainty, rather than just discussing linguistic change in a vague way. What holds true for this particular phrase applies to language as a whole. Nothing is completely static. Every word, every grammatical element, every phrase, every sound and accent is a slowly evolving formation, shaped by the invisible and impersonal flow that defines the life of language. The evidence clearly shows that this flow has a consistent direction. Its speed can vary greatly depending on circumstances that aren't always easy to pinpoint. We've already noted that Lithuanian today is closer to its Indo-European roots than the theoretical Germanic ancestor was five hundred or a thousand years before Christ. German has evolved more slowly than English; in some aspects, it’s roughly in between English and Anglo-Saxon, while in other ways, it has diverged from the Anglo-Saxon path. When I mentioned in the previous chapter that dialects formed because a language, broken into local segments, couldn't evolve along the same trajectory in all these segments, I meant that it couldn't evolve in exactly the same way. The overall trajectory of a language has different layers. On the surface, the current moves relatively quickly. Certain characteristics cause dialects to diverge rapidly. Because of this, these characteristics reveal themselves as less essential to the core of the language than the more slowly changing features that allow the dialects to remain connected long after they have become distinct forms of speech. But that’s not all. The momentum of the more fundamental, pre-dialectical trends is often strong enough that languages that have been separated for a long time will go through the same or remarkably similar stages. In many cases, it's clear that there could have been no dialectal influence between them.

These parallelisms in drift may operate in the phonetic as well as in the morphological sphere, or they may affect both at the same time. Here is an interesting example. The English type of plural represented by foot: feet, mouse: mice is strictly parallel to the German Fuss: Füsse, Maus: Mäuse. One would be inclined to surmise that these dialectic forms go back to old Germanic or West-Germanic alternations of the same type. But the documentary evidence shows conclusively that there could have been no plurals of this type in primitive Germanic. There is no trace of such vocalic mutation (“umlaut”) in Gothic, our most archaic Germanic language. More significant still is the fact that it does not appear in our oldest Old High German texts and begins to develop only at the very end of the Old High German period (circa 1000 A.D.). In the Middle High German period the mutation was carried through in all dialects. The typical Old High German forms are singular fuoss, plural fuossi;[145] singular mus, plural musi. The corresponding Middle High German forms are fuoss, füesse; mus, müse. Modern German Fuss: Füsse, Maus: Mäuse are the regular developments of these medieval forms. Turning to Anglo-Saxon, we find that our modern English forms correspond to fot, fet; mus, mys.[146] These forms are already in use in the earliest English monuments that we possess, dating from the eighth century, and thus antedate the Middle High German forms by three hundred years or more. In other words, on this particular point it took German at least three hundred years to catch up with a phonetic-morphological drift[147] that had long been under way in English. The mere fact that the affected vowels of related words (Old High German uo, Anglo-Saxon o) are not always the same shows that the affection took place at different periods in German and English.[148] There was evidently some general tendency or group of tendencies at work in early Germanic, long before English and German had developed as such, that eventually drove both of these dialects along closely parallel paths.

These similarities in change can occur in both sound patterns and word structure, or they might influence both at the same time. Here's an interesting example. The English plural forms like foot: feet, mouse: mice are directly comparable to the German Fuss: Füsse, Maus: Mäuse. One might think that these forms originate from old Germanic or West-Germanic variations of a similar type. However, the available evidence clearly indicates that there were no plural forms of this kind in early Germanic. There is no sign of such vowel changes (“umlaut”) in Gothic, our oldest Germanic language. Even more importantly, it doesn't appear in the earliest Old High German texts and only starts to develop at the very end of the Old High German period (around 1000 A.D.). During the Middle High German period, this vowel change became standardized across all dialects. The typical Old High German forms are singular fuoss, plural fuossi; singular mus, plural musi. The corresponding Middle High German forms are fuoss, füesse; mus, müse. Modern German Fuss: Füsse, Maus: Mäuse are the typical developments of these medieval forms. Looking at Anglo-Saxon, we see that our modern English forms correspond to fot, fet; mus, mys. These forms were already in use in the earliest English texts we have, dating back to the eighth century, which means they predate the Middle High German forms by three hundred years or more. In other words, on this specific point, it took German at least three hundred years to catch up with a phonetic and morphological change that had been happening in English for a long time. The fact that the affected vowels of related words (Old High German uo, Anglo-Saxon o) are not always the same indicates that the changes occurred at different times in German and English. Clearly, there was some general tendency, or group of tendencies, at play in early Germanic, long before English and German had become distinct languages, which eventually led both dialects down closely parallel paths.

How did such strikingly individual alternations as fot: fet, fuoss: füesse develop? We have now reached what is probably the most central problem in linguistic history, gradual phonetic change. “Phonetic laws” make up a large and fundamental share of the subject-matter of linguistics. Their influence reaches far beyond the proper sphere of phonetics and invades that of morphology, as we shall see. A drift that begins as a slight phonetic readjustment or unsettlement may in the course of millennia bring about the most profound structural changes. The mere fact, for instance, that there is a growing tendency to throw the stress automatically on the first syllable of a word may eventually change the fundamental type of the language, reducing its final syllables to zero and driving it to the use of more and more analytical or symbolic[149] methods. The English phonetic laws involved in the rise of the words foot, feet, mouse and mice from their early West-Germanic prototypes fot, foti, mus, musi[150] may be briefly summarized as follows:

How did such distinct individual changes like fot: fet, fuoss: füesse come about? We have now reached what is likely the most central issue in language history, gradual phonetic change. “Phonetic laws” form a large and essential part of linguistics. Their impact extends far beyond phonetics and encroaches on morphology, as we will explore. A shift that starts as a minor phonetic adjustment or instability can over thousands of years lead to significant structural changes. For example, the simple fact that there's an increasing tendency to place stress on the first syllable of a word may eventually alter the fundamental structure of the language, reducing its final syllables to nothing and pushing it towards more analytical or symbolic[149] methods. The English phonetic laws involved in the development of the words foot, feet, mouse, and mice from their early West-Germanic origins fot, foti, mus, musi[150] can be summarized as follows:

In foti “feet” the long o was colored by the following i to long ö, that is, o kept its lip-rounded quality and its middle height of tongue position but anticipated the front tongue position of the i; ö is the resulting compromise. This assimilatory change was regular, i.e., every accented long o followed by an i in the following syllable automatically developed to long ö; hence tothi “teeth” became töthi, fodian “to feed” became födian. At first there is no doubt the alternation between o and ö was not felt as intrinsically significant. It could only have been an unconscious mechanical adjustment such as may be observed in the speech of many to-day who modify the “oo” sound of words like you and few in the direction of German ü without, however, actually departing far enough from the “oo” vowel to prevent their acceptance of who and you as satisfactory rhyming words. Later on the quality of the ö vowel must have departed widely enough from that of o to enable ö to rise in consciousness[151] as a neatly distinct vowel. As soon as this happened, the expression of plurality in föti, töthi, and analogous words became symbolic and fusional, not merely fusional.
In musi “mice” the long u was colored by the following i to long ü. This change also was regular; lusi “lice” became lüsi, kui “cows” became küi (later simplified to kü; still preserved as ki- in kine), fulian “to make foul” became fülian (still preserved as -file in defile). The psychology of this phonetic law is entirely analogous to that of 1.
The old drift toward reducing final syllables, a rhythmic consequence of the strong Germanic stress on the first syllable, now manifested itself. The final -i, originally an important functional element, had long lost a great share of its value, transferred as that was to the symbolic vowel change (o: ö). It had little power of resistance, therefore, to the drift. It became dulled to a colorless -e; föti became föte.
The weak -e finally disappeared. Probably the forms föte and föt long coexisted as prosodic variants according to the rhythmic requirements of the sentence, very much as Füsse and Füss’ now coexist in German.
The ö of föt became “unrounded” to long e (our present a of fade). The alternation of fot: foti, transitionally fot: föti, föte, föt, now appears as fot: fet. Analogously, töth appears as teth, födian as fedian, later fedan. The new long e-vowel “fell together” with the older e-vowel already existent (e.g., her “here,” he “he”). Henceforward the two are merged and their later history is in common. Thus our present he has the same vowel as feet, teeth, and feed. In other words, the old sound pattern o, e, after an interim of o, ö, e, reappeared as o, e, except that now the e had greater “weight” than before.
Fot: fet, mus: müs (written mys) are the typical forms of Anglo-Saxon literature. At the very end of the Anglo-Saxon period, say about 1050 to 1100 A.D., the ü, whether long or short, became unrounded to i. Mys was then pronounced mis with long i (rhyming with present niece). The change is analogous to 5, but takes place several centuries later.
In Chaucer’s day (circa 1350-1400 A.D.) the forms were still fot: fet (written foot, feet) and mus: mis (written very variably, but mous, myse are typical). About 1500 all the long i-vowels, whether original (as in write, ride, wine) or unrounded from Anglo-Saxon ü (as in hide, bride, mice, defile), became diphthongized to ei (i.e., e of met + short i). Shakespeare pronounced mice as meis (almost the same as the present Cockney pronunciation of mace).
About the same time the long u-vowels were diphthongized to ou (i.e., o of present Scotch not + u of full). The Chaucerian mus: mis now appears as the Shakespearean mous: meis. This change may have manifested itself somewhat later than 7; all English dialects have diphthongized old Germanic long i,[152] but the long undiphthongized u is still preserved in Lowland Scotch, in which house and mouse rhyme with our loose. 7 and 8 are analogous developments, as were 5 and 6; 8 apparently lags behind 7 as 6, centuries earlier, lagged behind 7.
Some time before 1550 the long e of fet (written feet) took the position that had been vacated by the old long i, now diphthongized (see 7), i.e., e took the higher tongue position of i. Our (and Shakespeare’s) “long e” is, then, phonetically the same as the old long i. Feet now rhymed with the old write and the present beat.
About the same time the long o of fot (written foot) took the position that had been vacated by the old long u, now diphthongized (see 8), i.e., o took the higher tongue position of u. Our (and Shakespeare’s) “long oo” is phonetically the same as the old long u. Foot now rhymed with the old out and the present boot. To summarize 7 to 10, Shakespeare pronounced meis, mous, fit, fut, of which meis and mous would affect our ears as a rather “mincing” rendering of our present mice and mouse, fit would sound practically identical with (but probably a bit more “drawled” than) our present feet, while foot, rhyming with boot, would now be set down as “broad Scotch.”
Gradually the first vowel of the diphthong in mice (see 7) was retracted and lowered in position. The resulting diphthong now varies in different English dialects, but ai (i.e., a of father, but shorter, + short i) may be taken as a fairly accurate rendering of its average quality.[153] What we now call the “long i” (of words like ride, bite, mice) is, of course, an ai-diphthong. Mice is now pronounced mais.
Analogously to 11, the first vowel of the diphthong in mouse (see 8) was unrounded and lowered in position. The resulting diphthong may be phonetically rendered au, though it too varies considerably according to dialect. Mouse, then, is now pronounced maus.
The vowel of foot (see 10) became “open” in quality and shorter in quantity, i.e., it fell together with the old short u-vowel of words like full, wolf, wool. This change has taken place in a number of words with an originally long u (Chaucerian long close o), such as forsook, hook, book, look, rook, shook, all of which formerly had the vowel of boot. The older vowel, however, is still preserved in most words of this class, such as fool, moon, spool, stoop. It is highly significant of the nature of the slow spread of a “phonetic law” that there is local vacillation at present in several words. One hears roof, soot, and hoop, for instance, both with the “long” vowel of boot and the “short” of foot. It is impossible now, in other words, to state in a definitive manner what is the “phonetic law” that regulated the change of the older foot (rhyming with boot) to the present foot. We know that there is a strong drift towards the short, open vowel of foot, but whether or not all the old “long oo” words will eventually be affected we cannot presume to say. If they all, or practically all, are taken by the drift, phonetic law 13 will be as “regular,” as sweeping, as most of the twelve that have preceded it. If not, it may eventually be possible, if past experience is a safe guide, to show that the modified words form a natural phonetic group, that is, that the “law” will have operated under certain definable limiting conditions, e.g., that all words ending in a voiceless consonant (such as p, t, k, f) were affected (e.g., hoof, foot, look, roof), but that all words ending in the oo-vowel or in a voiced consonant remained unaffected (e.g., do, food, move, fool). Whatever the upshot, we may be reasonably certain that when the “phonetic law” has run its course, the distribution of “long” and “short” vowels in the old oo-words will not seem quite as erratic as at the present transitional moment.[154] We learn, incidentally, the fundamental fact that phonetic laws do not work with spontaneous automatism, that they are simply a formula for a consummated drift that sets in at a psychologically exposed point and gradually worms its way through a gamut of phonetically analogous forms.

It will be instructive to set down a table of form sequences, a kind of gross history of the words foot, feet, mouse, mice for the last 1500 years:[155]

It will be helpful to create a table of form sequences, a sort of general history of the words foot, feet, mouse, mice over the past 1500 years:[155]

fot: foti; mus: musi (West Germanic)
fot: föti; mus: müsi
fot: föte; mus: müse
fot: föt; mus: müs
fot: fet; mus: müs (Anglo-Saxon)
fot: fet; mus: mis(Chaucer)
fot: fet; mous: meis
fut (rhymes with boot): fit; mous: meis (Shakespeare)
fut: fit; maus: mais
fut (rhymes with put): fit; maus: mais (English of 1900)

It will not be necessary to list the phonetic laws that gradually differentiated the modern German equivalents of the original West Germanic forms from their English cognates. The following table gives a rough idea of the form sequences in German:[156]

It’s not necessary to outline the phonetic rules that slowly set apart the modern German versions of the original West Germanic forms from their English counterparts. The table below provides a general overview of the form sequences in German:[156]

fot: foti; mus: musi (West Germanic)
foss:[157] fossi; mus: musi
fuoss: fuossi; mus: musi (Old High German)
fuoss: füessi; mus: müsi
fuoss: füesse; mus: müse (Middle High German)
fuoss: füesse; mus: müze[158]
fuos: füese; mus: müze
fuos: füese; mous: möüze
fus: füse; mous: möüze (Luther)
fus: füse; maus: moize (German of 1900)

We cannot even begin to ferret out and discuss all the psychological problems that are concealed behind these bland tables. Their general parallelism is obvious. Indeed we might say that to-day the English and German forms resemble each other more than does either set the West Germanic prototypes from which each is independently derived. Each table illustrates the tendency to reduction of unaccented syllables, the vocalic modification of the radical element under the influence of the following vowel, the rise in tongue position of the long middle vowels (English o to u, e to i; German o to uo to u, üe to ü), the diphthongizing of the old high vowels (English i to ei to ai; English and German u to ou to au; German ü to öü to oi). These dialectic parallels cannot be accidental. They are rooted in a common, pre-dialectic drift.

We can't even start to uncover and discuss all the psychological issues hidden behind these simple tables. Their general similarities are clear. In fact, we could say that today the English and German forms look more alike than either does to the West Germanic prototypes from which they each independently evolved. Each table shows the tendency to reduce unstressed syllables, the vocalic change of the root element due to the following vowel, the raising of the tongue position of the long middle vowels (English o to u, e to i; German o to uo to u, üe to ü), the diphthongization of the old high vowels (English i to ei to ai; English and German u to ou to au; German ü to öü to oi). These dialect parallels can't be coincidental. They are based on a shared, pre-dialectal trend.

Phonetic changes are “regular.” All but one (English table, X.), and that as yet uncompleted, of the particular phonetic laws represented in our tables affect all examples of the sound in question or, if the phonetic change is conditional, all examples of the same sound that are analogously circumstanced.[159] An example of the first type of change is the passage in English of all old long i-vowels to diphthongal ai via ei. The passage could hardly have been sudden or automatic, but it was rapid enough to prevent an irregularity of development due to cross drifts. The second type of change is illustrated in the development of Anglo-Saxon long o to long e, via ö, under the influence of a following i. In the first case we may say that au mechanically replaced long u, in the second that the old long o “split” into two sounds—long o, eventually u, and long e, eventually i. The former type of change did no violence to the old phonetic pattern, the formal distribution of sounds into groups; the latter type rearranged the pattern somewhat. If neither of the two sounds into which an old one “splits” is a new sound, it means that there has been a phonetic leveling, that two groups of words, each with a distinct sound or sound combination, have fallen together into one group. This kind of leveling is quite frequent in the history of language. In English, for instance, we have seen that all the old long ü-vowels, after they had become unrounded, were indistinguishable from the mass of long i-vowels. This meant that the long i-vowel became a more heavily weighted point of the phonetic pattern than before. It is curious to observe how often languages have striven to drive originally distinct sounds into certain favorite positions, regardless of resulting confusions.[160] In Modern Greek, for instance, the vowel i is the historical resultant of no less than ten etymologically distinct vowels (long and short) and diphthongs of the classical speech of Athens. There is, then, good evidence to show that there are general phonetic drifts toward particular sounds.

Phonetic changes are "regular." All but one (English table, X.), which is still incomplete, of the specific phonetic rules shown in our tables affect all instances of the sound in question or, if the phonetic change is conditional, all examples of the same sound that are similarly situated.[159] An example of the first type of change is the transition in English of all old long i-vowels to diphthongal ai through ei. This change couldn't have happened suddenly or automatically, but it was quick enough to avoid irregular development due to cross influences. The second type of change is shown in the evolution of Anglo-Saxon long o to long e, through ö, under the influence of a following i. In the first case, we can say that au mechanically replaced long u, while in the second, the old long o "split" into two sounds—long o, eventually u, and long e, eventually i. The first type of change didn't disrupt the old phonetic pattern, which classified sounds into groups; the latter type modified the pattern a bit. If neither of the two sounds that an old one "splits" into is a new sound, it indicates that there has been phonetic leveling, where two groups of words, each with a distinct sound or sound combination, have merged into one group. This kind of leveling is quite common in the history of language. In English, for example, we have seen that all the old long ü-vowels, after they became unrounded, were indistinguishable from the mass of long i-vowels. This meant that the long i-vowel became a more significant part of the phonetic pattern than before. It's interesting to notice how often languages have tried to push originally distinct sounds into certain preferred positions, regardless of the resulting confusion.[160] For instance, in Modern Greek, the vowel i is the historical result of no less than ten etymologically distinct vowels (long and short) and diphthongs from the classical speech in Athens. Therefore, there is substantial evidence showing that there are general phonetic trends toward specific sounds.

More often the phonetic drift is of a more general character. It is not so much a movement toward a particular set of sounds as toward particular types of articulation. The vowels tend to become higher or lower, the diphthongs tend to coalesce into monophthongs, the voiceless consonants tend to become voiced, stops tend to become spirants. As a matter of fact, practically all the phonetic laws enumerated in the two tables are but specific instances of such far-reaching phonetic drifts. The raising of English long o to u and of long e to i, for instance, was part of a general tendency to raise the position of the long vowels, just as the change of t to ss in Old High German was part of a general tendency to make voiceless spirants of the old voiceless stopped consonants. A single sound change, even if there is no phonetic leveling, generally threatens to upset the old phonetic pattern because it brings about a disharmony in the grouping of sounds. To reëstablish the old pattern without going back on the drift the only possible method is to have the other sounds of the series shift in analogous fashion. If, for some reason or other, p becomes shifted to its voiced correspondent b, the old series p, t, k appears in the unsymmetrical form b, t, k. Such a series is, in phonetic effect, not the equivalent of the old series, however it may answer to it in etymology. The general phonetic pattern is impaired to that extent. But if t and k are also shifted to their voiced correspondents d and g, the old series is reëstablished in a new form: b, d, g. The pattern as such is preserved, or restored. Provided that the new series b, d, g does not become confused with an old series b, d, g of distinct historical antecedents. If there is no such older series, the creation of a b, d, g series causes no difficulties. If there is, the old patterning of sounds can be kept intact only by shifting the old b, d, g sounds in some way. They may become aspirated to bh, dh, gh or spirantized or nasalized or they may develop any other peculiarity that keeps them intact as a series and serves to differentiate them from other series. And this sort of shifting about without loss of pattern, or with a minimum loss of it, is probably the most important tendency in the history of speech sounds. Phonetic leveling and “splitting” counteract it to some extent but, on the whole, it remains the central unconscious regulator of the course and speed of sound changes.

Phonetic changes often have a broader nature. It’s not just about moving towards a specific group of sounds, but rather a shift towards particular ways of articulating them. Vowels usually become higher or lower, diphthongs often merge into monophthongs, voiceless consonants tend to become voiced, and stops often turn into spirants. In fact, almost all the phonetic laws listed in the two tables are just specific examples of such extensive phonetic changes. For instance, the raising of the English long o to u and long e to i was part of a general trend to elevate the long vowels, just like the change of t to ss in Old High German aimed at turning old voiceless stops into voiceless spirants. A single sound change can disrupt the old phonetic structure, even without phonetic leveling, because it causes an imbalance in the sound groupings. To restore the old pattern without reversing the change, the only way is to have the other sounds in the series shift similarly. If, for some reason, p changes to its voiced counterpart b, the old series p, t, k appears as the unbalanced form b, t, k. This new series, despite its etymological connections, does not function phonetically as the old series did, which harms the general phonetic pattern. However, if t and k also change to their voiced equivalents d and g, the old series is reestablished in a new form: b, d, g. The pattern itself is preserved or restored. Provided that the new series b, d, g isn’t confused with an older series b, d, g with different historical roots. If there’s no such older series, creating a b, d, g series poses no issues. But if there is, the original sound arrangement can only stay intact if the old b, d, g sounds shift in some way. They might become aspirated to bh, dh, gh or change into spirants or nasals, or develop any other unique qualities that help them maintain their series status and distinguish them from others. This type of rearrangement, without losing the pattern—or losing it minimally—is likely the most crucial tendency in the history of speech sounds. Phonetic leveling and “splitting” somewhat counteract it, but overall, it remains the main unconscious force guiding the direction and speed of sound changes.

The desire to hold on to a pattern, the tendency to “correct” a disturbance by an elaborate chain of supplementary changes, often spread over centuries or even millennia—these psychic undercurrents of language are exceedingly difficult to understand in terms of individual psychology, though there can be no denial of their historical reality. What is the primary cause of the unsettling of a phonetic pattern and what is the cumulative force that selects these or those particular variations of the individual on which to float the pattern readjustments we hardly know. Many linguistic students have made the fatal error of thinking of sound change as a quasi-physiological instead of as a strictly psychological phenomenon, or they have tried to dispose of the problem by bandying such catchwords as “the tendency to increased ease of articulation” or “the cumulative result of faulty perception” (on the part of children, say, in learning to speak). These easy explanations will not do. “Ease of articulation” may enter in as a factor, but it is a rather subjective concept at best. Indians find hopelessly difficult sounds and sound combinations that are simple to us; one language encourages a phonetic drift that another does everything to fight. “Faulty perception” does not explain that impressive drift in speech sounds which I have insisted upon. It is much better to admit that we do not yet understand the primary cause or causes of the slow drift in phonetics, though we can frequently point to contributing factors. It is likely that we shall not advance seriously until we study the intuitional bases of speech. How can we understand the nature of the drift that frays and reforms phonetic patterns when we have never thought of studying sound patterning as such and the “weights” and psychic relations of the single elements (the individual sounds) in these patterns?

The desire to stick to a certain pattern, the tendency to “fix” a disturbance with a complex series of additional changes, often spread over centuries or even thousands of years—these underlying psychological forces in language are really hard to grasp in terms of individual psychology, even though their historical significance is undeniable. What is the main reason for the unsettling of a phonetic pattern, and what cumulative influence decides which specific variations of the individual the pattern adjustments rely on is still unclear. Many language students have made the mistake of viewing sound change as a sort of physical process instead of a purely psychological one, or they’ve tried to simplify the issue by tossing around phrases like “the tendency for easier articulation” or “the cumulative effect of faulty perception” (like in children learning to speak). These simple explanations fall short. “Ease of articulation” might be a factor, but it’s a rather subjective idea at best. There are sounds and sound combinations that seem impossibly complicated to some Indians that are simple for us; one language fosters a phonetic shift that another actively resists. “Faulty perception” doesn’t clarify the significant shifts in speech sounds that I’ve emphasized. It's better to accept that we still don’t fully understand the main cause or causes of the slow drift in phonetics, even though we can often identify contributing factors. We probably won’t make real progress until we examine the intuitive foundations of speech. How can we grasp the nature of the changes that unravel and reshape phonetic patterns when we haven’t even considered studying sound patterning as such and the “weights” and psychological connections of the individual elements (the individual sounds) in these patterns?

Every linguist knows that phonetic change is frequently followed by morphological rearrangements, but he is apt to assume that morphology exercises little or no influence on the course of phonetic history. I am inclined to believe that our present tendency to isolate phonetics and grammar as mutually irrelevant linguistic provinces is unfortunate. There are likely to be fundamental relations between them and their respective histories that we do not yet fully grasp. After all, if speech sounds exist merely because they are the symbolic carriers of significant concepts and groupings of concepts, why may not a strong drift or a permanent feature in the conceptual sphere exercise a furthering or retarding influence on the phonetic drift? I believe that such influences may be demonstrated and that they deserve far more careful study than they have received.

Every linguist understands that changes in pronunciation often lead to shifts in word structure, but they tend to think that word structure has little to no effect on the development of pronunciation over time. I believe our current approach of viewing pronunciation and grammar as completely separate areas is a mistake. There are probably key connections between them and their histories that we haven't fully understood yet. After all, if speech sounds exist solely to symbolize important concepts and categories of concepts, why couldn't a strong trend or a lasting aspect in the realm of concepts have an impact on the changes in pronunciation? I think these influences can be proven and warrant much more thorough examination than they have received.

This brings us back to our unanswered question: How is it that both English and German developed the curious alternation of unmodified vowel in the singular (foot, Fuss) and modified vowel in the plural (feet, Füsse)? Was the pre-Anglo-Saxon alternation of fot and föti an absolutely mechanical matter, without other than incidental morphological interest? It is always so represented, and, indeed, all the external facts support such a view. The change from o to ö, later e, is by no means peculiar to the plural. It is found also in the dative singular (fet), for it too goes back to an older foti. Moreover, fet of the plural applies only to the nominative and accusative; the genitive has fota, the dative fotum. Only centuries later was the alternation of o and e reinterpreted as a means of distinguishing number; o was generalized for the singular, e for the plural. Only when this reassortment of forms took place[161] was the modern symbolic value of the foot: feet alternation clearly established. Again, we must not forget that o was modified to ö (e) in all manner of other grammatical and derivative formations. Thus, a pre-Anglo-Saxon hohan (later hon) “to hang” corresponded to a höhith, hehith (later hehth) “hangs”; to dom “doom,” blod “blood,” and fod “food” corresponded the verbal derivatives dömian (later deman) “to deem,” blödian (later bledan) “to bleed,” and födian (later fedan) “to feed.” All this seems to point to the purely mechanical nature of the modification of o to ö to e. So many unrelated functions were ultimately served by the vocalic change that we cannot believe that it was motivated by any one of them.

This brings us back to our unanswered question: How did both English and German develop the interesting pattern of an unmodified vowel in the singular (foot, Fuss) and a modified vowel in the plural (feet, Füsse)? Was the pre-Anglo-Saxon change from fot to föti just a mechanical process, with no real morphological significance other than coincidental? It's usually presented that way, and indeed, all the external facts support this perspective. The change from o to ö, later e, isn’t unique to the plural. It also occurs in the dative singular (fet), which also comes from an older foti. Furthermore, fet in the plural only applies to the nominative and accusative; the genitive is fota, and the dative is fotum. Only many years later was the change from o to e reinterpreted as a way to indicate number; o became standard for the singular, while e was used for the plural. It was only during this reorganization of forms[161] that the modern symbolic meaning of the foot: feet alternation was clearly established. Additionally, we must remember that o was changed to ö (e) in many other grammatical and derivative forms. For instance, a pre-Anglo-Saxon hohan (later hon) “to hang” corresponds with höhith, hehith (later hehth) “hangs”; to dom “doom,” blod “blood,” and fod “food” correspond the verbal derivatives dömian (later deman) “to deem,” blödian (later bledan) “to bleed,” and födian (later fedan) “to feed.” All of this suggests the purely mechanical nature of the change from o to ö to e. So many unrelated functions were ultimately served by this vowel change that it’s hard to believe it was driven by any one of them.

The German facts are entirely analogous. Only later in the history of the language was the vocalic alternation made significant for number. And yet consider the following facts. The change of foti to föti antedated that of föti to föte, föt. This may be looked upon as a “lucky accident,” for if foti had become fote, fot before the -i had had the chance to exert a retroactive influence on the o, there would have been no difference between the singular and the plural. This would have been anomalous in Anglo-Saxon for a masculine noun. But was the sequence of phonetic changes an “accident”? Consider two further facts. All the Germanic languages were familiar with vocalic change as possessed of functional significance. Alternations like sing, sang, sung (Anglo-Saxon singan, sang, sungen) were ingrained in the linguistic consciousness. Further, the tendency toward the weakening of final syllables was very strong even then and had been manifesting itself in one way and another for centuries. I believe that these further facts help us to understand the actual sequence of phonetic changes. We may go so far as to say that the o (and u) could afford to stay the change to ö (and ü) until the destructive drift had advanced to the point where failure to modify the vowel would soon result in morphological embarrassment. At a certain moment the -i ending of the plural (and analogous endings with i in other formations) was felt to be too weak to quite bear its functional burden. The unconscious Anglo-Saxon mind, if I may be allowed a somewhat summary way of putting the complex facts, was glad of the opportunity afforded by certain individual variations, until then automatically canceled out, to have some share of the burden thrown on them. These particular variations won through because they so beautifully allowed the general phonetic drift to take its course without unsettling the morphological contours of the language. And the presence of symbolic variation (sing, sang, sung) acted as an attracting force on the rise of a new variation of similar character. All these factors were equally true of the German vocalic shift. Owing to the fact that the destructive phonetic drift was proceeding at a slower rate in German than in English, the preservative change of uo to üe (u to ü) did not need to set in until 300 years or more after the analogous English change. Nor did it. And this is to my mind a highly significant fact. Phonetic changes may sometimes be unconsciously encouraged in order to keep intact the psychological spaces between words and word forms. The general drift seizes upon those individual sound variations that help to preserve the morphological balance or to lead to the new balance that the language is striving for.

The German facts are completely similar. Only later in the history of the language did vocalic alternation become important for number. And yet, think about these facts. The change from foti to föti happened before the change from föti to föte, föt. This could be seen as a "lucky accident," because if foti had turned into fote, fot before the -i had the chance to influence the o retroactively, there would have been no distinction between the singular and plural. This would be unusual in Anglo-Saxon for a masculine noun. But was the order of phonetic changes an “accident”? Consider two more facts. All the Germanic languages recognized vocalic change as functionally significant. Alternations like sing, sang, sung (Anglo-Saxon singan, sang, sungen) were deeply embedded in the linguistic awareness. Additionally, the trend toward weakening final syllables was already strong and had been showing itself in various ways for centuries. I believe these additional facts help us make sense of the actual order of phonetic changes. We can go so far as to say that the o (and u) could afford to delay the change to ö (and ü) until the destructive trend had progressed to the point where not changing the vowel would soon lead to morphological confusion. At a certain moment, the -i ending of the plural (and similar endings with i in other forms) was perceived as too weak to fully bear its functional weight. The unconscious Anglo-Saxon mind, if I can summarize the complex facts, was glad to seize the opportunity presented by certain individual variations, which had previously been automatically canceled out, to share some of the burden. These specific variations succeeded because they allowed the general phonetic drift to occur without disturbing the morphological structure of the language. And the existence of symbolic variation (sing, sang, sung) acted as a magnet for the emergence of a new variation of a similar nature. All these factors were equally true for the German vocalic shift. Since the destructive phonetic shift was happening at a slower pace in German than in English, the preservative change from uo to üe (u to ü) didn’t need to occur until 300 years or more after the corresponding English change. And it didn’t. This is, in my view, a very significant fact. Phonetic changes may sometimes be unconsciously encouraged to maintain the psychological spaces between words and word forms. The general trend capitalizes on those individual sound variations that help preserve the morphological balance or lead to a new balance that the language is striving for.

I would suggest, then, that phonetic change is compacted of at least three basic strands: (1) A general drift in one direction, concerning the nature of which we know almost nothing but which may be suspected to be of prevailingly dynamic character (tendencies, e.g., to greater or less stress, greater or less voicing of elements); (2) A readjusting tendency which aims to preserve or restore the fundamental phonetic pattern of the language; (3) A preservative tendency which sets in when a too serious morphological unsettlement is threatened by the main drift. I do not imagine for a moment that it is always possible to separate these strands or that this purely schematic statement does justice to the complex forces that guide the phonetic drift. The phonetic pattern of a language is not invariable, but it changes far less readily than the sounds that compose it. Every phonetic element that it possesses may change radically and yet the pattern remain unaffected. It would be absurd to claim that our present English pattern is identical with the old Indo-European one, yet it is impressive to note that even at this late day the English series of initial consonants:

I would suggest, then, that phonetic change consists of at least three basic components: (1) A general trend in one direction, about which we know very little but may suspect to be primarily dynamic (for example, tendencies toward more or less stress, and more or less voicing of elements); (2) A readjustment tendency that aims to preserve or restore the fundamental phonetic pattern of the language; (3) A preservative tendency that kicks in when a significant morphological disruption is threatened by the main trend. I don’t believe for a second that it’s always possible to separate these components or that this purely schematic explanation does justice to the complex forces that drive phonetic change. The phonetic pattern of a language isn’t fixed, but it changes much less readily than the sounds that make it up. Every phonetic element can change dramatically while the pattern remains unaffected. It would be ridiculous to claim that our current English pattern is identical to the old Indo-European one, yet it’s striking to note that even at this late stage, the English series of initial consonants:

p	t	k
b	d	g
f	th	h

corresponds point for point to the Sanskrit series:

corresponds exactly to the Sanskrit series:

b	d	g
bh	dh	gh
p	t	k

The relation between phonetic pattern and individual sound is roughly parallel to that which obtains between the morphologic type of a language and one of its specific morphological features. Both phonetic pattern and fundamental type are exceedingly conservative, all superficial appearances to the contrary notwithstanding. Which is more so we cannot say. I suspect that they hang together in a way that we cannot at present quite understand.

The connection between phonetic patterns and individual sounds is similar to the relationship between the morphological type of a language and one of its specific morphological features. Both phonetic patterns and fundamental types are very conservative, despite any outward appearances to the contrary. It's hard to determine which is more conservative. I suspect they are interconnected in a way that we don’t fully grasp yet.

If all the phonetic changes brought about by the phonetic drift were allowed to stand, it is probable that most languages would present such irregularities of morphological contour as to lose touch with their formal ground-plan. Sound changes work mechanically. Hence they are likely to affect a whole morphological group here—this does not matter—, only part of a morphological group there—and this may be disturbing. Thus, the old Anglo-Saxon paradigm:

If all the sound changes caused by phonetic drift were allowed to continue, it’s likely that most languages would show such irregularities in their structure that they would lose connection with their original design. Sound changes happen automatically. Therefore, they’re likely to impact an entire morphological category here—this is not an issue—while only affecting part of a morphological category there—and that can be problematic. So, the old Anglo-Saxon paradigm:

	Sing.	Plur.
N. Ac.	fot	fet (older foti)
G.	fotes	fota
D.	fet (older foti)	fotum

could not long stand unmodified. The o—e alternation was welcome in so far as it roughly distinguished the singular from the plural. The dative singular fet, however, though justified historically, was soon felt to be an intrusive feature. The analogy of simpler and more numerously represented paradigms created the form fote (compare, e.g., fisc “fish,” dative singular fisce). Fet as a dative becomes obsolete. The singular now had o throughout. But this very fact made the genitive and dative o-forms of the plural seem out of place. The nominative and accusative fet was naturally far more frequently in use than were the corresponding forms of the genitive and dative. These, in the end, could not but follow the analogy of fet. At the very beginning of the Middle English period, therefore, we find that the old paradigm has yielded to a more regular one:

could not remain unchanged for long. The o—e variation was useful because it somewhat distinguished the singular from the plural. The dative singular fet, however, although historically justified, soon felt like an unnecessary addition. The pattern of simpler and more commonly used forms led to the creation of fote (for example, fisc “fish,” dative singular fisce). Fet as a dative became outdated. The singular now consistently used o. But this change made the genitive and dative o-forms of the plural seem mismatched. The nominative and accusative fet was naturally used much more often than the corresponding genitive and dative forms. Ultimately, these had to conform to the pattern of fet. Thus, at the very start of the Middle English period, we see the old paradigm replaced by a more regular one:

		Sing.		Plur.
N. Ac.	*	fot	*	fet
G.	*	fotes		fete
D.		fote		feten

The starred forms are the old nucleus around which the new paradigm is built. The unstarred forms are not genealogical kin of their formal prototypes. They are analogical replacements.

The starred forms are the old core around which the new framework is built. The unstarred forms are not genealogical relatives of their formal prototypes. They are analogical substitutes.

The history of the English language teems with such levelings or extensions. Elder and eldest were at one time the only possible comparative and superlative forms of old (compare German alt, älter, der älteste; the vowel following the old-, alt- was originally an i, which modified the quality of the stem vowel). The general analogy of the vast majority of English adjectives, however, has caused the replacement of the forms elder and eldest by the forms with unmodified vowel, older and oldest. Elder and eldest survive only as somewhat archaic terms for the older and oldest brother or sister. This illustrates the tendency for words that are psychologically disconnected from their etymological or formal group to preserve traces of phonetic laws that have otherwise left no recognizable trace or to preserve a vestige of a morphological process that has long lost its vitality. A careful study of these survivals or atrophied forms is not without value for the reconstruction of the earlier history of a language or for suggestive hints as to its remoter affiliations.

The history of the English language is full of changes and expansions. Elder and eldest used to be the only comparative and superlative forms of old (compare German alt, älter, der älteste; the vowel following the old-, alt- was originally an i, which changed the quality of the stem vowel). However, the general pattern of the vast majority of English adjectives has led to elder and eldest being replaced by the forms with an unaltered vowel, older and oldest. Elder and eldest now mainly survive as somewhat old-fashioned terms for an older and oldest brother or sister. This shows the tendency for words that are psychologically disconnected from their origins or formal groups to keep traces of phonetic rules that have otherwise disappeared or to retain signs of a morphological process that has long stopped being active. A careful study of these remnants or diminished forms can provide valuable insights for reconstructing the earlier history of a language or offer hints about its more distant relationships.

Analogy may not only refashion forms within the confines of a related cluster of forms (a “paradigm”) but may extend its influence far beyond. Of a number of functionally equivalent elements, for instance, only one may survive, the rest yielding to its constantly widening influence. This is what happened with the English -s plural. Originally confined to a particular class of masculines, though an important class, the -s plural was gradually generalized for all nouns but a mere handful that still illustrate plural types now all but extinct (foot: feet, goose: geese, tooth: teeth, mouse: mice, louse: lice; ox: oxen; child: children; sheep: sheep, deer: deer). Thus analogy not only regularizes irregularities that have come in the wake of phonetic processes but introduces disturbances, generally in favor of greater simplicity or regularity, in a long established system of forms. These analogical adjustments are practically always symptoms of the general morphological drift of the language.

Analogy can not only reshape forms within a related group of forms (a “paradigm”) but can also influence much more broadly. For example, among a set of functionally equivalent elements, only one might survive, while the others fade away under its ever-expanding impact. This is what happened with the English -s plural. Initially limited to a specific group of masculine nouns, which was a significant group, the -s plural slowly became generalized to apply to all nouns except for a few that now showcase plural forms that are nearly extinct (foot: feet, goose: geese, tooth: teeth, mouse: mice, louse: lice; ox: oxen; child: children; sheep: sheep, deer: deer). Thus, analogy not only smooths out irregularities that arise from phonetic changes but also creates disturbances, usually promoting greater simplicity or regularity, within a long-established system of forms. These analogical changes are nearly always signs of the general morphological shift in the language.

A morphological feature that appears as the incidental consequence of a phonetic process, like the English plural with modified vowel, may spread by analogy no less readily than old features that owe their origin to other than phonetic causes. Once the e-vowel of Middle English fet had become confined to the plural, there was no theoretical reason why alternations of the type fot: fet and mus: mis might not have become established as a productive type of number distinction in the noun. As a matter of fact, it did not so become established. The fot: fet type of plural secured but a momentary foothold. It was swept into being by one of the surface drifts of the language, to be swept aside in the Middle English period by the more powerful drift toward the use of simple distinctive forms. It was too late in the day for our language to be seriously interested in such pretty symbolisms as foot: feet. What examples of the type arose legitimately, in other words via purely phonetic processes, were tolerated for a time, but the type as such never had a serious future.

A morphological feature that happens as a byproduct of a phonetic process, like the English plural with a changed vowel, can spread by analogy just as easily as older features that originated from non-phonetic causes. Once the e-vowel of Middle English fet was limited to the plural form, there was no theoretical reason why variations like fot: fet and mus: mis couldn't have developed as a common way to distinguish number in nouns. In reality, it didn't become established. The fot: fet plural type only gained a temporary foothold. It emerged as one of the surface trends in the language, only to be pushed aside in the Middle English period by a stronger trend toward using simple, distinctive forms. By that time, the language wasn't really interested in such charming symbolisms as foot: feet. Any examples of this type that did occur legitimately, meaning via purely phonetic processes, were tolerated for a while, but the type itself never had a strong future.

It was different in German. The whole series of phonetic changes comprised under the term “umlaut,” of which u: ü and au: oi (written äu) are but specific examples, struck the German language at a time when the general drift to morphological simplification was not so strong but that the resulting formal types (e.g., Fuss: Füsse; fallen “to fall”: fällen “to fell”; Horn “horn”: Gehörne “group of horns”; Haus “house”: Häuslein “little house”) could keep themselves intact and even extend to forms that did not legitimately come within their sphere of influence. “Umlaut” is still a very live symbolic process in German, possibly more alive to-day than in medieval times. Such analogical plurals as Baum “tree”: Bäume (contrast Middle High German boum: boume) and derivatives as lachen “to laugh”: Gelächter “laughter” (contrast Middle High German gelach) show that vocalic mutation has won through to the status of a productive morphologic process. Some of the dialects have even gone further than standard German, at least in certain respects. In Yiddish,[162] for instance, “umlaut” plurals have been formed where there are no Middle High German prototypes or modern literary parallels, e.g., tog “day”: teg “days” (but German Tag: Tage) on the analogy of gast “guest”: gest “guests” (German Gast: Gäste), shuch[163] “shoe”: shich “shoes” (but German Schuh: Schuhe) on the analogy of fus “foot”: fis “feet.” It is possible that “umlaut” will run its course and cease to operate as a live functional process in German, but that time is still distant. Meanwhile all consciousness of the merely phonetic nature of “umlaut” vanished centuries ago. It is now a strictly morphological process, not in the least a mechanical phonetic adjustment. We have in it a splendid example of how a simple phonetic law, meaningless in itself, may eventually color or transform large reaches of the morphology of a language.

It was different in German. The entire series of phonetic changes known as “umlaut,” like u: ü and au: oi (written äu), affected the German language at a time when the overall trend toward simpler morphology wasn't as strong. The resulting formal types (e.g., Fuss: Füsse; fallen “to fall”: fällen “to fell”; Horn “horn”: Gehörne “group of horns”; Haus “house”: Häuslein “little house”) were able to remain intact and even expand to forms that didn't originally belong to them. “Umlaut” is still a very active symbolic process in German, possibly more so today than in medieval times. Analogical plurals like Baum “tree”: Bäume (in contrast to Middle High German boum: boume) and derivatives like lachen “to laugh”: Gelächter “laughter” (contrasting with Middle High German gelach) show that vowel mutation has become a productive morphological process. Some dialects have even gone beyond standard German in some respects. In Yiddish,[162] for example, “umlaut” plurals have been created without Middle High German prototypes or modern literary parallels, such as tog “day”: teg “days” (but German Tag: Tage) based on gast “guest”: gest “guests” (German Gast: Gäste), and shuch[163] “shoe”: shich “shoes” (but German Schuh: Schuhe) based on fus “foot”: fis “feet.” It’s possible that “umlaut” will eventually fade away and stop being a functional process in German, but that time is still far off. Meanwhile, the awareness of “umlaut” being merely phonetic disappeared centuries ago. It is now a purely morphological process, not just a mechanical phonetic adjustment. We see in it a great example of how a simple phonetic rule, which is meaningless on its own, can eventually impact or transform significant aspects of a language's morphology.

IX

How Languages Influence Each Other

Languages, like cultures, are rarely sufficient unto themselves. The necessities of intercourse bring the speakers of one language into direct or indirect contact with those of neighboring or culturally dominant languages. The intercourse may be friendly or hostile. It may move on the humdrum plane of business and trade relations or it may consist of a borrowing or interchange of spiritual goods—art, science, religion. It would be difficult to point to a completely isolated language or dialect, least of all among the primitive peoples. The tribe is often so small that intermarriages with alien tribes that speak other dialects or even totally unrelated languages are not uncommon. It may even be doubted whether intermarriage, intertribal trade, and general cultural interchanges are not of greater relative significance on primitive levels than on our own. Whatever the degree or nature of contact between neighboring peoples, it is generally sufficient to lead to some kind of linguistic interinfluencing. Frequently the influence runs heavily in one direction. The language of a people that is looked upon as a center of culture is naturally far more likely to exert an appreciable influence on other languages spoken in its vicinity than to be influenced by them. Chinese has flooded the vocabularies of Corean, Japanese, and Annamite for centuries, but has received nothing in return. In the western Europe of medieval and modern times French has exercised a similar, though probably a less overwhelming, influence. English borrowed an immense number of words from the French of the Norman invaders, later also from the court French of Isle de France, appropriated a certain number of affixed elements of derivational value (e.g., -ess of princess, -ard of drunkard, -ty of royalty), may have been somewhat stimulated in its general analytic drift by contact with French,[164] and even allowed French to modify its phonetic pattern slightly (e.g., initial v and j in words like veal and judge; in words of Anglo-Saxon origin v and j can only occur after vowels, e.g., over, hedge). But English has exerted practically no influence on French.

Languages, like cultures, rarely exist in isolation. The need for communication brings speakers of one language into direct or indirect contact with those who speak neighboring or dominant languages. This interaction can be friendly or hostile. It might happen in the everyday context of business and trade or involve a sharing of cultural elements—like art, science, and religion. It's hard to find a completely isolated language or dialect, especially among primitive peoples. Tribes are often small, so intermarriages with neighboring tribes that speak different dialects or even entirely unrelated languages are common. It could also be argued that intermarriage, intertribal trade, and general cultural exchanges are even more significant at primitive levels than they are in our own society. Regardless of the level or type of contact between neighboring groups, it typically leads to some degree of linguistic influence. Often, this influence flows mainly in one direction. A language viewed as a center of culture is much more likely to have a noticeable impact on nearby languages than to be shaped by them. Chinese has significantly influenced the vocabularies of Korean, Japanese, and Vietnamese for centuries, without reciprocation. In medieval and modern Western Europe, French has had a similar, though probably less overwhelming, influence. English borrowed a vast number of words from French during the Norman invasion and later from the courtly French of Île-de-France, incorporated some derivative elements (like -ess in princess, -ard in drunkard, -ty in royalty), may have been somewhat influenced in its overall analytical development by contact with French,[164] and even allowed some slight modifications to its phonetic pattern due to French (for instance, initial v and j in words like veal and judge; in words of Anglo-Saxon origin, v and j only occur after vowels, like in over and hedge). However, English has had almost no influence on French.

The simplest kind of influence that one language may exert on another is the “borrowing” of words. When there is cultural borrowing there is always the likelihood that the associated words may be borrowed too. When the early Germanic peoples of northern Europe first learned of wine-culture and of paved streets from their commercial or warlike contact with the Romans, it was only natural that they should adopt the Latin words for the strange beverage (vinum, English wine, German Wein) and the unfamiliar type of road (strata [via], English street, German Strasse). Later, when Christianity was introduced into England, a number of associated words, such as bishop and angel, found their way into English. And so the process has continued uninterruptedly down to the present day, each cultural wave bringing to the language a new deposit of loan-words. The careful study of such loan-words constitutes an interesting commentary on the history of culture. One can almost estimate the rôle which various peoples have played in the development and spread of cultural ideas by taking note of the extent to which their vocabularies have filtered into those of other peoples. When we realize that an educated Japanese can hardly frame a single literary sentence without the use of Chinese resources, that to this day Siamese and Burmese and Cambodgian bear the unmistakable imprint of the Sanskrit and Pali that came in with Hindu Buddhism centuries ago, or that whether we argue for or against the teaching of Latin and Greek our argument is sure to be studded with words that have come to us from Rome and Athens, we get some inkling of what early Chinese culture and Buddhism and classical Mediterranean civilization have meant in the world’s history. There are just five languages that have had an overwhelming significance as carriers of culture. They are classical Chinese, Sanskrit, Arabic, Greek, and Latin. In comparison with these even such culturally important languages as Hebrew and French sink into a secondary position. It is a little disappointing to learn that the general cultural influence of English has so far been all but negligible. The English language itself is spreading because the English have colonized immense territories. But there is nothing to show that it is anywhere entering into the lexical heart of other languages as French has colored the English complexion or as Arabic has permeated Persian and Turkish. This fact alone is significant of the power of nationalism, cultural as well as political, during the last century. There are now psychological resistances to borrowing, or rather to new sources of borrowing,[165] that were not greatly alive in the Middle Ages or during the Renaissance.

The simplest type of influence one language can have on another is through “borrowing” words. When cultural borrowing happens, there’s always a chance that related words get borrowed too. When the early Germanic peoples of northern Europe first encountered wine culture and paved streets from their trade or military encounters with the Romans, it was natural for them to adopt the Latin terms for the strange drink (vinum, English wine, German Wein) and the unfamiliar kind of road (strata [via], English street, German Strasse). Later, when Christianity was introduced to England, several related words, like bishop and angel, made their way into English. This process has continued uninterrupted up to today, with each cultural wave bringing new loanwords into the language. Studying these loanwords provides fascinating insights into cultural history. We can almost gauge the role various peoples have played in the development and spread of cultural ideas by observing how much their vocabularies have influenced others. When we see that an educated Japanese person can barely construct a single literary sentence without using Chinese terms, that Siamese, Burmese, and Cambodian still clearly show the influence of Sanskrit and Pali brought by Hindu Buddhism centuries ago, or that our arguments for or against teaching Latin and Greek are sure to include words from Rome and Athens, we start to understand what early Chinese culture, Buddhism, and classical Mediterranean civilization have represented in world history. There are just five languages that have been extraordinarily significant as carriers of culture: classical Chinese, Sanskrit, Arabic, Greek, and Latin. Compared to these, even culturally important languages like Hebrew and French take a back seat. It’s a little disappointing to find out that the general cultural influence of English has so far been almost negligible. The English language itself is spreading because England has colonized vast territories. However, there’s little evidence that it is becoming embedded within the core vocabulary of other languages like French has done to English or Arabic has done to Persian and Turkish. This fact highlights the strength of nationalism, both cultural and political, over the last century. There are now psychological barriers to borrowing, or rather to new sources of borrowing,[165] that weren’t as prominent in the Middle Ages or during the Renaissance.

Are there resistances of a more intimate nature to the borrowing of words? It is generally assumed that the nature and extent of borrowing depend entirely on the historical facts of culture relation; that if German, for instance, has borrowed less copiously than English from Latin and French it is only because Germany has had less intimate relations than England with the culture spheres of classical Rome and France. This is true to a considerable extent, but it is not the whole truth. We must not exaggerate the physical importance of the Norman invasion nor underrate the significance of the fact that Germany’s central geographical position made it peculiarly sensitive to French influences all through the Middle Ages, to humanistic influences in the latter fifteenth and early sixteenth centuries, and again to the powerful French influences of the seventeenth and eighteenth centuries. It seems very probable that the psychological attitude of the borrowing language itself towards linguistic material has much to do with its receptivity to foreign words. English has long been striving for the completely unified, unanalyzed word, regardless of whether it is monosyllabic or polysyllabic. Such words as credible, certitude, intangible are entirely welcome in English because each represents a unitary, well-nuanced idea and because their formal analysis (cred-ible, cert-itude, in-tang-ible) is not a necessary act of the unconscious mind (cred-, cert-, and tang- have no real existence in English comparable to that of good- in goodness). A word like intangible, once it is acclimated, is nearly as simple a psychological entity as any radical monosyllable (say vague, thin, grasp). In German, however, polysyllabic words strive to analyze themselves into significant elements. Hence vast numbers of French and Latin words, borrowed at the height of certain cultural influences, could not maintain themselves in the language. Latin-German words like kredibel “credible” and French-German words like reussieren “to succeed” offered nothing that the unconscious mind could assimilate to its customary method of feeling and handling words. It is as though this unconscious mind said: “I am perfectly willing to accept kredibel if you will just tell me what you mean by kred-.” Hence German has generally found it easier to create new words out of its own resources, as the necessity for them arose.

Are there deeper reasons for resisting the adoption of words? It's typically believed that the nature and amount of borrowing rely solely on historical cultural connections; that if German, for example, has borrowed less from Latin and French than English, it’s only because Germany had less close ties to the cultural worlds of classical Rome and France. This is largely true, but it's not the complete story. We shouldn't overemphasize the physical importance of the Norman invasion, nor downplay the fact that Germany's central location made it particularly sensitive to French influences throughout the Middle Ages, to humanistic influences in the late 1400s and early 1500s, and again to the strong French influences of the 1600s and 1700s. It seems very likely that the borrowing language's own psychological attitude towards linguistic material has a lot to do with its openness to foreign words. English has long sought to create unified, uncomplicated words, whether they're one syllable or more. Words like credible, certitude, intangible are readily accepted in English because they each convey a cohesive, nuanced idea and because breaking them down (cred-ible, cert-itude, in-tang-ible) isn’t necessary for the unconscious mind (the components cred-, cert-, and tang- don't really exist in English like good- does in goodness). A word like intangible, once it’s familiar, becomes nearly as straightforward a psychological entity as any basic monosyllable (like vague, thin, grasp). In contrast, German tends to deconstruct polysyllabic words into meaningful parts. As a result, many French and Latin words borrowed during peak cultural influences couldn’t sustain themselves in the language. Latin-German words like kredibel “credible” and French-German words like reussieren “to succeed” didn’t provide anything that the unconscious mind could relate to its usual way of processing words. It’s as if this unconscious mind said: “I’m totally fine with accepting kredibel if you can just explain to me what kred- means.” Therefore, German has generally found it easier to invent new words from its own resources as the need arose.

The psychological contrast between English and German as regards the treatment of foreign material is a contrast that may be studied in all parts of the world. The Athabaskan languages of America are spoken by peoples that have had astonishingly varied cultural contacts, yet nowhere do we find that an Athabaskan dialect has borrowed at all freely[166] from a neighboring language. These languages have always found it easier to create new words by compounding afresh elements ready to hand. They have for this reason been highly resistant to receiving the linguistic impress of the external cultural experiences of their speakers. Cambodgian and Tibetan offer a highly instructive contrast in their reaction to Sanskrit influence. Both are analytic languages, each totally different from the highly-wrought, inflective language of India. Cambodgian is isolating, but, unlike Chinese, it contains many polysyllabic words whose etymological analysis does not matter. Like English, therefore, in its relation to French and Latin, it welcomed immense numbers of Sanskrit loan-words, many of which are in common use to-day. There was no psychological resistance to them. Classical Tibetan literature was a slavish adaptation of Hindu Buddhist literature and nowhere has Buddhism implanted itself more firmly than in Tibet, yet it is strange how few Sanskrit words have found their way into the language. Tibetan was highly resistant to the polysyllabic words of Sanskrit because they could not automatically fall into significant syllables, as they should have in order to satisfy the Tibetan feeling for form. Tibetan was therefore driven to translating the great majority of these Sanskrit words into native equivalents. The Tibetan craving for form was satisfied, though the literally translated foreign terms must often have done violence to genuine Tibetan idiom. Even the proper names of the Sanskrit originals were carefully translated, element for element, into Tibetan; e.g., Suryagarbha “Sun-bosomed” was carefully Tibetanized into Nyi-mai snying-po “Sun-of heart-the, the heart (or essence) of the sun.” The study of how a language reacts to the presence of foreign words—rejecting them, translating them, or freely accepting them—may throw much valuable light on its innate formal tendencies.

The psychological difference between English and German regarding how they handle foreign material can be seen worldwide. The Athabaskan languages in America are spoken by groups with remarkably diverse cultural interactions, yet we hardly see an Athabaskan dialect borrowing freely from a nearby language. These languages have historically preferred to create new words by combining existing elements. For this reason, they have been quite resistant to adopting the linguistic influences of the external cultural experiences of their speakers. Cambodian and Tibetan provide a striking contrast in their responses to Sanskrit influence. Both are analytic languages, each completely distinct from the complex, inflected language of India. Cambodian is isolating but, unlike Chinese, includes many polysyllabic words whose etymological breakdown isn't significant. Similar to English in its relation to French and Latin, it embraced a large number of Sanskrit loanwords, many of which are commonly used today. There was no psychological pushback against them. Classical Tibetan literature closely mirrored Hindu Buddhist literature, and Buddhism has firmly established itself in Tibet; however, it’s interesting how few Sanskrit words have entered the language. Tibetan was quite resistant to the polysyllabic words of Sanskrit because they did not fit into meaningful syllables necessary to satisfy Tibetan aesthetic sensibilities. As a result, Tibetan often translated most of these Sanskrit words into local equivalents. The Tibetan desire for form was met, even though literally translated foreign terms frequently distorted genuine Tibetan expression. Even the proper names from the Sanskrit originals were meticulously translated, element by element, into Tibetan; for instance, Suryagarbha, meaning “Sun-bosomed,” was carefully adapted into Nyi-mai snying-po, or “Sun-of heart-the, the heart (or essence) of the sun.” Studying how a language responds to foreign words—by rejecting them, translating them, or accepting them freely—can provide valuable insights into its inherent formal tendencies.

The borrowing of foreign words always entails their phonetic modification. There are sure to be foreign sounds or accentual peculiarities that do not fit the native phonetic habits. They are then so changed as to do as little violence as possible to these habits. Frequently we have phonetic compromises. Such an English word as the recently introduced camouflage, as now ordinarily pronounced, corresponds to the typical phonetic usage of neither English nor French. The aspirated k, the obscure vowel of the second syllable, the precise quality of the l and of the last a, and, above all, the strong accent on the first syllable, are all the results of unconscious assimilation to our English habits of pronunciation. They differentiate our camouflage clearly from the same word as pronounced by the French. On the other hand, the long, heavy vowel in the third syllable and the final position of the “zh” sound (like z in azure) are distinctly un-English, just as, in Middle English, the initial j and v[167] must have been felt at first as not strictly in accord with English usage, though the strangeness has worn off by now. In all four of these cases—initial j, initial v, final “zh,” and unaccented a of father—English has not taken on a new sound but has merely extended the use of an old one.

The borrowing of foreign words always involves changing their pronunciation. There are usually foreign sounds or unique stress patterns that don’t align with native pronunciation habits. They are then altered as little as possible to fit these habits. We often see phonetic compromises. For instance, the English word camouflage, as it's commonly pronounced now, doesn’t match the typical phonetic patterns of either English or French. The aspirated k, the unclear vowel in the second syllable, the specific quality of the l and the last a, and especially the strong stress on the first syllable are all results of unconscious adaptation to our English pronunciation habits. These changes clearly set our camouflage apart from the same word as pronounced by the French. Conversely, the long, heavy vowel in the third syllable and the final “zh” sound (like z in azure) are distinctly un-English, similar to how, in Middle English, the initial j and v must have originally felt out of place within English usage, although that strangeness has faded now. In all four cases—initial j, initial v, final “zh,” and unaccented a in father—English hasn’t adopted a new sound but has simply expanded the use of an old one.

Occasionally a new sound is introduced, but it is likely to melt away before long. In Chaucer’s day the old Anglo-Saxon ü (written y) had long become unrounded to i, but a new set of ü-vowels had come in from the French (in such words as due, value, nature). The new ü did not long hold its own; it became diphthongized to iu and was amalgamated with the native iw of words like new and slew. Eventually this diphthong appears as yu, with change of stress—dew (from Anglo-Saxon deaw) like due (Chaucerian dü). Facts like these show how stubbornly a language resists radical tampering with its phonetic pattern.

Sometimes a new sound pops up, but it's likely to fade away pretty quickly. In Chaucer’s time, the old Anglo-Saxon ü (written as y) had already shifted to i, but a new set of ü-vowels had come from French (as seen in words like due, value, nature). The new ü didn’t last long; it turned into a diphthong iu and merged with the native iw from words like new and slew. Eventually, this diphthong became yu, with a shift in stress—like dew (from Anglo-Saxon deaw) similar to due (Chaucerian dü). These facts illustrate how resistant a language is to drastic changes in its phonetic structure.

Nevertheless, we know that languages do influence each other in phonetic respects, and that quite aside from the taking over of foreign sounds with borrowed words. One of the most curious facts that linguistics has to note is the occurrence of striking phonetic parallels in totally unrelated or very remotely related languages of a restricted geographical area. These parallels become especially impressive when they are seen contrastively from a wide phonetic perspective. Here are a few examples. The Germanic languages as a whole have not developed nasalized vowels. Certain Upper German (Suabian) dialects, however, have now nasalized vowels in lieu of the older vowel + nasal consonant (n). Is it only accidental that these dialects are spoken in proximity to French, which makes abundant use of nasalized vowels? Again, there are certain general phonetic features that mark off Dutch and Flemish in contrast, say, to North German and Scandinavian dialects. One of these is the presence of unaspirated voiceless stops (p, t, k), which have a precise, metallic quality reminiscent of the corresponding French sounds, but which contrast with the stronger, aspirated stops of English, North German, and Danish. Even if we assume that the unaspirated stops are more archaic, that they are the unmodified descendants of the old Germanic consonants, is it not perhaps a significant historical fact that the Dutch dialects, neighbors of French, were inhibited from modifying these consonants in accordance with what seems to have been a general Germanic phonetic drift? Even more striking than these instances is the peculiar resemblance, in certain special phonetic respects, of Russian and other Slavic languages to the unrelated Ural-Altaic languages[168] of the Volga region. The peculiar, dull vowel, for instance, known in Russian as “yeri”[169] has Ural-Altaic analogues, but is entirely wanting in Germanic, Greek, Armenian, and Indo-Iranian, the nearest Indo-European congeners of Slavic. We may at least suspect that the Slavic vowel is not historically unconnected with its Ural-Altaic parallels. One of the most puzzling cases of phonetic parallelism is afforded by a large number of American Indian languages spoken west of the Rockies. Even at the most radical estimate there are at least four totally unrelated linguistic stocks represented in the region from southern Alaska to central California. Nevertheless all, or practically all, the languages of this immense area have some important phonetic features in common. Chief of these is the presence of a “glottalized” series of stopped consonants of very distinctive formation and of quite unusual acoustic effect.[170] In the northern part of the area all the languages, whether related or not, also possess various voiceless l-sounds and a series of “velar” (back-guttural) stopped consonants which are etymologically distinct from the ordinary k-series. It is difficult to believe that three such peculiar phonetic features as I have mentioned could have evolved independently in neighboring groups of languages.

However, we know that languages do influence each other in terms of sound, and this happens apart from just adopting foreign sounds through borrowed words. One of the most interesting observations in linguistics is the striking phonetic similarities found in completely unrelated or very distantly related languages in a specific geographical area. These similarities stand out even more when looked at from a broader phonetic perspective. Here are a few examples. Overall, the Germanic languages have not developed nasalized vowels. However, certain Upper German (Swabian) dialects have now adopted nasalized vowels instead of the older vowel + nasal consonant (n). Is it just a coincidence that these dialects are spoken near French, which makes heavy use of nasalized vowels? Additionally, there are certain general phonetic features that differentiate Dutch and Flemish from, let's say, North German and Scandinavian dialects. One of these is the presence of unaspirated voiceless stops (p, t, k), which have a precise, metallic quality similar to corresponding French sounds but contrast with the stronger, aspirated stops of English, North German, and Danish. Even if we assume that the unaspirated stops are more archaic and are the unaltered descendants of the old Germanic consonants, could it be a notable historical fact that the Dutch dialects, neighbors of French, were held back from changing these consonants in line with what seems to have been a general trend in Germanic phonetics? Even more striking than these examples is the surprising similarity, in specific phonetic aspects, of Russian and other Slavic languages to unrelated Ural-Altaic languages[168] from the Volga region. For instance, the unique, flat vowel known in Russian as “yeri”[169] has Ural-Altaic counterparts but is completely absent in Germanic, Greek, Armenian, and Indo-Iranian, which are the closest Indo-European relatives of Slavic. We can at least suspect that the Slavic vowel is historically connected to its Ural-Altaic parallels. One of the most puzzling cases of phonetic similarity comes from a large number of American Indian languages spoken west of the Rockies. Even in the most conservative estimates, there are at least four completely unrelated language families represented in the region from southern Alaska to central California. Nevertheless, nearly all the languages in this vast area share some important phonetic features. The most significant of these is the presence of a “glottalized” series of stopped consonants that have a very distinctive formation and unusual sound effects.[170] In the northern part of the area, all the languages, whether related or not, also have various voiceless l-sounds and a series of “velar” (back-guttural) stopped consonants that are etymologically distinct from the regular k-series. It’s hard to believe that three such unusual phonetic features could have developed independently in neighboring language groups.

How are we to explain these and hundreds of similar phonetic convergences? In particular cases we may really be dealing with archaic similarities due to a genetic relationship that it is beyond our present power to demonstrate. But this interpretation will not get us far. It must be ruled entirely out of court, for instance, in two of the three European examples I have instanced; both nasalized vowels and the Slavic “yeri” are demonstrably of secondary origin in Indo-European. However we envisage the process in detail, we cannot avoid the inference that there is a tendency for speech sounds or certain distinctive manners of articulation to spread over a continuous area in somewhat the same way that elements of culture ray out from a geographical center. We may suppose that individual variations arising at linguistic borderlands—whether by the unconscious suggestive influence of foreign speech habits or by the actual transfer of foreign sounds into the speech of bilingual individuals—have gradually been incorporated into the phonetic drift of a language. So long as its main phonetic concern is the preservation of its sound patterning, not of its sounds as such, there is really no reason why a language may not unconsciously assimilate foreign sounds that have succeeded in worming their way into its gamut of individual variations, provided always that these new variations (or reinforced old variations) are in the direction of the native drift.

How do we explain these and hundreds of similar phonetic similarities? In specific cases, we might genuinely be looking at ancient similarities due to a genetic relationship that we currently can't prove. However, this interpretation isn’t very helpful. It must be completely dismissed, for instance, in two of the three European examples I've mentioned; both nasalized vowels and the Slavic “yeri” are clearly of later origin in Indo-European. However we analyze the process in detail, we can't ignore the conclusion that there's a tendency for speech sounds or certain distinct ways of articulation to spread out over a continuous area, similar to how cultural elements radiate from a geographic center. We can assume that individual variations arising at linguistic borders—whether due to the unconscious suggestive influence of foreign speech patterns or the actual adoption of foreign sounds by bilingual individuals—have gradually been included in the phonetic evolution of a language. As long as its main phonetic concern is to maintain its sound patterns, rather than its sounds themselves, there's really no reason a language couldn't unconsciously absorb foreign sounds that have managed to make their way into its range of individual variations, as long as these new variations (or strengthened old variations) align with the native direction.

A simple illustration will throw light on this conception. Let us suppose that two neighboring and unrelated languages, A and B, each possess voiceless l-sounds (compare Welsh ll). We surmise that this is not an accident. Perhaps comparative study reveals the fact that in language A the voiceless l-sounds correspond to a sibilant series in other related languages, that an old alternation s: sh has been shifted to the new alternation l (voiceless): s.[171] Does it follow that the voiceless l of language B has had the same history? Not in the least. Perhaps B has a strong tendency toward audible breath release at the end of a word, so that the final l, like a final vowel, was originally followed by a marked aspiration. Individuals perhaps tended to anticipate a little the voiceless release and to “unvoice” the latter part of the final l-sound (very much as the l of English words like felt tends to be partly voiceless in anticipation of the voicelessness of the t). Yet this final l with its latent tendency to unvoicing might never have actually developed into a fully voiceless l had not the presence of voiceless l-sounds in A acted as an unconscious stimulus or suggestive push toward a more radical change in the line of B’s own drift. Once the final voiceless l emerged, its alternation in related words with medial voiced l is very likely to have led to its analogical spread. The result would be that both A and B have an important phonetic trait in common. Eventually their phonetic systems, judged as mere assemblages of sounds, might even become completely assimilated to each other, though this is an extreme case hardly ever realized in practice. The highly significant thing about such phonetic interinfluencings is the strong tendency of each language to keep its phonetic pattern intact. So long as the respective alignments of the similar sounds is different, so long as they have differing “values” and “weights” in the unrelated languages, these languages cannot be said to have diverged materially from the line of their inherent drift. In phonetics, as in vocabulary, we must be careful not to exaggerate the importance of interlinguistic influences.

A simple example will clarify this idea. Let’s say there are two neighboring and unrelated languages, A and B, both of which have voiceless l-sounds (think of the Welsh ll). We suspect this isn’t just a coincidence. Perhaps a comparative study shows that in language A, the voiceless l-sounds correspond to a sibilant series in other related languages, indicating an old alternation s: sh that has been replaced by a new alternation l (voiceless): s.[171] Does it mean that the voiceless l in language B has gone through the same historical changes? Not at all. Maybe B has a strong tendency for audible breath release at the end of a word, so the final l, similar to a final vowel, was originally followed by a noticeable aspiration. People might have tended to anticipate the voiceless release and partially "unvoiced" the last part of the final l-sound (very much like how the l in English words like felt tends to be partly voiceless in anticipation of the voiceless t). Still, this final l with its tendency toward unvoicing might not have actually turned into a fully voiceless l if the existence of voiceless l-sounds in A hadn’t subconsciously pushed or encouraged a more significant change in the direction of B’s development. Once the final voiceless l appeared, its variation in related words with a medial voiced l likely contributed to its analogical spread. This would mean that both A and B share an important phonetic trait. In the end, their phonetic systems, viewed merely as collections of sounds, could even end up completely assimilated to one another, though this is an extreme case that rarely happens in reality. The crucial point about such phonetic influences is that each language tends to maintain its phonetic pattern. As long as the alignments of the similar sounds differ, and as long as they hold different “values” and “weights” in the unrelated languages, these languages can’t be said to have diverged significantly from their natural direction. In phonetics, as in vocabulary, we should be cautious not to overstate the significance of influences between languages.

I have already pointed out in passing that English has taken over a certain number of morphological elements from French. English also uses a number of affixes that are derived from Latin and Greek. Some of these foreign elements, like the -ize of materialize or the -able of breakable, are even productive to-day. Such examples as these are hardly true evidences of a morphological influence exerted by one language on another. Setting aside the fact that they belong to the sphere of derivational concepts and do not touch the central morphological problem of the expression of relational ideas, they have added nothing to the structural peculiarities of our language. English was already prepared for the relation of pity to piteous by such a native pair as luck and lucky; material and materialize merely swelled the ranks of a form pattern familiar from such instances as wide and widen. In other words, the morphological influence exerted by foreign languages on English, if it is to be gauged by such examples as I have cited, is hardly different in kind from the mere borrowing of words. The introduction of the suffix -ize made hardly more difference to the essential build of the language than did the mere fact that it incorporated a given number of words. Had English evolved a new future on the model of the synthetic future in French or had it borrowed from Latin and Greek their employment of reduplication as a functional device (Latin tango: tetigi; Greek leipo: leloipa), we should have the right to speak of true morphological influence. But such far-reaching influences are not demonstrable. Within the whole course of the history of the English language we can hardly point to one important morphological change that was not determined by the native drift, though here and there we may surmise that this drift was hastened a little by the suggestive influence of French forms.[172]

I have already briefly mentioned that English has adopted a number of morphological elements from French. English also incorporates several affixes that come from Latin and Greek. Some of these foreign elements, like the -ize in materialize or the -able in breakable, are still productive today. However, these examples do not really provide strong evidence of one language influencing another morphologically. Leaving aside the fact that they relate to derivational concepts and do not address the core morphological issue of expressing relational ideas, they have not contributed anything to the structural characteristics of our language. English was already set up for the relationship of pity to piteous by a native pair like luck and lucky; material and materialize merely added to a familiar form pattern seen in examples like wide and widen. In other words, the morphological influence from foreign languages on English, when assessed by the examples I’ve mentioned, is hardly different in nature from simply borrowing words. The addition of the suffix -ize didn't significantly change the essential structure of the language any more than merely adding a number of words did. If English had developed a new future tense like the synthetic future in French or had adopted the use of reduplication as a functional feature from Latin and Greek (Latin tango: tetigi; Greek leipo: leloipa), we could legitimately talk about true morphological influence. But such significant influences are not evident. Throughout the entire history of the English language, we can hardly identify one major morphological change that was not guided by native development, although we may suspect that this development was slightly accelerated by the influence of French forms.[172]

It is important to realize the continuous, self-contained morphological development of English and the very modest extent to which its fundamental build has been affected by influences from without. The history of the English language has sometimes been represented as though it relapsed into a kind of chaos on the arrival of the Normans, who proceeded to play nine-pins with the Anglo-Saxon tradition. Students are more conservative today. That a far-reaching analytic development may take place without such external foreign influence as English was subjected to is clear from the history of Danish, which has gone even further than English in certain leveling tendencies. English may be conveniently used as an a fortiori test. It was flooded with French loan-words during the later Middle Ages, at a time when its drift toward the analytic type was especially strong. It was therefore changing rapidly both within and on the surface. The wonder, then, is not that it took on a number of external morphological features, mere accretions on its concrete inventory, but that, exposed as it was to remolding influences, it remained so true to its own type and historic drift. The experience gained from the study of the English language is strengthened by all that we know of documented linguistic history. Nowhere do we find any but superficial morphological interinfluencings. We may infer one of several things from this:—That a really serious morphological influence is not, perhaps, impossible, but that its operation is so slow that it has hardly ever had the chance to incorporate itself in the relatively small portion of linguistic history that lies open to inspection; or that there are certain favorable conditions that make for profound morphological disturbances from without, say a peculiar instability of linguistic type or an unusual degree of cultural contact, conditions that do not happen to be realized in our documentary material; or, finally, that we have not the right to assume that a language may easily exert a remolding morphological influence on another.

It’s important to recognize the ongoing, self-contained development of English and how little its core structure has been impacted by outside influences. The history of the English language has sometimes been depicted as if it fell into chaos with the arrival of the Normans, who seemed to disrupt the Anglo-Saxon tradition. Nowadays, students tend to be more conservative. It’s clear from the history of Danish, which has undergone even more significant changes than English in certain leveling aspects, that extensive analytical development can occur without the external foreign influences that English experienced. English can conveniently serve as an a fortiori example. It was inundated with French loanwords during the later Middle Ages, a period when its move toward an analytic form was particularly strong. Therefore, it was changing rapidly both internally and externally. The surprising part isn’t that it adopted some external morphological features, which are just additions to its existing structure, but that despite being subject to reshaping influences, it stayed so true to its own type and historical trajectory. The insights gained from studying the English language are supported by everything we know about documented linguistic history. Nowhere do we find anything more than superficial morphological interactions. From this, we can infer several possibilities: either that a truly significant morphological influence is not impossible, but that its effects are so slow-moving that it has rarely had the opportunity to become established in the relatively small span of linguistic history we can examine; or that there are specific favorable conditions that lead to profound external morphological changes, such as a unique instability of linguistic type or an unusual level of cultural interaction, conditions not found in our existing documentation; or, finally, that we have no right to assume that one language can easily reshape the morphological structure of another.

Meanwhile we are confronted by the baffling fact that important traits of morphology are frequently found distributed among widely differing languages within a large area, so widely differing, indeed, that it is customary to consider them genetically unrelated. Sometimes we may suspect that the resemblance is due to a mere convergence, that a similar morphological feature has grown up independently in unrelated languages. Yet certain morphological distributions are too specific in character to be so lightly dismissed. There must be some historical factor to account for them. Now it should be remembered that the concept of a “linguistic stock” is never definitive[173] in an exclusive sense. We can only say, with reasonable certainty, that such and such languages are descended from a common source, but we cannot say that such and such other languages are not genetically related. All we can do is to say that the evidence for relationship is not cumulative enough to make the inference of common origin absolutely necessary. May it not be, then, that many instances of morphological similarity between divergent languages of a restricted area are merely the last vestiges of a community of type and phonetic substance that the destructive work of diverging drifts has now made unrecognizable? There is probably still enough lexical and morphological resemblance between modern English and Irish to enable us to make out a fairly conclusive case for their genetic relationship on the basis of the present-day descriptive evidence alone. It is true that the case would seem weak in comparison to the case that we can actually make with the help of the historical and the comparative data that we possess. It would not be a bad case nevertheless. In another two or three millennia, however, the points of resemblance are likely to have become so obliterated that English and Irish, in the absence of all but their own descriptive evidence, will have to be set down as “unrelated” languages. They will still have in common certain fundamental morphological features, but it will be difficult to know how to evaluate them. Only in the light of the contrastive perspective afforded by still more divergent languages, such as Basque and Finnish, will these vestigial resemblances receive their true historic value.

Meanwhile, we face the puzzling fact that key morphological traits are often found spread across very different languages within a large area, so distinct that they are usually considered genetically unrelated. Sometimes we might think that the similarity is simply a result of convergence, where a similar morphological feature has developed independently in unrelated languages. Yet certain morphological distributions are too specific to disregard so easily. There must be some historical factor behind them. It's important to remember that the idea of a “linguistic stock” is never strictly definitive[173]. We can only reasonably claim that certain languages come from a common source, but we cannot definitively state that other languages are not genetically related. All we can say is that the evidence for a relationship isn’t strong enough to require the conclusion of a common origin. Could it be that many instances of morphological similarity between divergent languages in a limited area are just the remaining traces of a type and phonetic substance that the effects of divergence have made nearly unrecognizable? There’s probably still enough lexical and morphological similarity between modern English and Irish to make a fairly convincing case for their genetic relationship based solely on current descriptive evidence. While the case may appear weak compared to what we can build using historical and comparative data, it wouldn’t be a bad case. However, in two or three millennia, the similarities will likely be so obscured that English and Irish, without any evidence other than their own descriptions, will be classified as “unrelated” languages. They will still share certain fundamental morphological features, but it will be tough to assess their significance. Only with the contrasting perspective provided by even more divergent languages, like Basque and Finnish, will these residual similarities gain their true historical value.

I cannot but suspect that many of the more significant distributions of morphological similarities are to be explained as just such vestiges. The theory of “borrowing” seems totally inadequate to explain those fundamental features of structure, hidden away in the very core of the linguistic complex, that have been pointed out as common, say, to Semitic and Hamitic, to the various Soudanese languages, to Malayo-Polynesian and Mon-Khmer[174] and Munda,[175] to Athabaskan and Tlingit and Haida. We must not allow ourselves to be frightened away by the timidity of the specialists, who are often notably lacking in the sense of what I have called “contrastive perspective.”

I can’t help but think that a lot of the more significant patterns of morphological similarities can be explained as just leftover traces. The idea of “borrowing” doesn’t really cut it when it comes to explaining those core structural features, hidden deep within the linguistic system, that have been noted as common, for example, to Semitic and Hamitic, to the various Sudanese languages, to Malayo-Polynesian and Mon-Khmer[174] and Munda,[175] to Athabaskan and Tlingit and Haida. We shouldn’t let the fearfulness of the specialists intimidate us, as they often lack what I’ve referred to as a “contrastive perspective.”

Attempts have sometimes been made to explain the distribution of these fundamental structural features by the theory of diffusion. We know that myths, religious ideas, types of social organization, industrial devices, and other features of culture may spread from point to point, gradually making themselves at home in cultures to which they were at one time alien. We also know that words may be diffused no less freely than cultural elements, that sounds also may be “borrowed,” and that even morphological elements may be taken over. We may go further and recognize that certain languages have, in all probability, taken on structural features owing to the suggestive influence of neighboring languages. An examination of such cases,[176] however, almost invariably reveals the significant fact that they are but superficial additions on the morphological kernel of the language. So long as such direct historical testimony as we have gives us no really convincing examples of profound morphological influence by diffusion, we shall do well not to put too much reliance in diffusion theories. On the whole, therefore, we shall ascribe the major concordances and divergences in linguistic form—phonetic pattern and morphology—to the autonomous drift of language, not to the complicating effect of single, diffused features that cluster now this way, now that. Language is probably the most self-contained, the most massively resistant of all social phenomena. It is easier to kill it off than to disintegrate its individual form.

Attempts have sometimes been made to explain the distribution of these fundamental structural features using diffusion theory. We know that myths, religious ideas, types of social organization, industrial tools, and other cultural aspects can spread from one place to another, gradually becoming part of cultures that were once unfamiliar with them. We also know that words can be spread just as easily as cultural elements, that sounds can also be “borrowed,” and that even morphological elements can be adopted. We can go further and recognize that certain languages have likely taken on structural features due to the influence of nearby languages. However, an examination of such cases,[176] almost always reveals the important fact that these are just superficial additions to the core morphology of the language. As long as the historical evidence we have does not provide convincing examples of deep morphological influence through diffusion, we should be cautious about relying too heavily on diffusion theories. Overall, we will attribute the main similarities and differences in linguistic form—phonetic patterns and morphology—to the independent evolution of language, rather than to the confusing effects of single, diffused features that come together in various ways. Language is likely the most self-sufficient and resistant of all social phenomena. It is easier to extinguish it than to break down its individual form.

X

Language, Race and Culture

Language has a setting. The people that speak it belong to a race (or a number of races), that is, to a group which is set off by physical characteristics from other groups. Again, language does not exist apart from culture, that is, from the socially inherited assemblage of practices and beliefs that determines the texture of our lives. Anthropologists have been in the habit of studying man under the three rubrics of race, language, and culture. One of the first things they do with a natural area like Africa or the South Seas is to map it out from this threefold point of view. These maps answer the questions: What and where are the major divisions of the human animal, biologically considered (e.g., Congo Negro, Egyptian White; Australian Black, Polynesian)? What are the most inclusive linguistic groupings, the “linguistic stocks,” and what is the distribution of each (e.g., the Hamitic languages of northern Africa, the Bantu languages of the south; the Malayo-Polynesian languages of Indonesia, Melanesia, Micronesia, and Polynesia)? How do the peoples of the given area divide themselves as cultural beings? what are the outstanding “cultural areas” and what are the dominant ideas in each (e.g., the Mohammedan north of Africa; the primitive hunting, non-agricultural culture of the Bushmen in the south; the culture of the Australian natives, poor in physical respects but richly developed in ceremonialism; the more advanced and highly specialized culture of Polynesia)?

Language has a context. The people who speak it belong to a race (or multiple races), meaning a group defined by physical traits that set them apart from others. Additionally, language is intertwined with culture, which is the collection of practices and beliefs passed down socially that shapes our lives. Anthropologists typically study humans through the lenses of race, language, and culture. One of the first steps they take when examining a natural region like Africa or the South Seas is to map it out from these three perspectives. These maps address questions like: What and where are the major categories of humans, from a biological standpoint (e.g., Congo Black, Egyptian White; Australian Black, Polynesian)? What are the largest linguistic groups, known as “linguistic stocks,” and how are they distributed (e.g., the Hamitic languages of northern Africa, the Bantu languages of the south; the Malayo-Polynesian languages of Indonesia, Melanesia, Micronesia, and Polynesia)? How do the people in this area categorize themselves culturally? What are the key “cultural areas,” and what prominent ideas exist in each (e.g., the Muslim north of Africa; the traditional hunting, non-agricultural culture of the Bushmen in the south; the culture of the Aboriginal Australians, lacking in some physical aspects but rich in ceremonial practices; the more advanced and specialized culture of Polynesia)?

The man in the street does not stop to analyze his position in the general scheme of humanity. He feels that he is the representative of some strongly integrated portion of humanity—now thought of as a “nationality,” now as a “race”—and that everything that pertains to him as a typical representative of this large group somehow belongs together. If he is an Englishman, he feels himself to be a member of the “Anglo-Saxon” race, the “genius” of which race has fashioned the English language and the “Anglo-Saxon” culture of which the language is the expression. Science is colder. It inquires if these three types of classification—racial, linguistic, and cultural—are congruent, if their association is an inherently necessary one or is merely a matter of external history. The answer to the inquiry is not encouraging to “race” sentimentalists. Historians and anthropologists find that races, languages, and cultures are not distributed in parallel fashion, that their areas of distribution intercross in the most bewildering fashion, and that the history of each is apt to follow a distinctive course. Races intermingle in a way that languages do not. On the other hand, languages may spread far beyond their original home, invading the territory of new races and of new culture spheres. A language may even die out in its primary area and live on among peoples violently hostile to the persons of its original speakers. Further, the accidents of history are constantly rearranging the borders of culture areas without necessarily effacing the existing linguistic cleavages. If we can once thoroughly convince ourselves that race, in its only intelligible, that is biological, sense, is supremely indifferent to the history of languages and cultures, that these are no more directly explainable on the score of race than on that of the laws of physics and chemistry, we shall have gained a viewpoint that allows a certain interest to such mystic slogans as Slavophilism, Anglo-Saxondom, Teutonism, and the Latin genius but that quite refuses to be taken in by any of them. A careful study of linguistic distributions and of the history of such distributions is one of the driest of commentaries on these sentimental creeds.

The average person on the street doesn’t pause to consider his role in the larger picture of humanity. He senses that he represents a solidly connected part of humanity—sometimes thought of as a “nationality,” other times as a “race”—and that everything related to him as a typical representative of this large group somehow fits together. If he’s English, he identifies as a member of the “Anglo-Saxon” race, whose “genius” has created the English language and the “Anglo-Saxon” culture that the language expresses. Science takes a more detached approach. It questions whether these three types of classification—racial, linguistic, and cultural—are aligned, whether their connection is fundamentally necessary, or simply a matter of historical happenstance. The findings aren't encouraging for those who romanticize “race.” Historians and anthropologists discover that races, languages, and cultures do not align neatly; their distribution overlaps in complex ways, and each follows its own unique historical path. Races mix in ways that languages do not. Conversely, languages can spread well beyond their original locations, penetrating the territories of new races and cultural spheres. A language might even vanish in its native area and survive among groups that are hostile to the original speakers. Moreover, the twists of history continually reshape cultural borders without necessarily erasing existing language divides. If we can fully convince ourselves that race, in its only understandable biological sense, is completely indifferent to the history of languages and cultures, and that these are no more directly explained by race than by the laws of physics and chemistry, we will have achieved a perspective that allows for some curiosity about mystical concepts like Slavophilism, Anglo-Saxondom, Teutonism, and the Latin genius, but ultimately dismisses them. A detailed examination of language distributions and their histories offers one of the most mundane critiques of these sentimental beliefs.

That a group of languages need not in the least correspond to a racial group or a culture area is easily demonstrated. We may even show how a single language intercrosses with race and culture lines. The English language is not spoken by a unified race. In the United States there are several millions of negroes who know no other language. It is their mother-tongue, the formal vesture of their inmost thoughts and sentiments. It is as much their property, as inalienably “theirs,” as the King of England’s. Nor do the English-speaking whites of America constitute a definite race except by way of contrast to the negroes. Of the three fundamental white races in Europe generally recognized by physical anthropologists—the Baltic or North European, the Alpine, and the Mediterranean—each has numerous English-speaking representatives in America. But does not the historical core of English-speaking peoples, those relatively “unmixed” populations that still reside in England and its colonies, represent a race, pure and single? I cannot see that the evidence points that way. The English people are an amalgam of many distinct strains. Besides the old “Anglo-Saxon,” in other words North German, element which is conventionally represented as the basic strain, the English blood comprises Norman French,[177] Scandinavian, “Celtic,”[178] and pre-Celtic elements. If by “English” we mean also Scotch and Irish,[179] then the term “Celtic” is loosely used for at least two quite distinct racial elements—the short, dark-complexioned type of Wales and the taller, lighter, often ruddy-haired type of the Highlands and parts of Ireland. Even if we confine ourselves to the Saxon element, which, needless to say, nowhere appears “pure,” we are not at the end of our troubles. We may roughly identify this strain with the racial type now predominant in southern Denmark and adjoining parts of northern Germany. If so, we must content ourselves with the reflection that while the English language is historically most closely affiliated with Frisian, in second degree with the other West Germanic dialects (Low Saxon or “Plattdeutsch,” Dutch, High German), only in third degree with Scandinavian, the specific “Saxon” racial type that overran England in the fifth and sixth centuries was largely the same as that now represented by the Danes, who speak a Scandinavian language, while the High German-speaking population of central and southern Germany[180] is markedly distinct.

A group of languages doesn’t have to match a racial group or a cultural area at all. We can even show how a single language crosses over race and culture lines. The English language isn’t spoken by a single racial group. In the United States, there are several million Black people who speak no other language. It is their mother tongue, the formal expression of their deepest thoughts and feelings. It belongs to them just as much, and in the same unchangeable way, as it belongs to the King of England. The English-speaking white population in America doesn’t form a definite race except in contrast to Black people. Among the three main white racial groups in Europe recognized by physical anthropologists—the Baltic or North European, the Alpine, and the Mediterranean—each has many English-speaking representatives in America. But doesn’t the historical core of English-speaking people, those relatively “pure” populations still living in England and its colonies, represent a pure and single race? I don’t see any evidence supporting that idea. The English people are a mix of many distinct backgrounds. Besides the traditional “Anglo-Saxon,” or North German element that’s typically viewed as the core strain, English heritage includes Norman French, Scandinavian, “Celtic,” and pre-Celtic elements. If we define “English” to also include Scottish and Irish, then the term “Celtic” is used vaguely to refer to at least two quite different racial groups—the shorter, darker-skinned type from Wales and the taller, lighter, often red-haired type from the Highlands and parts of Ireland. Even if we focus solely on the Saxon element, which, of course, is never “pure,” we still aren't done. We can roughly associate this strain with the racial type now mainly found in southern Denmark and nearby areas of northern Germany. If that’s the case, we must accept that while the English language shares its historical roots mostly with Frisian, secondarily with other West Germanic dialects (Low Saxon or “Plattdeutsch,” Dutch, High German), and only thirdly with Scandinavian, the specific “Saxon” racial type that took over England in the fifth and sixth centuries was largely the same as what you see now in the Danes, who speak a Scandinavian language, while the High German-speaking population in central and southern Germany is noticeably distinct.

But what if we ignore these finer distinctions and simply assume that the “Teutonic” or Baltic or North European racial type coincided in its distribution with that of the Germanic languages? Are we not on safe ground then? No, we are now in hotter water than ever. First of all, the mass of the German-speaking population (central and southern Germany, German Switzerland, German Austria) do not belong to the tall, blond-haired, long-headed[181] “Teutonic” race at all, but to the shorter, darker-complexioned, short-headed[182] Alpine race, of which the central population of France, the French Swiss, and many of the western and northern Slavs (e.g., Bohemians and Poles) are equally good representatives. The distribution of these “Alpine” populations corresponds in part to that of the old continental “Celts,” whose language has everywhere given way to Italic, Germanic, and Slavic pressure. We shall do well to avoid speaking of a “Celtic race,” but if we were driven to give the term a content, it would probably be more appropriate to apply it to, roughly, the western portion of the Alpine peoples than to the two island types that I referred to before. These latter were certainly “Celticized,” in speech and, partly, in blood, precisely as, centuries later, most of England and part of Scotland was “Teutonized” by the Angles and Saxons. Linguistically speaking, the “Celts” of to-day (Irish Gaelic, Manx, Scotch Gaelic, Welsh, Breton) are Celtic and most of the Germans of to-day are Germanic precisely as the American Negro, Americanized Jew, Minnesota Swede, and German-American are “English.” But, secondly, the Baltic race was, and is, by no means an exclusively Germanic-speaking people. The northernmost “Celts,” such as the Highland Scotch, are in all probability a specialized offshoot of this race. What these people spoke before they were Celticized nobody knows, but there is nothing whatever to indicate that they spoke a Germanic language. Their language may quite well have been as remote from any known Indo-European idiom as are Basque and Turkish to-day. Again, to the east of the Scandinavians are non-Germanic members of the race—the Finns and related peoples, speaking languages that are not definitely known to be related to Indo-European at all.

But what if we overlook these finer distinctions and just assume that the “Teutonic” or Baltic or North European racial type aligns with the distribution of the Germanic languages? Are we on solid ground then? No, we’re actually in deeper trouble than before. First of all, the majority of the German-speaking population (central and southern Germany, German Switzerland, German Austria) don’t belong to the tall, blond-haired, long-headed[181] “Teutonic” race at all; they belong to the shorter, darker-skinned, short-headed[182] Alpine race, which also includes the central population of France, the French Swiss, and many of the western and northern Slavs (like Bohemians and Poles). The distribution of these “Alpine” populations partially corresponds to that of the old continental “Celts,” whose language has been replaced everywhere by Italic, Germanic, and Slavic influences. We should avoid talking about a “Celtic race,” but if we had to define the term, it would probably make more sense to apply it to, roughly, the western part of the Alpine people rather than the two island types I mentioned earlier. These island types were certainly “Celticized” in language and, to some extent, in blood, just as most of England and part of Scotland were “Teutonized” by the Angles and Saxons centuries later. Linguistically speaking, today’s “Celts” (Irish Gaelic, Manx, Scottish Gaelic, Welsh, Breton) are Celtic, and most Germans today are Germanic just as the American Negro, Americanized Jew, Minnesota Swede, and German-American are “English.” But, second, the Baltic race was, and still is, not exclusively a Germanic-speaking people. The northernmost “Celts,” like the Highland Scots, are likely a specialized offshoot of this race. Nobody knows what these people spoke before they were Celticized, but there’s nothing to suggest they spoke a Germanic language. Their language may have been as unrelated to any known Indo-European language as Basque and Turkish are today. Additionally, to the east of the Scandinavians are non-Germanic members of this race—the Finns and related peoples—speaking languages that are not clearly linked to Indo-European at all.

We cannot stop here. The geographical position of the Germanic languages is such[183] as to make it highly probable that they represent but an outlying transfer of an Indo-European dialect (possibly a Celto-Italic prototype) to a Baltic people speaking a language or a group of languages that was alien to Indo-European.[184] Not only, then, is English not spoken by a unified race at present but its prototype, more likely than not, was originally a foreign language to the race with which English is more particularly associated. We need not seriously entertain the idea that English or the group of languages to which it belongs is in any intelligible sense the expression of race, that there are embedded in it qualities that reflect the temperament or “genius” of a particular breed of human beings.

We can’t stop here. The geographical position of the Germanic languages is such[183] that it’s very likely they are just an outlier transfer of an Indo-European dialect (possibly a Celto-Italic prototype) to a Baltic people who spoke a language or a group of languages that was different from Indo-European.[184] So, not only is English not spoken by a unified race today, but its original form was probably a foreign language to the people most associated with it. We don’t need to seriously consider the idea that English or the group of languages it belongs to is in any clear way an expression of race, or that it holds qualities that reflect the temperament or “genius” of a specific group of people.

Many other, and more striking, examples of the lack of correspondence between race and language could be given if space permitted. One instance will do for many. The Malayo-Polynesian languages form a well-defined group that takes in the southern end of the Malay Peninsula and the tremendous island world to the south and east (except Australia and the greater part of New Guinea). In this vast region we find represented no less than three distinct races—the Negro-like Papuans of New Guinea and Melanesia, the Malay race of Indonesia, and the Polynesians of the outer islands. The Polynesians and Malays all speak languages of the Malayo-Polynesian group, while the languages of the Papuans belong partly to this group (Melanesian), partly to the unrelated languages (“Papuan”) of New Guinea.[185] In spite of the fact that the greatest race cleavage in this region lies between the Papuans and the Polynesians, the major linguistic division is of Malayan on the one side, Melanesian and Polynesian on the other.

Many other, and more notable, examples of the disconnect between race and language could be provided if there was enough space. One example will suffice for many. The Malayo-Polynesian languages make up a distinct group that includes the southern part of the Malay Peninsula and the vast island region to the south and east (excluding Australia and most of New Guinea). In this extensive area, we see three distinct races represented—the Negro-like Papuans of New Guinea and Melanesia, the Malay race of Indonesia, and the Polynesians from the outer islands. Both the Polynesians and Malays speak languages from the Malayo-Polynesian group, while the languages of the Papuans partially belong to this group (Melanesian) and partly to the unrelated languages (“Papuan”) of New Guinea.[185] Even though the biggest racial divide in this region is between the Papuans and the Polynesians, the main linguistic division is Malayan on one side and Melanesian and Polynesian on the other.

As with race, so with culture. Particularly in more primitive levels, where the secondarily unifying power of the “national”[186] ideal does not arise to disturb the flow of what we might call natural distributions, is it easy to show that language and culture are not intrinsically associated. Totally unrelated languages share in one culture, closely related languages—even a single language—belong to distinct culture spheres. There are many excellent examples in aboriginal America. The Athabaskan languages form as clearly unified, as structurally specialized, a group as any that I know of.[187] The speakers of these languages belong to four distinct culture areas—the simple hunting culture of western Canada and the interior of Alaska (Loucheux, Chipewyan), the buffalo culture of the Plains (Sarcee), the highly ritualized culture of the southwest (Navaho), and the peculiarly specialized culture of northwestern California (Hupa). The cultural adaptability of the Athabaskan-speaking peoples is in the strangest contrast to the inaccessibility to foreign influences of the languages themselves.[188] The Hupa Indians are very typical of the culture area to which they belong. Culturally identical with them are the neighboring Yurok and Karok. There is the liveliest intertribal intercourse between the Hupa, Yurok, and Karok, so much so that all three generally attend an important religious ceremony given by any one of them. It is difficult to say what elements in their combined culture belong in origin to this tribe or that, so much at one are they in communal action, feeling, and thought. But their languages are not merely alien to each other; they belong to three of the major American linguistic groups, each with an immense distribution on the northern continent. Hupa, as we have seen, is Athabaskan and, as such, is also distantly related to Haida (Queen Charlotte Islands) and Tlingit (southern Alaska); Yurok is one of the two isolated Californian languages of the Algonkin stock, the center of gravity of which lies in the region of the Great Lakes; Karok is the northernmost member of the Hokan group, which stretches far to the south beyond the confines of California and has remoter relatives along the Gulf of Mexico.

Just like with race, the same goes for culture. Especially in more primitive societies, where the secondary unifying force of the “national” [186] ideal doesn’t disrupt the flow of what we can call natural distributions, it’s easy to demonstrate that language and culture aren’t inherently linked. Completely unrelated languages can exist within one culture, while closely related languages—even a single language—can belong to different cultural spheres. There are many great examples in indigenous America. The Athabaskan languages form a clearly unified and structurally specialized group like any I know of.[187] The speakers of these languages belong to four distinct cultural areas: the simple hunting culture of western Canada and interior Alaska (Loucheux, Chipewyan), the buffalo culture of the Plains (Sarcee), the highly ritualized culture of the Southwest (Navaho), and the uniquely specialized culture of northwestern California (Hupa). The cultural adaptability of the Athabaskan-speaking peoples sharply contrasts with how inaccessible their languages are to foreign influences.[188] The Hupa Indians are very representative of the cultural area they belong to. They share cultural similarities with the neighboring Yurok and Karok tribes. There is vibrant intertribal interaction among the Hupa, Yurok, and Karok, to the extent that all three typically participate in important religious ceremonies hosted by any one of them. It’s hard to pinpoint which cultural elements originated from this tribe or that because they are so unified in communal actions, feelings, and thoughts. However, their languages are not just different from each other; they belong to three of the major American linguistic families, each with a vast distribution across the northern continent. Hupa, as we’ve noted, is Athabaskan and is also distantly related to Haida (Queen Charlotte Islands) and Tlingit (southern Alaska); Yurok is one of the two isolated Californian languages from the Algonkin family, which is primarily centered around the Great Lakes; Karok is the northernmost member of the Hokan group, which extends south beyond California and has distant relatives along the Gulf of Mexico.

Returning to English, most of us would readily admit, I believe, that the community of language between Great Britain and the United States is far from arguing a like community of culture. It is customary to say that they possess a common “Anglo-Saxon” cultural heritage, but are not many significant differences in life and feeling obscured by the tendency of the “cultured” to take this common heritage too much for granted? In so far as America is still specifically “English,” it is only colonially or vestigially so; its prevailing cultural drift is partly towards autonomous and distinctive developments, partly towards immersion in the larger European culture of which that of England is only a particular facet. We cannot deny that the possession of a common language is still and will long continue to be a smoother of the way to a mutual cultural understanding between England and America, but it is very clear that other factors, some of them rapidly cumulative, are working powerfully to counteract this leveling influence. A common language cannot indefinitely set the seal on a common culture when the geographical, political, and economic determinants of the culture are no longer the same throughout its area.

Returning to English, most of us would agree that the language connection between Great Britain and the United States doesn't imply a similar cultural connection. It's common to say they share a common “Anglo-Saxon” cultural heritage, but aren't there many important differences in life and emotions that are overlooked because the “cultured” tend to take this shared heritage for granted? As far as America is still notably “English,” it's only in a colonial or superficial way; its main cultural direction is partly toward unique and independent developments and partly toward immersing itself in the broader European culture, of which England is just one aspect. We can't deny that having a common language is still and will continue to be a way to facilitate mutual cultural understanding between England and America, but it's quite obvious that other factors, some of which are quickly accumulating, are strongly working against this unifying effect. A common language can't indefinitely define a shared culture when the geographical, political, and economic aspects of that culture are no longer the same across its territory.

Language, race, and culture are not necessarily correlated. This does not mean that they never are. There is some tendency, as a matter of fact, for racial and cultural lines of cleavage to correspond to linguistic ones, though in any given case the latter may not be of the same degree of importance as the others. Thus, there is a fairly definite line of cleavage between the Polynesian languages, race, and culture on the one hand and those of the Melanesians on the other, in spite of a considerable amount of overlapping.[189] The racial and cultural division, however, particularly the former, are of major importance, while the linguistic division is of quite minor significance, the Polynesian languages constituting hardly more than a special dialectic subdivision of the combined Melanesian-Polynesian group. Still clearer-cut coincidences of cleavage may be found. The language, race, and culture of the Eskimo are markedly distinct from those of their neighbors;[190] in southern Africa the language, race, and culture of the Bushmen offer an even stronger contrast to those of their Bantu neighbors. Coincidences of this sort are of the greatest significance, of course, but this significance is not one of inherent psychological relation between the three factors of race, language, and culture. The coincidences of cleavage point merely to a readily intelligible historical association. If the Bantu and Bushmen are so sharply differentiated in all respects, the reason is simply that the former are relatively recent arrivals in southern Africa. The two peoples developed in complete isolation from each other; their present propinquity is too recent for the slow process of cultural and racial assimilation to have set in very powerfully. As we go back in time, we shall have to assume that relatively scanty populations occupied large territories for untold generations and that contact with other masses of population was not as insistent and prolonged as it later became. The geographical and historical isolation that brought about race differentiations was naturally favorable also to far-reaching variations in language and culture. The very fact that races and cultures which are brought into historical contact tend to assimilate in the long run, while neighboring languages assimilate each other only casually and in superficial respects[191], indicates that there is no profound causal relation between the development of language and the specific development of race and of culture.

Language, race, and culture aren’t always connected. That doesn't mean they're never linked. In fact, there’s a tendency for racial and cultural differences to line up with linguistic ones, although in any specific situation, the importance of language may not be as significant as the others. For example, there’s a clear distinction between Polynesian languages, race, and culture on one side and those of the Melanesians on the other, even though there is some overlap. The racial and cultural differences, especially the former, are very important, while the linguistic differences are much less significant, with Polynesian languages being barely more than a unique dialect within the broader Melanesian-Polynesian group. Even clearer examples can be found. The language, race, and culture of the Eskimos are distinctly different from those of their neighbors; in southern Africa, the language, race, and culture of the Bushmen contrast even more sharply with those of the Bantu people. These kinds of connections are very important, but this importance doesn't imply a deep psychological link between race, language, and culture. The overlapping differences simply indicate a straightforward historical connection. The reason the Bantu and Bushmen are so clearly different in every way is that the Bantu are relatively new arrivals in southern Africa. The two groups developed completely separately; their current proximity is too recent for significant cultural and racial mixing to have occurred. Looking back in time, we can assume that relatively small populations inhabited large areas for many generations, and that contact with other population groups wasn’t as frequent or long-lasting as it became later. The geographical and historical separation that caused racial differences also allowed for extensive variations in language and culture. The fact that races and cultures that come into historical contact tend to assimilate over time, while neighboring languages tend to influence each other only slightly and superficially, shows that there’s no deep causal relationship between the development of language and the distinct development of race and culture.

But surely, the wary reader will object, there must be some relation between language and culture, and between language and at least that intangible aspect of race that we call “temperament”. Is it not inconceivable that the particular collective qualities of mind that have fashioned a culture are not precisely the same as were responsible for the growth of a particular linguistic morphology? This question takes us into the heart of the most difficult problems of social psychology. It is doubtful if any one has yet attained to sufficient clarity on the nature of the historical process and on the ultimate psychological factors involved in linguistic and cultural drifts to answer it intelligently. I can only very briefly set forth my own views, or rather my general attitude. It would be very difficult to prove that “temperament”, the general emotional disposition of a people[192], is basically responsible for the slant and drift of a culture, however much it may manifest itself in an individual’s handling of the elements of that culture. But granted that temperament has a certain value for the shaping of culture, difficult though it be to say just how, it does not follow that it has the same value for the shaping of language. It is impossible to show that the form of a language has the slightest connection with national temperament. Its line of variation, its drift, runs inexorably in the channel ordained for it by its historic antecedents; it is as regardless of the feelings and sentiments of its speakers as is the course of a river of the atmospheric humors of the landscape. I am convinced that it is futile to look in linguistic structure for differences corresponding to the temperamental variations which are supposed to be correlated with race. In this connection it is well to remember that the emotional aspect of our psychic life is but meagerly expressed in the build of language[193].

But surely, a cautious reader might argue, there has to be some connection between language and culture, as well as language and that elusive aspect of race we call “temperament.” Is it really hard to believe that the unique collective traits that shaped a culture are exactly the same ones that influenced the development of a specific linguistic structure? This question leads us into the core of the most challenging issues in social psychology. It’s unlikely that anyone has yet gained enough clarity on the nature of the historical process and the ultimate psychological factors involved in linguistic and cultural changes to answer it thoughtfully. I can only briefly share my own perspective, or rather my general outlook. It would be very difficult to demonstrate that “temperament,” the overall emotional tendency of a people[192], is fundamentally responsible for the direction and evolution of a culture, no matter how much it may show up in an individual’s engagement with that culture. However, even if we accept that temperament holds some significance in shaping culture, though it's hard to define how, it doesn't mean it carries the same weight in shaping language. It’s impossible to prove that the structure of a language has any connection to national temperament. Its variations and evolution follow an unyielding path determined by its historical background; they are as indifferent to the feelings and emotions of its speakers as a river is to the weather conditions of the surrounding landscape. I believe it’s pointless to search for differences in linguistic structure that correspond to the supposed temperamental variations associated with race. In this context, it’s important to remember that the emotional aspects of our psychological life are only minimally reflected in the structure of language[193].

Language and our thought-grooves are inextricably interwoven, are, in a sense, one and the same. As there is nothing to show that there are significant racial differences in the fundamental conformation of thought, it follows that the infinite variability of linguistic form, another name for the infinite variability of the actual process of thought, cannot be an index of such significant racial differences. This is only apparently a paradox. The latent content of all languages is the same—the intuitive science of experience. It is the manifest form that is never twice the same, for this form, which we call linguistic morphology, is nothing more nor less than a collective art of thought, an art denuded of the irrelevancies of individual sentiment. At last analysis, then, language can no more flow from race as such than can the sonnet form.

Language and our ways of thinking are tightly connected; they’re basically the same thing. Since there's no evidence of significant racial differences in how we fundamentally think, it follows that the endless variety of language forms—which is just another way of describing the endless variety of actual thought processes—can’t indicate any significant racial differences. This might seem like a paradox, but it's not. The underlying meaning of all languages is the same—the intuitive science of experience. The surface forms, however, are never identical, because these forms, which we call linguistic morphology, are really just a collective art of thought, stripped of the distractions of personal feelings. Ultimately, language cannot arise from race any more than the sonnet form can.

Nor can I believe that culture and language are in any true sense causally related. Culture may be defined as what a society does and thinks. Language is a particular how of thought. It is difficult to see what particular causal relations may be expected to subsist between a selected inventory of experience (culture, a significant selection made by society) and the particular manner in which the society expresses all experience. The drift of culture, another way of saying history, is a complex series of changes in society’s selected inventory—additions, losses, changes of emphasis and relation. The drift of language is not properly concerned with changes of content at all, merely with changes in formal expression. It is possible, in thought, to change every sound, word, and concrete concept of a language without changing its inner actuality in the least, just as one can pour into a fixed mold water or plaster or molten gold. If it can be shown that culture has an innate form, a series of contours, quite apart from subject-matter of any description whatsoever, we have a something in culture that may serve as a term of comparison with and possibly a means of relating it to language. But until such purely formal patterns of culture are discovered and laid bare, we shall do well to hold the drifts of language and of culture to be non-comparable and unrelated processes. From this it follows that all attempts to connect particular types of linguistic morphology with certain correlated stages of cultural development are vain. Rightly understood, such correlations are rubbish. The merest coup d’oeil verifies our theoretical argument on this point. Both simple and complex types of language of an indefinite number of varieties may be found spoken at any desired level of cultural advance. When it comes to linguistic form, Plato walks with the Macedonian swineherd, Confucius with the head-hunting savage of Assam.

I also can’t accept that culture and language are really connected in a causal way. Culture can be defined as what a society thinks and does. Language is a particular how of thinking. It's hard to understand what kind of causal relationships might exist between a curated collection of experiences (culture, a meaningful selection made by society) and how that society expresses all experiences. The evolution of culture, another way to say history, involves a complicated series of changes in the society’s chosen collection—additions, losses, shifts in emphasis, and changes in relationships. The evolution of language doesn’t really deal with changes in content at all, but rather with changes in formal expression. It’s possible, in thought, to alter every sound, word, and concrete concept of a language without modifying its core essence at all, just as you can pour water or plaster or molten gold into a fixed mold. If it can be shown that culture has an inherent form, a series of patterns completely separate from any subject matter, then we have something in culture that could serve as a reference point with and perhaps a way to relate it to language. But until such purely formal structures of culture are discovered and revealed, we would do well to consider the developments of language and culture as non-comparable and unrelated processes. This means that any attempts to link specific types of linguistic structure with certain corresponding stages of cultural development are futile. If understood correctly, such connections are nonsense. A mere coup d’oeil confirms our theoretical argument on this matter. Both simple and complex types of language of various kinds can be found spoken at any level of cultural advancement. When it comes to linguistic form, Plato walks alongside the Macedonian swineherd, and Confucius with the head-hunting savage of Assam.

It goes without saying that the mere content of language is intimately related to culture. A society that has no knowledge of theosophy need have no name for it; aborigines that had never seen or heard of a horse were compelled to invent or borrow a word for the animal when they made his acquaintance. In the sense that the vocabulary of a language more or less faithfully reflects the culture whose purposes it serves it is perfectly true that the history of language and the history of culture move along parallel lines. But this superficial and extraneous kind of parallelism is of no real interest to the linguist except in so far as the growth or borrowing of new words incidentally throws light on the formal trends of the language. The linguistic student should never make the mistake of identifying a language with its dictionary.

It goes without saying that the content of language is closely tied to culture. A society that has no knowledge of theosophy doesn’t need a name for it; indigenous people who have never seen or heard of a horse had to create or borrow a word for the animal when they encountered it. In the sense that a language's vocabulary generally reflects the culture it serves, it is true that the history of language and the history of culture run along similar paths. However, this superficial and external kind of similarity doesn't really interest linguists except to the extent that the growth or borrowing of new words sheds light on the formal trends of the language. A linguistic student should never mistakenly equate a language with its dictionary.

If both this and the preceding chapter have been largely negative in their contentions, I believe that they have been healthily so. There is perhaps no better way to learn the essential nature of speech than to realize what it is not and what it does not do. Its superficial connections with other historic processes are so close that it needs to be shaken free of them if we are to see it in its own right. Everything that we have so far seen to be true of language points to the fact that it is the most significant and colossal work that the human spirit has evolved—nothing short of a finished form of expression for all communicable experience. This form may be endlessly varied by the individual without thereby losing its distinctive contours; and it is constantly reshaping itself as is all art. Language is the most massive and inclusive art we know, a mountainous and anonymous work of unconscious generations.

If both this chapter and the one before it have largely focused on negatives, I think that's actually a good thing. There’s probably no better way to understand the true nature of speech than to recognize what it isn’t and what it doesn’t do. Its superficial links to other historical processes are so strong that we need to detach it from them to appreciate it on its own. Everything we've seen so far about language shows that it’s the most significant and monumental creation of the human spirit—nothing less than a complete form of expression for all experiences we can share. This form can be endlessly personalized without losing its unique shape; and it constantly evolves, just like all art does. Language is the most vast and all-encompassing art we know, a colossal and collective creation of countless generations.

XI

Language and Literature

Languages are more to us than systems of thought-transference. They are invisible garments that drape themselves about our spirit and give a predetermined form to all its symbolic expression. When the expression is of unusual significance, we call it literature.[194] Art is so personal an expression that we do not like to feel that it is bound to predetermined form of any sort. The possibilities of individual expression are infinite, language in particular is the most fluid of mediums. Yet some limitation there must be to this freedom, some resistance of the medium. In great art there is the illusion of absolute freedom. The formal restraints imposed by the material—paint, black and white, marble, piano tones, or whatever it may be—are not perceived; it is as though there were a limitless margin of elbow-room between the artist’s fullest utilization of form and the most that the material is innately capable of. The artist has intuitively surrendered to the inescapable tyranny of the material, made its brute nature fuse easily with his conception.[195] The material “disappears” precisely because there is nothing in the artist’s conception to indicate that any other material exists. For the time being, he, and we with him, move in the artistic medium as a fish moves in the water, oblivious of the existence of an alien atmosphere. No sooner, however, does the artist transgress the law of his medium than we realize with a start that there is a medium to obey.

Languages are more to us than just ways to share thoughts. They're like invisible clothes that envelop our spirit and shape all its symbolic expressions. When these expressions hold special meaning, we call it literature.[194] Art is such a personal expression that we don't want to think it's restricted by any set form. The possibilities for individual expression are endless; language, in particular, is the most adaptable medium. However, some limitations must exist within this freedom, some resistance from the medium itself. In great art, there's the illusion of total freedom. We don’t notice the formal constraints that the material imposes—be it paint, black and white, marble, piano tones, or anything else—it feels like there's limitless space between the artist's full use of form and the maximum that the material can do. The artist has intuitively accepted the unavoidable dominance of the material, allowing its raw nature to blend seamlessly with their vision.[195] The material “disappears” precisely because the artist’s vision doesn’t suggest that any other material could exist. For the moment, he, and we along with him, navigate the artistic medium like a fish in water, unaware of an outside atmosphere. Yet, as soon as the artist breaks the rules of their medium, we suddenly realize that there are rules to follow.

Language is the medium of literature as marble or bronze or clay are the materials of the sculptor. Since every language has its distinctive peculiarities, the innate formal limitations—and possibilities—of one literature are never quite the same as those of another. The literature fashioned out of the form and substance of a language has the color and the texture of its matrix. The literary artist may never be conscious of just how he is hindered or helped or otherwise guided by the matrix, but when it is a question of translating his work into another language, the nature of the original matrix manifests itself at once. All his effects have been calculated, or intuitively felt, with reference to the formal “genius” of his own language; they cannot be carried over without loss or modification. Croce[196] is therefore perfectly right in saying that a work of literary art can never be translated. Nevertheless literature does get itself translated, sometimes with astonishing adequacy. This brings up the question whether in the art of literature there are not intertwined two distinct kinds or levels of art—a generalized, non-linguistic art, which can be transferred without loss into an alien linguistic medium, and a specifically linguistic art that is not transferable.[197] I believe the distinction is entirely valid, though we never get the two levels pure in practice. Literature moves in language as a medium, but that medium comprises two layers, the latent content of language—our intuitive record of experience—and the particular conformation of a given language—the specific how of our record of experience. Literature that draws its sustenance mainly—never entirely—from the lower level, say a play of Shakespeare’s, is translatable without too great a loss of character. If it moves in the upper rather than in the lower level—a fair example is a lyric of Swinburne’s—it is as good as untranslatable. Both types of literary expression may be great or mediocre.

Language is the medium of literature just like marble, bronze, or clay are the materials for a sculptor. Since every language has its own unique features, the inherent formal limitations—and possibilities—of one type of literature are never quite the same as those of another. The literature created from the form and substance of a language has the color and texture of its base. The literary artist may not always be aware of how the language shapes or influences their work, but when it comes to translating their work into another language, the characteristics of the original language become immediately clear. All their effects have been calculated or felt intuitively with reference to the inherent "genius" of their own language; they cannot be transferred without some loss or alteration. Croce[196] is completely correct in saying that a work of literary art can never be fully translated. However, literature can be translated, sometimes quite effectively. This raises the question of whether the art of literature consists of two intertwined types or levels of art—a general, non-linguistic art that can be transferred without loss into a different language, and a specifically linguistic art that cannot be transferred.[197] I think this distinction is entirely valid, although in practice we rarely encounter the two levels in their pure forms. Literature operates within language as its medium, but that medium consists of two layers: the underlying content of language—our intuitive record of experience—and the specific structure of a given language—the particular way we record our experiences. Literature that primarily draws its strength—though never completely—from the lower level, like a play by Shakespeare, can be translated with minimal loss of character. If it relies more on the upper level instead of the lower—like a lyric by Swinburne—it is nearly impossible to translate. Both types of literary expression can be either great or mediocre.

There is really no mystery in the distinction. It can be clarified a little by comparing literature with science. A scientific truth is impersonal, in its essence it is untinctured by the particular linguistic medium in which it finds expression. It can as readily deliver its message in Chinese[198] as in English. Nevertheless it must have some expression, and that expression must needs be a linguistic one. Indeed the apprehension of the scientific truth is itself a linguistic process, for thought is nothing but language denuded of its outward garb. The proper medium of scientific expression is therefore a generalized language that may be defined as a symbolic algebra of which all known languages are translations. One can adequately translate scientific literature because the original scientific expression is itself a translation. Literary expression is personal and concrete, but this does not mean that its significance is altogether bound up with the accidental qualities of the medium. A truly deep symbolism, for instance, does not depend on the verbal associations of a particular language but rests securely on an intuitive basis that underlies all linguistic expression. The artist’s “intuition,” to use Croce’s term, is immediately fashioned out of a generalized human experience—thought and feeling—of which his own individual experience is a highly personalized selection. The thought relations in this deeper level have no specific linguistic vesture; the rhythms are free, not bound, in the first instance, to the traditional rhythms of the artist’s language. Certain artists whose spirit moves largely in the non-linguistic (better, in the generalized linguistic) layer even find a certain difficulty in getting themselves expressed in the rigidly set terms of their accepted idiom. One feels that they are unconsciously striving for a generalized art language, a literary algebra, that is related to the sum of all known languages as a perfect mathematical symbolism is related to all the roundabout reports of mathematical relations that normal speech is capable of conveying. Their art expression is frequently strained, it sounds at times like a translation from an unknown original—which, indeed, is precisely what it is. These artists—Whitmans and Brownings—impress us rather by the greatness of their spirit than the felicity of their art. Their relative failure is of the greatest diagnostic value as an index of the pervasive presence in literature of a larger, more intuitive linguistic medium than any particular language.

There’s really no mystery in the difference. It can be made clearer by comparing literature with science. A scientific truth is impersonal; it isn’t influenced by the specific language it’s expressed in. It can just as easily convey its message in Chinese[198] as in English. However, it does need to be expressed, and that expression must be linguistic. In fact, understanding scientific truth is itself a linguistic process, because thought is essentially language stripped of its outer form. The right medium for scientific expression is therefore a generalized language that can be defined as a symbolic algebra, with all known languages being its translations. One can accurately translate scientific literature because the original scientific expression is a translation itself. Literary expression is personal and concrete, but that doesn’t mean its significance is completely tied to the random qualities of the medium. A truly deep symbolism, for example, doesn’t depend on the verbal associations of a specific language; it rests on an intuitive foundation that underlies all linguistic expression. The artist’s “intuition,” to use Croce’s term, is directly shaped by a generalized human experience—thought and feeling—of which their own individual experience is a highly personal selection. The thought connections at this deeper level don’t have specific linguistic forms; the rhythms are free, not initially tied to the traditional rhythms of the artist’s language. Some artists, who primarily operate in the non-linguistic (or better, generalized linguistic) realm, find it a challenge to express themselves within the rigid confines of their usual language. It seems they are unconsciously seeking a generalized language of art, a literary algebra, that relates to the sum of all known languages just as perfect mathematical symbolism relates to all the indirect ways normal speech can convey mathematical relationships. Their artistic expression often feels strained, sometimes sounding like a translation from an unknown original—which, in fact, is exactly what it is. These artists—Whitmans and Brownings—impress us more with the greatness of their spirit than the quality of their art. Their relative shortcomings are highly valuable as an indicator of the overarching presence in literature of a broader, more intuitive linguistic medium than any specific language.

Nevertheless, human expression being what it is, the greatest—or shall we say the most satisfying—literary artists, the Shakespeares and Heines, are those who have known subconsciously to fit or trim the deeper intuition to the provincial accents of their daily speech. In them there is no effect of strain. Their personal “intuition” appears as a completed synthesis of the absolute art of intuition and the innate, specialized art of the linguistic medium. With Heine, for instance, one is under the illusion that the universe speaks German. The material “disappears.”

Nevertheless, human expression being what it is, the greatest—or should we say the most fulfilling—literary artists, like Shakespeares and Heines, are those who have subconsciously learned to adjust their deeper insights to the everyday language they spoke. In their work, there’s no sense of struggle. Their personal “intuition” comes across as a complete blend of pure artistic intuition and the natural, specialized craft of language. With Heine, for example, one feels as though the universe is speaking German. The content “vanishes.”

Every language is itself a collective art of expression. There is concealed in it a particular set of esthetic factors—phonetic, rhythmic, symbolic, morphological—which it does not completely share with any other language. These factors may either merge their potencies with those of that unknown, absolute language to which I have referred—this is the method of Shakespeare and Heine—or they may weave a private, technical art fabric of their own, the innate art of the language intensified or sublimated. The latter type, the more technically “literary” art of Swinburne and of hosts of delicate “minor” poets, is too fragile for endurance. It is built out of spiritualized material, not out of spirit. The successes of the Swinburnes are as valuable for diagnostic purposes as the semi-failures of the Brownings. They show to what extent literary art may lean on the collective art of the language itself. The more extreme technical practitioners may so over-individualize this collective art as to make it almost unendurable. One is not always thankful to have one’s flesh and blood frozen to ivory.

Every language is a collective form of expression. Within it lies a unique set of aesthetic elements—phonetic, rhythmic, symbolic, and morphological—that it does not fully share with any other language. These elements can either merge with that unknown, absolute language I mentioned—this is how Shakespeare and Heine operated—or they can create their own specialized, technical art unique to that language, enhancing or elevating its inherent artistry. The latter type, the more technically "literary" art of Swinburne and many other delicate "minor" poets, is too fragile to last. It is made from spiritualized material, not pure spirit. The achievements of Swinburne are as valuable for understanding as the partial failures of the Brownings. They reveal how much literary art can rely on the collective expression of the language itself. The more extreme technical creators can individualize this collective art to the point that it becomes nearly unbearable. One is not always grateful to have one's flesh and blood turned to ivory.

An artist must utilize the native esthetic resources of his speech. He may be thankful if the given palette of colors is rich, if the springboard is light. But he deserves no special credit for felicities that are the language’s own. We must take for granted this language with all its qualities of flexibility or rigidity and see the artist’s work in relation to it. A cathedral on the lowlands is higher than a stick on Mont Blanc. In other words, we must not commit the folly of admiring a French sonnet because the vowels are more sonorous than our own or of condemning Nietzsche’s prose because it harbors in its texture combinations of consonants that would affright on English soil. To so judge literature would be tantamount to loving “Tristan und Isolde” because one is fond of the timbre of horns. There are certain things that one language can do supremely well which it would be almost vain for another to attempt. Generally there are compensations. The vocalism of English is an inherently drabber thing than the vowel scale of French, yet English compensates for this drawback by its greater rhythmical alertness. It is even doubtful if the innate sonority of a phonetic system counts for as much, as esthetic determinant, as the relations between the sounds, the total gamut of their similarities and contrasts. As long as the artist has the wherewithal to lay out his sequences and rhythms, it matters little what are the sensuous qualities of the elements of his material.

An artist must use the natural aesthetic resources of their language. They can be grateful if the available palette of colors is rich and if the springboard is light. But they shouldn't get extra credit for the happy qualities that come from the language itself. We need to accept this language with all its flexibility or rigidity and view the artist’s work in relation to it. A cathedral in a lowland area is taller than a stick on Mont Blanc. In other words, we should avoid the mistake of admiring a French sonnet just because the vowels sound nicer than our own, or condemning Nietzsche’s prose because its consonant combinations might scare someone who only speaks English. Judging literature this way would be like loving “Tristan und Isolde” simply because you like the sound of horns. There are certain things that one language can do exceptionally well that it would be almost pointless for another to try. Generally, there are trade-offs. The sound system of English is inherently less vibrant than the vowel scale of French, but English makes up for this with its greater rhythmic variety. It’s even questionable whether the natural richness of a phonetic system matters as much from an aesthetic perspective as the relationships between the sounds, and the full range of their similarities and contrasts. As long as the artist can arrange their sequences and rhythms, it doesn’t really matter what the sensory qualities of their materials are.

The phonetic groundwork of a language, however, is only one of the features that give its literature a certain direction. Far more important are its morphological peculiarities. It makes a great deal of difference for the development of style if the language can or cannot create compound words, if its structure is synthetic or analytic, if the words of its sentences have considerable freedom of position or are compelled to fall into a rigidly determined sequence. The major characteristics of style, in so far as style is a technical matter of the building and placing of words, are given by the language itself, quite as inescapably, indeed, as the general acoustic effect of verse is given by the sounds and natural accents of the language. These necessary fundamentals of style are hardly felt by the artist to constrain his individuality of expression. They rather point the way to those stylistic developments that most suit the natural bent of the language. It is not in the least likely that a truly great style can seriously oppose itself to the basic form patterns of the language. It not only incorporates them, it builds on them. The merit of such a style as W.H. Hudson’s or George Moore’s[199] is that it does with ease and economy what the language is always trying to do. Carlylese, though individual and vigorous, is yet not style; it is a Teutonic mannerism. Nor is the prose of Milton and his contemporaries strictly English; it is semi-Latin done into magnificent English words.

The phonetic foundation of a language is just one aspect that shapes its literature. Much more significant are its morphological characteristics. The ability to create compound words, whether the language is more synthetic or analytic, and the freedom of word placement in sentences can greatly influence the style's development. The major features of style, as it relates to the technical aspects of constructing and arranging words, are determined by the language itself, just as the overall sound of verse is shaped by the language's sounds and natural stresses. These fundamental aspects of style typically don't feel like constraints on an artist's individual expression; instead, they guide the stylistic developments that align best with the language’s natural tendencies. It's unlikely that a truly great style would stand in opposition to the language's basic structures; rather, it incorporates and builds upon them. The value of styles like those of W.H. Hudson or George Moore[199] lies in their ability to do what the language naturally strives for with ease and efficiency. Carlylese, while unique and forceful, is not true style; it’s more of a Teutonic quirk. Similarly, Milton’s prose and that of his contemporaries isn't purely English; it's semi-Latin expressed through magnificent English words.

It is strange how long it has taken the European literatures to learn that style is not an absolute, a something that is to be imposed on the language from Greek or Latin models, but merely the language itself, running in its natural grooves, and with enough of an individual accent to allow the artist’s personality to be felt as a presence, not as an acrobat. We understand more clearly now that what is effective and beautiful in one language is a vice in another. Latin and Eskimo, with their highly inflected forms, lend themselves to an elaborately periodic structure that would be boring in English. English allows, even demands, a looseness that would be insipid in Chinese. And Chinese, with its unmodified words and rigid sequences, has a compactness of phrase, a terse parallelism, and a silent suggestiveness that would be too tart, too mathematical, for the English genius. While we cannot assimilate the luxurious periods of Latin nor the pointilliste style of the Chinese classics, we can enter sympathetically into the spirit of these alien techniques.

It's interesting how long it took European literature to realize that style isn't an absolute concept imposed on language based on Greek or Latin models. Instead, it's just the language itself, flowing naturally and with enough personal touch to let the artist's personality come through as a presence, not just a performance. We understand better now that what works and is beautiful in one language can be seen as a flaw in another. Latin and Eskimo, with their complex forms, are suited to a detailed, periodic structure that would be dull in English. English allows—and even requires—a looseness that would come off as bland in Chinese. On the other hand, Chinese, with its unmodified words and strict sequences, conveys a compactness, a concise parallelism, and a subtle suggestion that would be too sharp and too formulaic for English sensibilities. While we may struggle to adopt the elaborate structures of Latin or the pointillist style of the Chinese classics, we can still empathetically engage with the essence of these foreign techniques.

I believe that any English poet of to-day would be thankful for the concision that a Chinese poetaster attains without effort. Here is an example:[200]

I think that any English poet today would appreciate the brevity that a Chinese poet achieves effortlessly. Here’s an example:[200]

Wu River[201] at sunset by the river's mouth,
Looking north toward Liao-Tung,[202] unable to see my home.
The sound of steam whistles fills the air, limitless sky and earth,
A single reed floats out from the Middle Kingdom.

These twenty-eight syllables may be clumsily interpreted: “At the mouth of the Yangtsze River, as the sun is about to sink, I look north toward Liao-Tung but do not see my home. The steam-whistle shrills several times on the boundless expanse where meet sky and earth. The steamer, floating gently like a hollow reed, sails out of the Middle Kingdom.”[203] But we must not envy Chinese its terseness unduly. Our more sprawling mode of expression is capable of its own beauties, and the more compact luxuriance of Latin style has its loveliness too. There are almost as many natural ideals of literary style as there are languages. Most of these are merely potential, awaiting the hand of artists who will never come. And yet in the recorded texts of primitive tradition and song there are many passages of unique vigor and beauty. The structure of the language often forces an assemblage of concepts that impresses us as a stylistic discovery. Single Algonkin words are like tiny imagist poems. We must be careful not to exaggerate a freshness of content that is at least half due to our freshness of approach, but the possibility is indicated none the less of utterly alien literary styles, each distinctive with its disclosure of the search of the human spirit for beautiful form.

These twenty-eight syllables can be awkwardly interpreted: “At the mouth of the Yangtze River, as the sun is about to set, I look north toward Liao-Tung but don’t see my home. The steam-whistle shrills several times across the vast stretch where sky and earth meet. The steamer, floating softly like a hollow reed, sails out of the Middle Kingdom.”[203] But we shouldn’t envy the Chinese too much for their brevity. Our more expansive way of expressing ourselves has its own beauty, and the more compact richness of Latin style has its own charm as well. There are nearly as many natural ideals of literary style as there are languages. Most of these are just potential, waiting for artists who may never come. Yet, in the documented texts of primitive tradition and song, there are many passages filled with unique strength and beauty. The structure of the language often compels a collection of concepts that strikes us as a stylistic breakthrough. Single Algonkin words are like tiny imagist poems. We need to be careful not to overstate a freshness of content that is at least partly due to our fresh perspective, but it still suggests the existence of completely different literary styles, each distinct in its expression of the human spirit's quest for beautiful form.

Probably nothing better illustrates the formal dependence of literature on language than the prosodic aspect of poetry. Quantitative verse was entirely natural to the Greeks, not merely because poetry grew up in connection with the chant and the dance,[204] but because alternations of long and short syllables were keenly live facts in the daily economy of the language. The tonal accents, which were only secondarily stress phenomena, helped to give the syllable its quantitative individuality. When the Greek meters were carried over into Latin verse, there was comparatively little strain, for Latin too was characterized by an acute awareness of quantitative distinctions. However, the Latin accent was more markedly stressed than that of Greek. Probably, therefore, the purely quantitative meters modeled after the Greek were felt as a shade more artificial than in the language of their origin. The attempt to cast English verse into Latin and Greek molds has never been successful. The dynamic basis of English is not quantity,[205] but stress, the alternation of accented and unaccented syllables. This fact gives English verse an entirely different slant and has determined the development of its poetic forms, is still responsible for the evolution of new forms. Neither stress nor syllabic weight is a very keen psychologic factor in the dynamics of French. The syllable has great inherent sonority and does not fluctuate significantly as to quantity and stress. Quantitative or accentual metrics would be as artificial in French as stress metrics in classical Greek or quantitative or purely syllabic metrics in English. French prosody was compelled to develop on the basis of unit syllable-groups. Assonance, later rhyme, could not but prove a welcome, an all but necessary, means of articulating or sectioning the somewhat spineless flow of sonorous syllables. English was hospitable to the French suggestion of rhyme, but did not seriously need it in its rhythmic economy. Hence rhyme has always been strictly subordinated to stress as a somewhat decorative feature and has been frequently dispensed with. It is no psychologic accident that rhyme came later into English than in French and is leaving it sooner.[206] Chinese verse has developed along very much the same lines as French verse. The syllable is an even more integral and sonorous unit than in French, while quantity and stress are too uncertain to form the basis of a metric system. Syllable-groups—so and so many syllables per rhythmic unit—and rhyme are therefore two of the controlling factors in Chinese prosody. The third factor, the alternation of syllables with level tone and syllables with inflected (rising or falling) tone, is peculiar to Chinese.

Probably nothing illustrates the formal dependence of literature on language better than the rhythmic aspect of poetry. Quantitative verse was completely natural for the Greeks, not just because poetry developed alongside chant and dance,[204] but because the changes between long and short syllables were very much a part of everyday language. Tonal accents, which were primarily secondary stress phenomena, helped to give each syllable its unique quantitative character. When Greek meters were adapted into Latin verse, the transition was relatively smooth, as Latin also had a strong awareness of quantitative differences. However, Latin accents were more stressed than those in Greek. Therefore, the purely quantitative meters imitating Greek were perceived as slightly more artificial in the Latin language. The attempt to shape English verse in the style of Latin and Greek has never really worked. The dynamic foundation of English isn’t quantity,[205] but stress, which is the alternation of stressed and unstressed syllables. This difference gives English verse a completely different perspective and has shaped the evolution of its poetic forms, and continues to influence new forms. Neither stress nor syllabic weight is a strong psychological factor in the dynamics of French. The syllable in French is inherently sonorous and doesn’t vary much in quantity or stress. Quantitative or accentual metrics would feel just as artificial in French as stress metrics would in classical Greek or quantitative or purely syllabic metrics in English. French prosody had to develop based on unit syllable groups. Assonance, later rhyme, became an essential and almost necessary way to articulate or break up the somewhat seamless flow of sonorous syllables. English was open to the French influence of rhyme, but it didn’t really need it for its rhythm. Thus, rhyme has always been subordinate to stress as more of a decorative element and has often been left out. It’s not a psychological coincidence that rhyme appeared later in English than in French and is departing sooner.[206] Chinese verse has developed along very similar lines to French verse. The syllable forms an even more integral and sonorous unit than in French, while quantity and stress are too variable to serve as the foundation for a metric system. Syllable groups—specifying a certain number of syllables per rhythmic unit—and rhyme are thus two key factors in Chinese prosody. The third factor, the alternation of syllables with level tone and those with inflected (rising or falling) tone, is unique to Chinese.

To summarize, Latin and Greek verse depends on the principle of contrasting weights; English verse, on the principle of contrasting stresses; French verse, on the principles of number and echo; Chinese verse, on the principles of number, echo, and contrasting pitches. Each of these rhythmic systems proceeds from the unconscious dynamic habit of the language, falling from the lips of the folk. Study carefully the phonetic system of a language, above all its dynamic features, and you can tell what kind of a verse it has developed—or, if history has played pranks with its phychology, what kind of verse it should have developed and some day will.

To sum up, Latin and Greek verse relies on the idea of contrasting weights; English verse relies on the idea of contrasting stresses; French verse is based on the principles of number and echo; Chinese verse is based on the principles of number, echo, and contrasting pitches. Each of these rhythmic systems comes from the natural habits of the language, flowing from the speech of the people. Examine the phonetic system of a language closely, especially its dynamic aspects, and you can determine what kind of verse it has developed—or, if history has altered its psychology, what kind of verse it ought to have developed and eventually will.

Whatever be the sounds, accents, and forms of a language, however these lay hands on the shape of its literature, there is a subtle law of compensations that gives the artist space. If he is squeezed a bit here, he can swing a free arm there. And generally he has rope enough to hang himself with, if he must. It is not strange that this should be so. Language is itself the collective art of expression, a summary of thousands upon thousands of individual intuitions. The individual goes lost in the collective creation, but his personal expression has left some trace in a certain give and flexibility that are inherent in all collective works of the human spirit. The language is ready, or can be quickly made ready, to define the artist’s individuality. If no literary artist appears, it is not essentially because the language is too weak an instrument, it is because the culture of the people is not favorable to the growth of such personality as seeks a truly individual verbal expression.

Whatever the sounds, accents, and forms of a language are, and however these influence the shape of its literature, there exists a subtle law of compensations that provides the artist with room to maneuver. If he feels restricted in one area, he can find freedom in another. Overall, he usually has enough leeway to take risks if necessary. It's not surprising that this is the case. Language is the collective art of expression, summarizing countless individual insights. The individual may get lost in this collective creation, but their personal expression leaves a mark in the unique flexibility and give that are part of all collective works of the human spirit. The language is ready, or can quickly be made ready, to express the artist’s individuality. If no literary artist emerges, it’s not primarily because the language is too weak a tool; it's because the culture of the people isn't conducive to the development of a personality that seeks a genuinely individual verbal expression.

Index

Note. Italicized entries are names of languages or groups of languages.

A

Abbreviation of stem, (26)
Accent, stress, (26) (36) (48) (55) (61) (64)
1. as grammatical process, (82) (83)
2. importance of, (118) (119) (120)
3. metrical value of (244) (245) (246)
“Accent,” (44)
“Adam’s apple,” (48)
Adjective, (123) (124) (125)
Affixation, (26) (64) (70-6)
Affixing languages, (133) (134) (137)
African languages, pitch in, (55)
Agglutination, (140-3)
Agglutinative languages, (130) (136-8) (139) (146) (147) (148) (150) (151) (155)
Agglutinative-fusional, (148) (150)
Agglutinative-isolating, (148) (150)
Algonkin languages (N. Amer.), (70) (74) (134) (151) (229) (244)
Alpine race, (223) (225)
Analogical leveling, (193) (197) (200-3)
Analytic tendency, (135) (136) (148) (150) (151) (154) (216) (217)
Angles, (224) (225)
Anglo-Saxon, (28) (175) (183) (185) (186-8) (191) (197) (198) (201)
Anglo-Saxon:
1. culture, (229)
2. race, (222) (223) (224)
Annamite (S.E. Asia), (66) (150) (205)
Apache (N. Amer.), (71)
Arabic, (76) (77) (135) (151) (207)
Armenian, (163) (212)
Art, (236-40)
1. language as, (233) (235) (240) (241) (246) (247)
2. transferability of, (237) (238)
Articulation:
1. ease of, (196)
2. types of, drift toward, (194)
Articulations:
1. laryngeal, (49)
2. manner of consonantal, (52) (53)
3. nasal, (50) (51)
4. oral, (51) (52)
5. place of consonantal, (53) (54)
6. vocalic, (52)
Aryan. See Indo-European.
Aspect, (114)
Association of concepts and speech elements, (38) (39)
Associations fundamental to speech, (10) (11)
Athabaskan languages (N. Amer.), (6) (71) (77) (83) (105) (209) (214) (219) (228) (229)
Athabaskans, cultures of, (228)
Attic dialect, (162)
Attribution, (101)
Auditory cycle in language, (17)
Australian culture, (221) (222)
Avestan, (175)

B

Bach, (238)
Baltic race, (223) (225) (226)
Bantu languages (Africa), (71) (113) (122) (123) (134) (135) (151) (221) (230)
Bantus, (230) (231)
Basque (Pyrenees), (164) (219)
Bengali (India), (155) (163)
Berber. See Hamitic.
Bohemians, (225)
Bontoc Igorot (Philippines), (75) (81)
Borrowing, morphological, (215-17) (219) (220)
Borrowing, word, (205-7)
1. phonetic adaptation in, (210) (211)
2. resistances to, (207-10)
Breton, (225)
Bronchial tubes, (48)
Browning, (239) (240)
Buddhism, influence of, (207) (209)
Burmese, (207)
Bushman (S. Africa), (55) (230)
Bushmen, (221) (230) (231)

C

Cambodgian (S.E. Asia), (71) (75) (108) (134) (150) (155) (207) (209) (219)
Carlyle, (242)
Carrier (British Columbia), (71)
Case, (115)
1. See Attribution; Object; Personal relations; Subject.
Case-system, history of, (174-7)
Caucasus, languages of, (213)
Celtic. See Celts.
Celtic languages, (78) (79)
Celts, (224) (225) (226)
1. Brythonic, (224)
“Cerebral” articulations, (54)
Chaucer, English of, (179) (188) (191) (211)
Chimariko (N. California), (73)
Chinese:
1. absence of affixes, (70)
2. analytic character, (135) (136)
3. attribution, (101)
4. compounds, (67)
5. grammatical concepts illustrated, (96) (97)
6. influence, (205) (207)
7. “inner form,”, (132)
8. pitch accent, (55) (83) (84)
9. radical words, (29)
10. relational use of material words, (108)
11. sounds, (49)
12. stress, (119)
13. structure, (150) (154) (155)
14. style, (243)
15. survivals, morphological, (152)
16. symbolism, (134)
17. verse, (243) (244) (245)
18. word duplication, (80)
19. word order, (66) (97) (118)
Chinook (N. Amer.), (66) (73) (76) (80) (121) (122) (123) (124) (135) (136) (151) (155) (220)
Chipewyan (N. Amer.), (71)
1. C. Indians, (228)
Chopin, (238)
Christianity, influence of, (206)
Chukchi, (230)
Classification:
1. of concepts, rigid, (104) (105)
2. of linguistic types, (129-56)
3. See Structure, linguistic.
“Clicks,” (55) (81)
Composition, (29) (30) (64) (145)
1. absence of, in certain languages, (68)
2. types of, (69) (70)
3. word order as related to, (67) (68)
Concepts, (12) (25-30) (31)
Concepts, grammatical:
1. analysis of, in sentence, (86-94)
2. classification of, (104) (105)
3. concrete, (86) (87) (92) (106)
4. concrete relational, (98-102) (107)
5. concreteness in, varying degree of, (108) (109)
6. derivational, (87) (88) (92) (106)
7. derivational, abstract, (109-11)
8. essential, (98) (99) (107) (108)
9. grouping of, non-logical, (94)
10. lack of expression of certain, (97) (98)
11. pure relational, (99) (107) (179)
12. radical, (88) (92) (98)
13. redistribution of, (94-8)
14. relational, (89-93) (98) (99)
15. thinning-out of significance of, (102-4)
16. types of, (106) (107) (108) (109)
17. typical categories of, (113-15)
18. See Structure, linguistic.
Concord, (100) (120-23)
Concrete concepts. See Concepts.
Conflict, (167) (168) (171) (172)
Consonantal change, (26) (61) (64) (78) (79)
Consonants, (52-4)
1. combinations of, (56)
Coördinate sentences, (37)
Corean, (205)
Croce, Benedetto, (237) (239)
Culture, (221)
1. language and, (227-30) (231) (232) (233-5)
2. language as aspect of, (2) (10)
3. language, race and, (222) (223) (230) (231)
4. reflection of history of, in language, (206) (207)
Culture areas, (221) (222) (228)

D

Danish, (49) (110) (136) (175) (217)
Demonstrative ideas, (97) (98) (114)
Dental articulations, (54) (192)
Derivational concepts. See Concepts.
Determinative structure, (135)
Dialects:
1. causes of, (160-3)
2. compromise between, (159)
3. distinctness of, (159)
4. drifts in, diverging, (183) (184)
5. drifts in, parallel, (184-93)
6. splitting up of, (162) (164)
7. unity of, (157-9)
Diffusion, morphological, (217-20)
Diphthongs, (56)
Drift, linguistic, (160-3) (183) (184)
1. components of, (172-4)
2. determinants of, in English, (168-82)
3. direction of, (165) (166) (183)
4. direction of, illustrated in English, (166-8)
5. examples of general, in English, (174-82)
6. parallelisms in, (184-93)
7. speed of, (183) (184)
8. See Phonetic Law; Phonetic processes.
Duplication of words, (79-81)
Dutch, (175) (188) (212) (224)

E

Elements of speech, (24-42)
Emotion, expression of:
1. involuntary, (3)
2. linguistic, (39-41)
English:
1. agentive suffix, (87)
2. analogical leveling, (202) (203)
3. analytic tendency, (135) (136) (216) (217)
4. animate and inanimate, (176) (177) (179) (180)
5. aspect, (114)
6. attribution, (101)
7. case, history of, (169) (170) (175-7) (179)
8. compounds, (67) (68) (69) (70)
9. concepts, grammatical, in sentence, (86-94)
10. concepts, passage of concrete into derivational, (108) (109)
11. consonantal change, (64) (78)
12. culture of speakers of, (229) (230)
13. desire, expression of, (39)
14. diminutive suffix, (87)
15. drift, (166-82)
16. duplication, word, (79) (80)
17. esthetic qualities, (241) (243)
18. feeling-tone, (41) (42)
19. form, word, (59) (60) (61)
20. French influence on, (206) (207) (208) (210) (211) (215) (216)
21. function and form, (93) (94)
22. fusing and juxtaposing, (137) (138) (139-41)
23. gender, (100)
24. Greek influence on, (215) (216)
25. influence of, (207)
26. influence on, morphological, lack of deep, (215-17)
27. interrogative words, (170)
28. invariable words, tendency to, (180-2) (208)
29. infixing, (75)
30. Latin influence on, (206) (207) (208) (215) (216)
31. loan-words, (182)
32. modality, (90) (91) (92) (93)
33. number, (90) (91)
34. order, word, (65) (66) (170) (171) (177-9) (191) (192)
35. parts of speech, (123-5)
36. patterning, formal, (62) (63)
37. personal relations, (91) (92) (93)
38. phonetic drifts, history of, (184-93) (194) (197-9)
39. phonetic leveling, (193) (194)
40. phonetic pattern, (200) (206)
41. plurality, (38) (39) (100) (105) (106) (202)
42. race of speakers of, (223-7)
43. reference, definiteness of, (89) (90) (92) (93)
44. relational words, (32)
45. relations, genetic, (163) (175) (183) (218)
46. rhythm, (171) (172)
47. sentence, analysis of, (37)
48. sentence, dependence of word on, (116)
49. sound-imitative words, (6) (80)
50. sounds, (44) (45) (49) (51) (53) (54)
51. stress and pitch, (36) (55) (83)
52. structure, (151) (180)
53. survivals, morphological, (149) (152)
54. symbolism, (134)
55. syntactic adhesions, (117) (118)
56. syntactic values, transfer of, (120)
57. tense, (91) (93) (102) (103) (104)
58. verb, syntactic relations of, (115)
59. verse, (245) (246)
60. vocalic change, (76)
61. word and element, analysis of, (25) (26) (27) (28) (29) (30) (35)
English, Middle, (175) (176) (188) (191) (201) (202) (203)
English people, (223) (224)
Eskimo, (60) (68) (70) (74) (118) (134) (135) (230) (243)
Eskimos, (230)
Ewe (Guinea coast, Africa), (80) (84) (150) (154) (155)
Expiratory sounds, (55)
“Explosives,” (52)

F

Faucal position, (53)
Feeling-tones of words, (41) (42)
Fijians, (230)
Finnish, (135) (155) (219)
Finns, (226)
Flemish, (212)
“Foot, feet” (English), history of, (184-93) (197-9) (201) (202)
Form, cultural, (233) (234)
1. feeling of language for, (58) (62) (63) (152) (153) (210) (220)
2. “inner,” (132) (133)
Form, linguistic:
1. conservatism of, (102-4)
2. differences of, mechanical origin of, (105) (106)
3. elaboration of, reasons for, (102-6)
4. function and, independence of, (59-63) (93) (94)
5. grammatical concepts embodied in, (82-126)
6. grammatical processes embodying, (59-85)
7. permanence of different aspects of, relative, (153-6)
8. twofold consideration of, (59-61)
9. See Structure, linguistic.
Form-classes, (105) (113)
1. See Gender.
Formal units of speech, (33)
“Formlessness, inner,” (132) (133)
Fox (N. Amer.), (74)
French:
1. analytical tendency, (135) (136) (137)
2. esthetic qualities, (241)
3. gender, (102) (104) (113)
4. influence, (205) (206) (207) (208) (209) (210) (211) (212) (215) (216)
5. order, word, (67)
6. plurality, (99)
7. sounds, (51) (212)
8. sounds as words, single, (24)
9. stress, (55) (118)
10. structure, (151) (154)
11. tense forms, (103)
12. verse, (245) (246)
French, Norman, (224)
French people, (224) (225)
Freud, (168)
Fricatives, (52)
Frisian, (175) (224)
Ful (Soudan), (79) (81)
Function, independence of form and, (59-63) (93) (94)
Functional units of speech, (33)
Fusion, (137) (138) (139) (140) (141) (149)
Fusional languages, (147) (150) (151)
1. See Fusion.
Fusional-agglutinative, (148) (150) (151)
Fusional-isolating, (148) (150)
“Fuss, Füsse” (German), history of, (184) (185) (191-3) (197-99)

G

Gaelic, (225)
Gender, (100-2) (113)
German:
1. French influence on, (208) (209) (212)
2. grammatical
3. concepts in sentence, (95)
4. Latin influence on, (206) (208)
5. phonetic drifts, history of, (184) (185) (188) (191-3) (197-9)
6. plurality, (100)
7. relations, (175) (183)
8. sound-imitative words, (6)
9. sounds, (56) (212)
10. tense forms, (103)
11. “umlaut,” (202) (203) (204)
12. unanalyzable words, resistance to, (208) (209)
German, High, (224)
German, Middle High, (184) (185) (192) (204)
German, Old High, (175) (184) (185) (192) (194)
Germanic languages, (175) (183) (184) (185) (186) (206) (212) (226)
Germanic, West, (175) (184) (185) (186) (187) (191) (192) (224)
Germans, (224) (225) (226)
Gesture languages, (20) (21)
Ginneken, Jac van, (40)
Glottal cords, (48)
1. action of, (48-50)
Glottal stop, (49)
Gothic, (82) (175) (184)
Grammar, (39)
Grammatical element, (26-32)
Grammatical concepts. See Concepts, grammatical.
Grammatical processes:
1. classified by, languages, (133-5)
2. particular, development by each language of, (62) (63)
3. types of, (63) (64)
4. variety of, use in one language of, (61) (62)
Greek, dialectic history of, (162)
Greek, classical:
1. affixing, (137)
2. compounds, (67) (68)
3. concord, (121)
4. infixing, (75)
5. influence, (207) (215) (216)
6. pitch accent, (83)
7. plurality, (100)
8. reduplicated perfects, (82) (216)
9. stress, (82) (83)
10. structure, (139) (151) (152)
11. synthetic character, (137)
12. verse, (244) (246)
Greek, modern, (137) (163) (194) (212)

H

Haida (British Columbia), (56) (57) (150) (219) (229)
Hamitic languages (N. Africa), (77) (219) (221)
Hausa (Soudan), (81)
Hebrew, (61) (62) (73) (76) (151) (207)
Heine, (240)
Hesitation, (172) (173) (183)
History, linguistic, (153-6) (7-204)
Hokan languages (N. Amer.), (220) (229)
Hottentot (S. Africa), (55) (70) (80) (81)
Hudson, W.H., (242)
Humming, (50)
Hupa (N. California), (71) (72)
Hupa Indians, (228)

I

Icelandic, Old, (175)
India, languages of, (54)
Indians, American, languages of, (34) (35) (49) (51) (56) (57) (58) (84) (85) (105) (130) (212) (213)
1. See also Algonkin; Athabaskan; Chimariko; Chinook; Eskimo; Fox; Haida; Hokan; Hupa; Iroquois; Karok; Kwakiutl; Nahuatl; Nass; Navaho; Nootka; Ojibwa; Paiute; Sahaptin; Salinan; Shasta; Siouan; Sioux; Takelma; Tlingit; Tsimshian; Washo; Yana; Yokuts; Yurok.
Indo-Chinese languages, (155) (164)
Indo-European, (24) (75) (82) (163) (164) (174) (175) (186) (200) (226)
Indo-Iranian languages, (175) (212)
Infixes, (26) (64) (75) (76)
Inflection. See Inflective languages.
Inflective languages, (130) (136-41) (143) (144) (146) (155)
Influence:
1. cultural, reflected in language, (205-10)
2. morphological, of alien language, (215-17) (220)
3. phonetic, of alien language, (210-15)
Inspiratory sounds, (55)
Interjections, (4) (5)
Irish, (224)
Irish, (78) (79) (163) (218)
Iroquois (N. Amer.), (69) (70)
Isolating languages, (130) (133) (147) (150)
Italian, (54) (55) (137) (163)
“Its,” history of, (167) (176) (177)

J

Japanese, (205) (207)
Jutes, (224)
Juxtaposing. See Agglutination.

K

Karok (N. California), (220) (229)
1. K. Indians, (227)
Khmer. See Cambodgian.
Knowledge, source of, as grammatical category, (115)
Koine, (162)
Kwakiutl (British Columbia), (81) (97) (98)

L

Labial trills, (53)
Language:
1. associations in, (38) (39)
2. associations underlying elements of, (10) (11)
3. auditory cycle in, (17)
4. concepts expressed in, (12)
5. a cultural function, (2) (10)
6. definition of, (7)
7. diversity of, (21-3)
8. elements of, (24-38)
9. emotion expressed in, (39-41)
10. feeling-tones in, (41) (42)
11. grammatical concepts of, (86-126)
12. grammatical processes of, (59-85)
13. historical aspects of, (157-204)
14. imitations of sounds, not evolved from, (5) (6)
15. influences on, exotic, (205-20)
16. interjections, not evolved from, (5)
17. literature and, (236-47)
18. modifications and transfers of typical form of, (17-21)
19. an “overlaid” function, (8)
20. psycho-physical basis of, (8) (9)
21. race, culture and, (221-35)
22. simplification of experience in, (11) (12)
23. sounds of, (43-58)
24. structure of, (127-56)
25. thought and, (12-17) (232) (233)
26. universality of, (21-3)
27. variability of, (157-65)
28. volition expressed in, (39-41)
Larynx, (48-50)
Lateral sounds, (52) (53)
Latin:
1. attribution, (101)
2. concord, (121)
3. infixing, (26) (75)
4. influence of, (206) (207) (215) (216)
5. objective -m, (119) (120)
6. order of words, (65) (66) (123)
7. plurality, (100)
8. prefixes and suffixes, (71)
9. reduplicated perfects, (82) (216)
10. relational concepts expressed, (101) (102)
11. sentence-word, (33) (36)
12. sound as word in, single, (24)
13. structure, (151) (154)
14. style, (243) (244)
15. suffixing character, (134) (137)
16. syntactic nature of sentence, (116) (118)
17. synthetic character, (135) (137)
18. verse, (244) (245) (246)
19. word and element in, analysis of, (27) (29) (30)
Lettish, (49)
Leveling, phonetic, (193) (194) (195)
1. See Analogical leveling.
Lips, (48)
1. action of, (52) (53)
Literature:
1. compensations in, formal, (246) (247)
2. language and, (42) (236-47)
3. levels in, linguistic, (237-41)
4. medium of, language as, (236) (237)
5. science and, (238-40)
Literature, determinants of:
1. linguistic, (240) (241)
2. metrical, (244-6)
3. morphological, (241-4)
4. phonetic, (241)
Lithuanian, (55) (175) (183)
Localism, (161)
Localization of speech, (8) (9)
Loucheux (N. Amer.), (71)
1. L. Indians, (228)
Lungs, (48)
Luther, German of, (192)

M

Malay, (132)
1. M. race, (227)
Malayan, (227)
Malayo-Polynesian languages, (219) (221) (227)
Manchu, (80)
Manx, (225)
“Maus, Mäuse” (German), history of, (184) (185) (191-3)
Mediterranean race, (223)
Melanesian languages, (227) (230)
Meter. See Verse.
Milton, (242)
Mixed-relational languages, (146) (147) (154)
1. complex, (146) (147) (151) (155)
2. simple, (146) (147) (151)
Modality, (90) (91) (92) (93) (114)
Mon-Khmer (S.E. Asia), (219)
Moore, George, (242)
Morphological features, diffusion of, (217-20)
Morphology. See Structure, linguistic.
“Mouse, mice” (English), history of, (184-93)
Munda languages (E. India), (219)
Murmuring, (50)
Mutation, vocalic, (184) (185) (197-9) (203) (204)

N

Nahuatl (Mexico), (69) (70)
Nasal sounds, (51)
“Nasal twang,” (51)
Nasalized stops, (52)
Nass (British Columbia), (62) (81)
Nationality, (222) (227) (228)
Navaho (Arizona, New Mexico), (71) (77) (83) (136)
1. N. Indians, (228)
Nietzsche, (241)
Nootka (Vancouver Id.), (29) (33) (35) (68) (70) (74) (79) (82) (95) (109-11) (135) (141-3) (151)
Nose, (48)
1. action of, (50) (51)
Noun, (123) (124) (126)
Nouns, classification of, (113)
Number, (90) (91) (93) (114)
1. See Plurality.

O

Object, (92) (98)
1. See Personal relations.
Ojibwa (N, Amer.), (55)
Onomatopoetic theory of origin of speech, (5) (6)
Oral sounds, (51-4)
Order, word, (64-6) (91) (92)
1. composition as related to, (67) (68)
2. fixed, English tendency, (177-9)
3. sentence molded by, (117) (118)
4. significance of, fundamental, (119) (120) (123)
Organs of speech, (7) (8) (47) (48)
1. action of, (48-54)

P

Paiute (N. Amer.), (31) (32) (36) (52) (53) (69) (70)
Palate, (48)
1. action of soft, (51)
2. articulations of, (53)
Pali (India), (207)
Papuan languages, (227)
Papuans, (227) (230)
Parts of speech, (123-5) (126)
Pattern:
1. formal, (61) (63) (234) (242)
2. phonetic, (57) (58) (187) (93-6) (99) (200) (206) (211) (214) (215) (220)
Persian, (163) (207)
Person, (114)
Personal relations, (91) (92) (93) (115)
Phonetic adaptation, (210) (211)
Phonetic diffusion, (211-15)
Phonetic law:
1. basis of, (195) (196) (199) (200)
2. direction of, (194) (195) (199)
3. examples of, (186-93)
4. influence of, on morphology, (203) (204)
5. influence of morphology on, (196-9)
6. regularity of, (193) (194)
7. significance of, (186)
8. spread of, slow, (190) (191)
9. See Leveling, phonetic; Pattern, phonetic.
Phonetic processes,
1. form caused by, differences of, (105) (106)
2. parallel drifts in, (184-93) (197-9)
Pitch, grammatical use of, (83-5)
1. metrical use of, (246)
2. production of, (49)
3. significant differences in, (55) (64)
Plains Indians, gesture language of, (20)
“Plattdeutsch,” (224) (225)
Plurality:
1. classification of concept of, variable, (110) (111) (112)
2. a concrete relational category, (99) (100)
3. a derivational or radical concept, (99)
4. expression of, multiple, (38) (62)
5. See Number.
Poles, (225)
Polynesian, (132) (150) (155) (227) (230)
Polynesians, (221) (222) (227) (230)
Polysynthetic languages, (130) (135) (146) (148) (150) (151)
Portuguese, (137)
Predicate, (37) (126)
Prefixes, (26) (64) (70) (71-5)
Prefixing languages, (134) (135)
Preposition, (125)
Psycho-physical aspect of speech, (8) (9)
Pure-relational languages, (145) (147) (154) (155)
1. complex, (145) (147) (150) (155)
2. simple, (145) (147) (150)

Q

Qualifying concepts. See Concepts, derivational.
Quality
1. of speech sounds, (48)
2. of individual’s voice, (48)
Quantity of speech sounds, (55) (64)

R

Race, (221) (222)
1. language and, lack of correspondence between, (227)
2. language and, theoretical relation between, (231-3)
3. language as correlated with, English, (223-7)
4. language, culture and, correspondence between, (230) (231)
5. language, culture and, independence of, (222) (223)
Radical concepts. See Concepts.
Radical element, (26-32)
Radical word, (28) (29)
“Reading from the lips,” (19)
Reduplication, (64) (79-82)
Reference, definite and indefinite, (89) (90)
Repetition of stem, (26)
1. See Reduplication.
Repression of impulse, (167) (168)
Rhyme, (245) (246)
Rolled consonants, (53)
Romance languages, (137)
Root, (25)
Roumanian, (137)
Rounded vowels, (52)
Russian, (44) (45) (54) (71) (80) (163) (212)

S

Sahaptin languages (N. Amer.), (220)
Salinan (S.W. California), (150) (155)
Sanskrit (India), (54) (75) (82) (151) (154) (175) (200) (207) (209) (210)
Sarcee Indians, (228)
Saxon:
1. Low, (224)
2. Old, (175)
3. Upper, (225)
Saxons, (224) (225)
Scandinavian, (224)
1. See Danish; Icelandic; Swedish.
Scandinavians, (224)
Scotch, (224) (226)
Scotch, Lowland, (188)
Semitic languages, (61) (68) (76) (134) (151) (219) (228)
Sentence, (33) (36-8)
1. binding words into, methods of, (115-17)
2. stress in, influence of, (118) (119)
3. word-order in, (117) (118)
Sequence. See Order of words.
Shakespeare:
1. art of, (238) (240)
2. English of, (188) (189) (191)
Shasta (N. California), (220)
Shilh (Morocco), (77) (81)
Shilluk (Nile headwaters), (84) (150) (154) (155)
Siamese, (55) (66) (70) (207)
Singing, (50)
Siouan languages (N. Amer.), (76)
Sioux (Dakota), (29) (76) (95) (150)
Slavic languages, (212)
Slavs, (225)
Somali (E. Africa), (77) (80) (81)
Soudanese languages, (84) (154) (155) (163)
Sound-imitative words, (4) (5) (6) (80)
Sounds of speech, (24)
1. adjustments involved in, muscular, (46)
2. adjustments involved in certain, inhibition of, (46) (47)
3. basic importance of, (43)
4. classification of, (54) (54)
5. combinations of, (56)
6. conditioned appearance of, (56) (57)
7. dynamics of, (55) (56)
8. illusory feelings in regard to, (43-5)
9. “inner” or “ideal” system of, (57) (58)
10. place in phonetic pattern of, (194-6)
11. production of, (47-54)
12. values of, psychological, (56-8)
13. variability of, (45) (46)
Spanish, (137)
Speech. See Language.
Spirants, (52)
Splitting of sounds, (193) (195)
Stem, (26)
Stock, linguistic, (163-5) (218) (221)
Stopped consonants (or stops), (52)
Stress. See Accent.
Structure, linguistic, (127-56)
1. conservatism of, (200)
2. differences of, (127) (128)
3. intuitional forms of, (153) (154)
Structure, linguistic, types of:
1. classification of, by character of concepts, (143-7)
2. by degree of fusion, (136-43)
3. by degree of synthesis, (135) (136)
4. by formal processes, (133-5)
5. from threefold standpoint, (147-9) (154)
6. into “formal” and “formless,” (132) (133)
7. classifying, difficulties in, (129-32) (149)
8. examples of, (149-51)
9. mixed, (148)
10. reality of, (128) (129) (149) (152) (153)
11. validity of conceptual, historical test of, (152-6)
Style, (38) (216) (242-4)
Subject, (92) (98)
1. See Personal relations.
Subject of discourse, (37) (126)
Suffixes, (26) (64)
Suffixing, (61) (70) (71-5)
Suffixing languages, (134) (135)
Survivals, morphological, (149) (152) (202) (218) (219)
Swedish, (55) (110) (175)
Swinburne, (238) (240)
Swiss, French, (225)
Syllabifying, (56)
Symbolic languages, (133) (134) (147) (150) (151)
Symbolic processes, (134) (138) (139) (140)
Symbolic-fusional, (151)
Symbolic-isolating, (148)
Symons, (245)
Syntactic adhesions, (117) (118)
Syntactic relations:
1. primary methods of expressing, (119) (120)
2. transfer of values in, (120)
3. See Concepts, relational; Concord; Order, word; Personal relations; Sentence.
Synthetic tendency, (69) (135) (136) (137) (148) (150) (151) (154)

T

Takelma (S.W. Oregon), (81) (82) (84) (85) (151) (152) (220)
Teeth, (48)
1. articulations of, (53) (54)
Telegraph code, (20)
Temperament, (231) (232)
Tense, (91) (93) (114)
Teutonic race. See Baltic race.
Thinking, types of, (17) (18)
Thought, relation of language to, (12-17) (232) (233)
Throat, (48)
1. articulations of, (49) (50) (53)
Tibetan, (80) (102) (112) (124) (125) (136) (143) (144) (150) (154) (155) (209) (210)
Time. See Tense.
Tlingit (S. Alaska), (84) (134) (135) (219) (229)
1. T. Indians, (230)
Tongue, (48)
1. action of, (52) (53) (54)
Transfer, types of linguistic, (18-21)
Trills, (53)
Tsimshian (British Columbia), (70) (80) (81)
1. See Nass.
Turkish, (70) (135) (150) (207) (212)
Types, linguistic, change of, (153-6)
1. See Structure, linguistic.

U

Ugro-Finnic, (212)
“Umlaut.” See Mutation, vocalic.
United States:
1. culture in, (209)
2. race in, (223)
Ural-Altaic languages, (212)
Uvula, (48) (53)

V

Values:
1. “hesitation,” (173)
2. morphologic, (131) (132)
3. phonetic, (56-8)
4. variability in, of components of drift, (172) (173)
Variations, linguistic:
1. dialect, (157-65)
2. historical, (160-204)
3. individual, (157-9) (165) (199)
Verb, (123) (124) (126)
1. syntactic relations expressed in, (115)
Verhaeren, (245)
Verse:
1. accentual, (244) (245)
2. linguistic determinants of, (242-6)
3. quantitative, (244) (245)
4. syllabic, (244) (245)
Vocalic change, (26) (61) (64) (76-8)
1. See Mutation, vocalic.
Voice, production of, (50)
Voiced sounds, (50)
Voiceless:
1. laterals, (53)
2. nasals, (51)
3. sounds, (49) (50)
4. trills, (53)
5. vowels, (52)
“Voicelessness,” production of, (49)
Volition expressed in speech, (38) (39)
Vowels, (52)

W

Walking, a biological function, (1) (2)
Washo (Nevada), (81)
Welsh, (51) (53) (225)
Westermann, D., (154)
Whisper, (50)
Whitman, (239)
“Whom,” use and drift of, (166-74)
Word, (25-8)
1. definition of, (32-6)
2. syntactic origin of complex, (117) (118)
3. “twilight” type of, (28) (29)
4. types of, formal, (29-32)
Written language, (19) (20)

Y

Yana (N. California), (69) (70) (74) (76) (96) (105) (111) (112) (126) (150) (155)
Yiddish, (204)
Yokuts (S. California), (77) (78)
Yurok (N.W. California), (229)
1. Y. Indians, (228)

Z

Zaconic dialect of Greek, (162)

Footnote 1: We shall reserve capitals for radical elements.

Footnote 2: These words are not here used in a narrowly technical sense.

Footnote 3: It is not a question of the general isolating character of such languages as Chinese (see Chapter VI). Radical-words may and do occur in languages of all varieties, many of them of a high degree of complexity.

Footnote 4: Spoken by a group of Indian tribes in Vancouver Island.

Footnote 5: In this and other examples taken from exotic languages I am forced by practical considerations to simplify the actual phonetic forms. This should not matter perceptibly, as we are concerned with form as such, not with phonetic content.

Footnote 6: These oral experiences, which I have had time and again as a field student of American Indian languages, are very neatly confirmed by personal experiences of another sort. Twice I have taught intelligent young Indians to write their own languages according to the phonetic system which I employ. They were taught merely how to render accurately the sounds as such. Both had some difficulty in learning to break up a word into its constituent sounds, but none whatever in determining the words. This they both did with spontaneous and complete accuracy. In the hundreds of pages of manuscript Nootka text that I have obtained from one of these young Indians the words, whether abstract relational entities like English that and but or complex sentence-words like the Nootka example quoted above, are, practically without exception, isolated precisely as I or any other student would have isolated them. Such experiences with naïve speakers and recorders do more to convince one of the definitely plastic unity of the word than any amount of purely theoretical argument.

Footnote 7: “Coördinate sentences” like I shall remain but you may go may only doubtfully be considered as truly unified predications, as true sentences. They are sentences in a stylistic sense rather than from the strictly formal linguistic standpoint. The orthography I shall remain. But you may go is as intrinsically justified as I shall remain. Now you may go. The closer connection in sentiment between the first two propositions has led to a conventional visual representation that must not deceive the analytic spirit.

Footnote 8: Except, possibly, in a newspaper headline. Such headlines, however, are language only in a derived sense.

Footnote 9: E.g., the brilliant Dutch writer, Jac van Ginneken.

Footnote 10: Observe the “voluntary.” When we shout or grunt or otherwise allow our voices to take care of themselves, as we are likely to do when alone in the country on a fine spring day, we are no longer fixing vocal adjustments by voluntary control. Under these circumstances we are almost certain to hit on speech sounds that we could never learn to control in actual speech.

Footnote 11: If speech, in its acoustic and articulatory aspect, is indeed a rigid system, how comes it, one may plausibly object, that no two people speak alike? The answer is simple. All that part of speech which falls out of the rigid articulatory framework is not speech in idea, but is merely a superadded, more or less instinctively determined vocal complication inseparable from speech in practice. All the individual color of speech—personal emphasis, speed, personal cadence, personal pitch—is a non-linguistic fact, just as the incidental expression of desire and emotion are, for the most part, alien to linguistic expression. Speech, like all elements of culture, demands conceptual selection, inhibition of the randomness of instinctive behavior. That its “idea” is never realized as such in practice, its carriers being instinctively animated organisms, is of course true of each and every aspect of culture.

Footnote 12: Purely acoustic classifications, such as more easily suggest themselves to a first attempt at analysis, are now in less favor among students of phonetics than organic classifications. The latter have the advantage of being more objective. Moreover, the acoustic quality of a sound is dependent on the articulation, even though in linguistic consciousness this quality is the primary, not the secondary, fact.

Footnote 13: By “quality” is here meant the inherent nature and resonance of the sound as such. The general “quality” of the individual’s voice is another matter altogether. This is chiefly determined by the individual anatomical characteristics of the larynx and is of no linguistic interest whatever.

Footnote 14: As at the end of the snappily pronounced no! (sometimes written nope!) or in the over-carefully pronounced at all, where one may hear a slight check between the t and the a.

Footnote 15: “Singing” is here used in a wide sense. One cannot sing continuously on such a sound as b or d, but one may easily outline a tune on a series of b’s or d’s in the manner of the plucked “pizzicato” on stringed instruments. A series of tones executed on continuant consonants, like m, z, or l, gives the effect of humming, droning, or buzzing. The sound of “humming,” indeed, is nothing but a continuous voiced nasal, held on one pitch or varying in pitch, as desired.

Footnote 16: The whisper of ordinary speech is a combination of unvoiced sounds and “whispered” sounds, as the term is understood in phonetics.

Footnote 17: Aside from the involuntary nasalizing of all voiced sounds in the speech of those that talk with a “nasal twang.”

Footnote 18: These may be also defined as free unvoiced breath with varying vocalic timbres. In the long Paiute word quoted on page 31 the first u and the final ü are pronounced without voice.

Footnote 19: Nasalized stops, say m or n, can naturally not be truly “stopped,” as there is no way of checking the stream of breath in the nose by a definite articulation.

Footnote 20: The lips also may theoretically so articulate. “Labial trills,” however, are certainly rare in natural speech.

Footnote 21: This position, known as “faucal,” is not common.

Footnote 22: “Points of articulation” must be understood to include tongue and lip positions of the vowels.

Footnote 23: Including, under the fourth category, a number of special resonance adjustments that we have not been able to take up specifically.

Footnote 24: In so far, it should be added, as these sounds are expiratory, i.e., pronounced with the outgoing breath. Certain languages, like the South African Hottentot and Bushman, have also a number of inspiratory sounds, pronounced by sucking in the breath at various points of oral contact. These are the so-called “clicks.”

Footnote 25: The conception of the ideal phonetic system, the phonetic pattern, of a language is not as well understood by linguistic students as it should be. In this respect the unschooled recorder of language, provided he has a good ear and a genuine instinct for language, is often at a great advantage as compared with the minute phonetician, who is apt to be swamped by his mass of observations. I have already employed my experience in teaching Indians to write their own language for its testing value in another connection. It yields equally valuable evidence here. I found that it was difficult or impossible to teach an Indian to make phonetic distinctions that did not correspond to “points in the pattern of his language,” however these differences might strike our objective ear, but that subtle, barely audible, phonetic differences, if only they hit the “points in the pattern,” were easily and voluntarily expressed in writing. In watching my Nootka interpreter write his language, I often had the curious feeling that he was transcribing an ideal flow of phonetic elements which he heard, inadequately from a purely objective standpoint, as the intention of the actual rumble of speech.

Footnote 26: For the symbolism, see chapter II.

Footnote 27: “Plural” is here a symbol for any prefix indicating plurality.

Footnote 28: The language of the Aztecs, still spoken in large parts of Mexico.

Footnote 29: Indian language of British Columbia closely related to the Nass already cited.

Footnote 30: Including such languages as Navaho, Apache, Hupa, Carrier, Chipewyan, Loucheux.

Footnote 31: This may seem surprising to an English reader. We generally think of time as a function that is appropriately expressed in a purely formal manner. This notion is due to the bias that Latin grammar has given us. As a matter of fact the English future (I shall go) is not expressed by affixing at all; moreover, it may be expressed by the present, as in to-morrow I leave this place, where the temporal function is inherent in the independent adverb. Though in lesser degree, the Hupa -te is as irrelevant to the vital word as is to-morrow to the grammatical “feel” of I leave.

Footnote 32: Wishram dialect.

Footnote 33: Really “him,” but Chinook, like Latin or French, possesses grammatical gender. An object may be referred to as “he,” “she,” or “it,” according to the characteristic form of its noun.

Footnote 34: This analysis is doubtful. It is likely that -n- possesses a function that still remains to be ascertained. The Algonkin languages are unusually complex and present many unsolved problems of detail.

Footnote 35: “Secondary stems” are elements which are suffixes from a formal point of view, never appearing without the support of a true radical element, but whose function is as concrete, to all intents and purposes, as that of the radical element itself. Secondary verb stems of this type are characteristic of the Algonkin languages and of Yana.

Footnote 36: In the Algonkin languages all persons and things are conceived of as either animate or inanimate, just as in Latin or German they are conceived of as masculine, feminine, or neuter.

Footnote 37: Egyptian dialect.

Footnote 38: There are changes of accent and vocalic quantity in these forms as well, but the requirements of simplicity force us to neglect them.

Footnote 39: A Berber language of Morocco.

Footnote 40: Some of the Berber languages allow consonantal combinations that seem unpronounceable to us.

Footnote 41: One of the Hamitic languages of eastern Africa.

Footnote 42: See page 49.

Footnote 43: Spoken in the south-central part of California.

Footnote 44: See page 50.

Footnote 45: These orthographies are but makeshifts for simple sounds.

Footnote 46: Whence our ping-pong.

Footnote 47: An African language of the Guinea Coast.

Footnote 48: In the verbal adjective the tone of the second syllable differs from that of the first.

Footnote 49: Initial “click” (see page 55, note 15) omitted. Transcriber's Note: This footnote has been updated to Footnote 24.

Footnote 50: An Indian language of Nevada.

Footnote 51: An Indian language of Oregon.

Footnote 52: It is not unlikely, however, that these Athabaskan alternations are primarily tonal in character.

Footnote 53: Not in its technical sense.

Footnote 54: It is, of course, an “accident” that -s denotes plurality in the noun, singularity in the verb.

Footnote 55: “To cause to be dead” or “to cause to die” in the sense of “to kill” is an exceedingly wide-spread usage. It is found, for instance, also in Nootka and Sioux.

Footnote 56: Agriculture was not practised by the Yana. The verbal idea of “to farm” would probably be expressed in some such synthetic manner as “to dig-earth” or “to grow-cause.” There are suffixed elements corresponding to -er and -ling.

Footnote 57: “Doer,” not “done to.” This is a necessarily clumsy tag to represent the “nominative” (subjective) in contrast to the “accusative” (objective).

Footnote 58: I.e., not you or I.

Footnote 59: By “case” is here meant not only the subjective-objective relation but also that of attribution.

Footnote 60: Except in so far as Latin uses this method as a rather awkward, roundabout method of establishing the attribution of the color to the particular object or person. In effect one cannot in Latin directly say that a person is white, merely that what is white is identical with the person who is, acts, or is acted upon in such and such a manner. In origin the feel of the Latin illa alba femina is really “that-one, the-white-one, (namely) the-woman”—three substantive ideas that are related to each other by a juxtaposition intended to convey an identity. English and Chinese express the attribution directly by means of order. In Latin the illa and alba may occupy almost any position in the sentence. It is important to observe that the subjective form of illa and alba, does not truly define a relation of these qualifying concepts to femina. Such a relation might be formally expressed via an attributive case, say the genitive (woman of whiteness). In Tibetan both the methods of order and of true case relation may be employed: woman white (i.e., “white woman”) or white-of woman (i.e., “woman of whiteness, woman who is white, white woman”).

Footnote 61: Aside, naturally, from the life and imminence that may be created for such a sentence by a particular context.

Footnote 62: This has largely happened in popular French and German, where the difference is stylistic rather than functional. The preterits are more literary or formal in tone than the perfects.

Footnote 63: Hence, “the square root of 4 is 2,” precisely as “my uncle is here now.” There are many “primitive” languages that are more philosophical and distinguish between a true “present” and a “customary” or “general” tense.

Footnote 64: Except, of course, the fundamental selection and contrast necessarily implied in defining one concept as against another. “Man” and “white” possess an inherent relation to “woman” and “black,” but it is a relation of conceptual content only and is of no direct interest to grammar.

Footnote 65: Thus, the -er of farmer may he defined as indicating that particular substantive concept (object or thing) that serves as the habitual subject of the particular verb to which it is affixed. This relation of “subject” (a farmer farms) is inherent in and specific to the word; it does not exist for the sentence as a whole. In the same way the -ling of duckling defines a specific relation of attribution that concerns only the radical element, not the sentence.

Footnote 66: It is precisely the failure to feel the “value” or “tone,” as distinct from the outer significance, of the concept expressed by a given grammatical element that has so often led students to misunderstand the nature of languages profoundly alien to their own. Not everything that calls itself “tense” or “mode” or “number” or “gender” or “person” is genuinely comparable to what we mean by these terms in Latin or French.

Footnote 67: Suffixed articles occur also in Danish and Swedish and in numerous other languages. The Nootka element for “in the house” differs from our “house-” in that it is suffixed and cannot occur as an independent word; nor is it related to the Nootka word for “house.”

Footnote 68: Assuming the existence of a word “firelet.”

Footnote 69: The Nootka diminutive is doubtless more of a feeling-element, an element of nuance, than our -ling. This is shown by the fact that it may be used with verbs as well as with nouns. In speaking to a child, one is likely to add the diminutive to any word in the sentence, regardless of whether there is an inherent diminutive meaning in the word or not.

Footnote 70: -si is the third person of the present tense. -hau- “east” is an affix, not a compounded radical element.

Footnote 71: These are classical, not modern colloquial, forms.

Footnote 72: Just as in English “He has written books” makes no commitment on the score of quantity (“a few, several, many”).

Footnote 73: Such as person class, animal class, instrument class, augmentative class.

Footnote 74: A term borrowed from Slavic grammar. It indicates the lapse of action, its nature from the standpoint of continuity. Our “cry” is indefinite as to aspect, “be crying” is durative, “cry put” is momentaneous, “burst into tears” is inceptive, “keep crying” is continuative, “start in crying” is durative-inceptive, “cry now and again” is iterative, “cry out every now and then” or “cry in fits and starts” is momentaneous-iterative. “To put on a coat” is momentaneous, “to wear a coat” is resultative. As our examples show, aspect is expressed in English by all kinds of idiomatic turns rather than by a consistently worked out set of grammatical forms. In many languages aspect is of far greater formal significance than tense, with which the naïve student is apt to confuse it.

Footnote 75: By “modalities” I do not mean the matter of fact statement, say, of negation or uncertainty as such, rather their implication in terms of form. There are languages, for instance, which have as elaborate an apparatus of negative forms for the verb as Greek has of the optative or wish-modality.

Footnote 76: Compare page 97.

Footnote 77: It is because of this classification of experience that in many languages the verb forms which are proper, say, to a mythical narration differ from those commonly used in daily intercourse. We leave these shades to the context or content ourselves with a more explicit and roundabout mode of expression, e.g., “He is dead, as I happen to know,” “They say he is dead,” “He must be dead by the looks of things.”

Footnote 78: We say “I sleep” and “I go,” as well as “I kill him,” but “he kills me.” Yet me of the last example is at least as close psychologically to I of “I sleep” as is the latter to I of “I kill him.” It is only by form that we can classify the “I” notion of “I sleep” as that of an acting subject. Properly speaking, I am handled by forces beyond my control when I sleep just as truly as when some one is killing me. Numerous languages differentiate clearly between active subject and static subject (I go and I kill him as distinct from I sleep, I am good, I am killed) or between transitive subject and intransitive subject (I kill him as distinct from I sleep, I am good, I am killed, I go). The intransitive or static subjects may or may not be identical with the object of the transitive verb.

Footnote 79: Ultimately, also historical—say, age to “act that (one).”

Footnote 80: For with in the sense of “against,” compare German wider “against.”

Footnote 81: Cf. Latin ire “to go”; also our English idiom “I have to go,” i.e., “must go.”

Footnote 82: In Chinese no less than in English.

Footnote 83: By “originally” I mean, of course, some time antedating the earliest period of the Indo-European languages that we can get at by comparative evidence.

Footnote 84: Perhaps it was a noun-classifying element of some sort.

Footnote 85: Compare its close historical parallel off.

Footnote 86: “Ablative” at last analysis.

Footnote 87: Very likely pitch should be understood along with stress.

Footnote 88: As in Bantu or Chinook.

Footnote 89: Perhaps better “general.” The Chinook “neuter” may refer to persons as well as things and may also be used as a plural. “Masculine” and “feminine,” as in German and French, include a great number of inanimate nouns.

Footnote 90: Spoken in the greater part of the southern half of Africa. Chinook is spoken in a number of dialects in the lower Columbia River valley. It is impressive to observe how the human mind has arrived at the same form of expression in two such historically unconnected regions.

Footnote 91: In Yana the noun and the verb are well distinct, though there are certain features that they hold in common which tend to draw them nearer to each other than we feel to be possible. But there are, strictly speaking, no other parts of speech. The adjective is a verb. So are the numeral, the interrogative pronoun (e.g., “to be what?”), and certain “conjunctions” and adverbs (e.g., “to be and” and “to be not”; one says “and-past-I go,” i.e., “and I went”). Adverbs and prepositions are either nouns or merely derivative affixes in the verb.

Footnote 92: If possible, a triune formula.

Footnote 93: One celebrated American writer on culture and language delivered himself of the dictum that, estimable as the speakers of agglutinative languages might be, it was nevertheless a crime for an inflecting woman to marry an agglutinating man. Tremendous spiritual values were evidently at stake. Champions of the “inflective” languages are wont to glory in the very irrationalities of Latin and Greek, except when it suits them to emphasize their profoundly “logical” character. Yet the sober logic of Turkish or Chinese leaves them cold. The glorious irrationalities and formal complexities of many “savage” languages they have no stomach for. Sentimentalists are difficult people.

Footnote 94: I have in mind valuations of form as such. Whether or not a language has a large and useful vocabulary is another matter. The actual size of a vocabulary at a given time is not a thing of real interest to the linguist, as all languages have the resources at their disposal for the creation of new words, should need for them arise. Furthermore, we are not in the least concerned with whether or not a language is of great practical value or is the medium of a great culture. All these considerations, important from other standpoints, have nothing to do with form value.

Footnote 95: E.g., Malay, Polynesian.

Footnote 96: Where, as we have seen, the syntactic relations are by no means free from an alloy of the concrete.

Footnote 97: Very much as an English cod-liver oil dodges to some extent the task of explicitly defining the relations of the three nouns. Contrast French huile de foie de morue “oil of liver of cod.”

Footnote 98: See Chapter IV.

Footnote 99: There is probably a real psychological connection between symbolism and such significant alternations as drink, drank, drunk or Chinese mai (with rising tone) “to buy” and mai (with falling tone) “to sell.” The unconscious tendency toward symbolism is justly emphasized by recent psychological literature. Personally I feel that the passage from sing to sang has very much the same feeling as the alternation of symbolic colors—e.g., green for safe, red for danger. But we probably differ greatly as to the intensity with which we feel symbolism in linguistic changes of this type.

Footnote 100: Pure or “concrete relational.” See Chapter V.

Footnote 101: In spite of my reluctance to emphasize the difference between a prefixing and a suffixing language, I feel that there is more involved in this difference than linguists have generally recognized. It seems to me that there is a rather important psychological distinction between a language that settles the formal status of a radical element before announcing it—and this, in effect, is what such languages as Tlingit and Chinook and Bantu are in the habit of doing—and one that begins with the concrete nucleus of a word and defines the status of this nucleus by successive limitations, each curtailing in some degree the generality of all that precedes. The spirit of the former method has something diagrammatic or architectural about it, the latter is a method of pruning afterthoughts. In the more highly wrought prefixing languages the word is apt to affect us as a crystallization of floating elements, the words of the typical suffixing languages (Turkish, Eskimo, Nootka) are “determinative” formations, each added element determining the form of the whole anew. It is so difficult in practice to apply these elusive, yet important, distinctions that an elementary study has no recourse but to ignore them.

Footnote 102: English, however, is only analytic in tendency. Relatively to French, it is still fairly synthetic, at least in certain aspects.

Footnote 103: The former process is demonstrable for English, French, Danish, Tibetan, Chinese, and a host of other languages. The latter tendency may be proven, I believe, for a number of American Indian languages, e.g., Chinook, Navaho. Underneath their present moderately polysynthetic form is discernible an analytic base that in the one case may be roughly described as English-like, in the other, Tibetan-like.

Footnote 104: This applies more particularly to the Romance group: Italian, Spanish, Portuguese, French, Roumanian. Modern Greek is not so clearly analytic.

Footnote 105: See pages 133, 134.

Footnote 106: The following formulae may prove useful to those that are mathematically inclined. Agglutination: c = a + b; regular fusion: c = a + (b - x) + x; irregular fusion: c = (a - x) + (b - y) + (x + y); symbolism: c = (a - x) + x. I do not wish to imply that there is any mystic value in the process of fusion. It is quite likely to have developed as a purely mechanical product of phonetic forces that brought about irregularities of various sorts.

Footnote 107: See page 110.

Footnote 108: See Chapter V.

Footnote 109: If we deny the application of the term “inflective” to fusing languages that express the syntactic relations in pure form, that is, without the admixture of such concepts as number, gender, and tense, merely because such admixture is familiar to us in Latin and Greek, we make of “inflection” an even more arbitrary concept than it need be. At the same time it is true that the method of fusion itself tends to break down the wall between our conceptual groups II and IV, to create group III. Yet the possibility of such “inflective” languages should not be denied. In modern Tibetan, for instance, in which concepts of group II are but weakly expressed, if at all, and in which the relational concepts (e.g., the genitive, the agentive or instrumental) are expressed without alloy of the material, we get many interesting examples of fusion, even of symbolism. Mi di, e.g., “man this, the man” is an absolutive form which may be used as the subject of an intransitive verb. When the verb is transitive (really passive), the (logical) subject has to take the agentive form. Mi di then becomes mi di “by the man,” the vowel of the demonstrative pronoun (or article) being merely lengthened. (There is probably also a change in the tone of the syllable.) This, of course, is of the very essence of inflection. It is an amusing commentary on the insufficiency of our current linguistic classification, which considers “inflective” and “isolating” as worlds asunder, that modern Tibetan may be not inaptly described as an isolating language, aside from such examples of fusion and symbolism as the foregoing.

Footnote 110: I am eliminating entirely the possibility of compounding two or more radical elements into single words or word-like phrases (see pages 67-70). To expressly consider compounding in the present survey of types would be to complicate our problem unduly. Most languages that possess no derivational affixes of any sort may nevertheless freely compound radical elements (independent words). Such compounds often have a fixity that simulates the unity of single words.

Footnote 111: We may assume that in these languages and in those of type D all or most of the relational concepts are expressed in “mixed” form, that such a concept as that of subjectivity, for instance, cannot be expressed without simultaneously involving number or gender or that an active verb form must be possessed of a definite tense. Hence group III will be understood to include, or rather absorb, group IV. Theoretically, of course, certain relational concepts may be expressed pure, others mixed, but in practice it will not be found easy to make the distinction.

Footnote 112: The line between types C and D cannot be very sharply drawn. It is a matter largely of degree. A language of markedly mixed-relational type, but of little power of derivation pure and simple, such as Bantu or French, may be conveniently put into type C, even though it is not devoid of a number of derivational affixes. Roughly speaking, languages of type C may be considered as highly analytic (“purified”) forms of type D.

Footnote 113: In defining the type to which a language belongs one must be careful not to be misled by structural features which are mere survivals of an older stage, which have no productive life and do not enter into the unconscious patterning of the language. All languages are littered with such petrified bodies. The English -ster of spinster and Webster is an old agentive suffix, but, as far as the feeling of the present English-speaking generation is concerned, it cannot be said to really exist at all; spinster and Webster have been completely disconnected from the etymological group of spin and of weave (web). Similarly, there are hosts of related words in Chinese which differ in the initial consonant, the vowel, the tone, or in the presence or absence of a final consonant. Even where the Chinaman feels the etymological relationship, as in certain cases he can hardly help doing, he can assign no particular function to the phonetic variation as such. Hence it forms no live feature of the language-mechanism and must be ignored in defining the general form of the language. The caution is all the more necessary, as it is precisely the foreigner, who approaches a new language with a certain prying inquisitiveness, that is most apt to see life in vestigial features which the native is either completely unaware of or feels merely as dead form.

Footnote 114: Might nearly as well have come under D.

Footnote 115: Very nearly complex pure-relational.

Footnote 116: Not Greek specifically, of course, but as a typical representative of Indo-European.

Footnote 117: Such, in other words, as can be shown by documentary or comparative evidence to have been derived from a common source. See Chapter VII.

Footnote 118: These are far-eastern and far-western representatives of the “Soudan” group recently proposed by D. Westermann. The genetic relationship between Ewe and Shilluk is exceedingly remote at best.

Footnote 119: This case is doubtful at that. I have put French in C rather than in D with considerable misgivings. Everything depends on how one evaluates elements like -al in national, -té in bonté, or re- in retourner. They are common enough, but are they as alive, as little petrified or bookish, as our English -ness and -ful and un-?

Footnote 120: In spite of its more isolating cast.

Footnote 121: In a book of this sort it is naturally impossible to give an adequate idea of linguistic structure in its varying forms. Only a few schematic indications are possible. A separate volume would be needed to breathe life into the scheme. Such a volume would point out the salient structural characteristics of a number of languages, so selected as to give the reader an insight into the formal economy of strikingly divergent types.

Footnote 122: In so far as they do not fall out of the normal speech group by reason of a marked speech defect or because they are isolated foreigners that have acquired the language late in life.

Footnote 123: Observe that we are speaking of an individual’s speech as a whole. It is not a question of isolating some particular peculiarity of pronunciation or usage and noting its resemblance to or identity with a feature in another dialect.

Footnote 124: It is doubtful if we have the right to speak of linguistic uniformity even during the predominance of the Koine. It is hardly conceivable that when the various groups of non-Attic Greeks took on the Koine they did not at once tinge it with dialectic peculiarities induced by their previous speech habits.

Footnote 125: The Zaconic dialect of Lacedaemon is the sole exception. It is not derived from the Koine, but stems directly from the Doric dialect of Sparta.

Footnote 126: Though indications are not lacking of what these remoter kin of the Indo-European languages may be. This is disputed ground, however, and hardly fit subject for a purely general study of speech.

Footnote 127: “Dialect” in contrast to an accepted literary norm is a use of the term that we are not considering.

Footnote 128: Spoken in France and Spain in the region of the Pyrenees.

Footnote 129: Or rather apprehended, for we do not, in sober fact, entirely understand it as yet.

Footnote 130: Not ultimately random, of course, only relatively so.

Footnote 131: In relative clauses too we tend to avoid the objective form of “who.” Instead of “The man whom I saw” we are likely to say “The man that I saw” or “The man I saw.”

Footnote 132: “Its” was at one time as impertinent a departure as the “who” of “Who did you see?” It forced itself into English because the old cleavage between masculine, feminine, and neuter was being slowly and powerfully supplemented by a new one between thing-class and animate-class. The latter classification proved too vital to allow usage to couple males and things (“his”) as against females (“her”). The form “its” had to be created on the analogy of words like “man’s,” to satisfy the growing form feeling. The drift was strong enough to sanction a grammatical blunder.

Footnote 133: Psychoanalysts will recognize the mechanism. The mechanisms of “repression of impulse” and of its symptomatic symbolization can be illustrated in the most unexpected corners of individual and group psychology. A more general psychology than Freud’s will eventually prove them to be as applicable to the groping for abstract form, the logical or esthetic ordering of experience, as to the life of the fundamental instincts.

Footnote 134: Note that it is different with whose. This has not the support of analogous possessive forms in its own functional group, but the analogical power of the great body of possessives of nouns (man’s, boy’s) as well as of certain personal pronouns (his, its; as predicated possessive also hers, yours, theirs) is sufficient to give it vitality.

Footnote 135: Aside from certain idiomatic usages, as when You saw whom? is equivalent to You saw so and so and that so and so is who? In such sentences whom is pronounced high and lingeringly to emphasize the fact that the person just referred to by the listener is not known or recognized.

Footnote 136: Students of language cannot be entirely normal in their attitude towards their own speech. Perhaps it would be better to say “naïve” than “normal.”

Footnote 137: It is probably this variability of value in the significant compounds of a general linguistic drift that is responsible for the rise of dialectic variations. Each dialect continues the general drift of the common parent, but has not been able to hold fast to constant values for each component of the drift. Deviations as to the drift itself, at first slight, later cumulative, are therefore unavoidable.

Footnote 138: Most sentences beginning with interrogative whom are likely to be followed by did or does, do. Yet not all.

Footnote 139: Better, indeed, than in our oldest Latin and Greek records. The old Indo-Iranian languages alone (Sanskrit, Avestan) show an equally or more archaic status of the Indo-European parent tongue as regards case forms.

Footnote 140: Should its eventually drop out, it will have had a curious history. It will have played the rôle of a stop-gap between his in its non-personal use (see footnote 11, page 167) and the later analytic of it. Transcriber's Note: This footnote has been renumbered to Footnote 132.

Footnote 141: Except in so far as that has absorbed other functions than such as originally belonged to it. It was only a nominative-accusative neuter to begin with.

Footnote 142: Aside from the interrogative: am I? is he? Emphasis counts for something. There is a strong tendency for the old “objective” forms to bear a stronger stress than the “subjective” forms. This is why the stress in locutions like He didn’t go, did he? and isn’t he? is thrown back on the verb; it is not a matter of logical emphasis.

Footnote 143: They: them as an inanimate group may be looked upon as a kind of borrowing from the animate, to which, in feeling, it more properly belongs.

Footnote 144: See page 155.

Footnote 145: I have changed the Old and Middle High German orthography slightly in order to bring it into accord with modern usage. These purely orthographical changes are immaterial. The u of mus is a long vowel, very nearly like the oo of English moose.

Footnote 146: The vowels of these four words are long; o as in rode, e like a of fade, u like oo of brood, y like German ü.

Footnote 147: Or rather stage in a drift.

Footnote 148: Anglo-Saxon fet is “unrounded” from an older föt, which is phonetically related to fot precisely as is mys (i.e., müs) to mus. Middle High German ue (Modern German u) did not develop from an “umlauted” prototype of Old High German uo and Anglo-Saxon o, but was based directly on the dialectic uo. The unaffected prototype was long o. Had this been affected in the earliest Germanic or West-Germanic period, we should have had a pre-German alternation fot: föti; this older ö could not well have resulted in ue. Fortunately we do not need inferential evidence in this case, yet inferential comparative methods, if handled with care, may be exceedingly useful. They are indeed indispensable to the historian of language.

Footnote 149: See page 133.

Footnote 150: Primitive Germanic fot(s), fotiz, mus, musiz; Indo-European pods, podes, mus, muses. The vowels of the first syllables are all long.

Footnote 151: Or in that unconscious sound patterning which is ever on the point of becoming conscious. See page 57.

Footnote 152: As have most Dutch and German dialects.

Footnote 153: At least in America.

Footnote 154: It is possible that other than purely phonetic factors are also at work in the history of these vowels.

Footnote 155: The orthography is roughly phonetic. Pronounce all accented vowels long except where otherwise indicated, unaccented vowels short; give continental values to vowels, not present English ones.

Footnote 156: After I. the numbers are not meant to correspond chronologically to those of the English table. The orthography is again roughly phonetic.

Footnote 157: I use ss to indicate a peculiar long, voiceless s-sound that was etymologically and phonetically distinct from the old Germanic s. It always goes back to an old t. In the old sources it is generally written as a variant of z, though it is not to be confused with the modern German z (= ts). It was probably a dental (lisped) s.

Footnote 158: Z is to be understood as French or English z, not in its German use. Strictly speaking, this “z” (intervocalic -s-) was not voiced but was a soft voiceless sound, a sibilant intermediate between our s and z. In modern North German it has become voiced to z. It is important not to confound this s—z with the voiceless intervocalic s that soon arose from the older lisped ss. In Modern German (aside from certain dialects), old s and ss are not now differentiated when final (Maus and Fuss have identical sibilants), but can still be distinguished as voiced and voiceless s between vowels (Mäuse and Füsse).

Footnote 159: In practice phonetic laws have their exceptions, but more intensive study almost invariably shows that these exceptions are more apparent than real. They are generally due to the disturbing influence of morphological groupings or to special psychological reasons which inhibit the normal progress of the phonetic drift. It is remarkable with how few exceptions one need operate in linguistic history, aside from “analogical leveling” (morphological replacement).

Footnote 160: These confusions are more theoretical than real, however. A language has countless methods of avoiding practical ambiguities.

Footnote 161: A type of adjustment generally referred to as “analogical leveling.”

Footnote 162: Isolated from other German dialects in the late fifteenth and early sixteenth centuries. It is therefore a good test for gauging the strength of the tendency to “umlaut,” particularly as it has developed a strong drift towards analytic methods.

Footnote 163: Ch as in German Buch.

Footnote 164: The earlier students of English, however, grossly exaggerated the general “disintegrating” effect of French on middle English. English was moving fast toward a more analytic structure long before the French influence set in.

Footnote 165: For we still name our new scientific instruments and patent medicines from Greek and Latin.

Footnote 166: One might all but say, “has borrowed at all.”

Footnote 167: See page 206.

Footnote 168: Ugro-Finnic and Turkish (Tartar)

Footnote 169: Probably, in Sweet’s terminology, high-back (or, better, between back and “mixed” positions)-narrow-unrounded. It generally corresponds to an Indo-European long u.

Footnote 170: There seem to be analogous or partly analogous sounds in certain languages of the Caucasus.

Footnote 171: This can actually be demonstrated for one of the Athabaskan dialects of the Yukon.

Footnote 172: In the sphere of syntax one may point to certain French and Latin influences, but it is doubtful if they ever reached deeper than the written language. Much of this type of influence belongs rather to literary style than to morphology proper.

Footnote 173: See page 163.

Footnote 174: A group of languages spoken in southeastern Asia, of which Khmer (Cambodgian) is the best known representative.

Footnote 175: A group of languages spoken in northeastern India.

Footnote 176: I have in mind, e.g., the presence of postpositions in Upper Chinook, a feature that is clearly due to the influence of neighboring Sahaptin languages; or the use by Takelma of instrumental prefixes, which are likely to have been suggested by neighboring “Hokan” languages (Shasta, Karok).

Footnote 177: Itself an amalgam of North “French” and Scandinavian elements.

Footnote 178: The “Celtic” blood of what is now England and Wales is by no means confined to the Celtic-speaking regions—Wales and, until recently, Cornwall. There is every reason to believe that the invading Germanic tribes (Angles, Saxons, Jutes) did not exterminate the Brythonic Celts of England nor yet drive them altogether into Wales and Cornwall (there has been far too much “driving” of conquered peoples into mountain fastnesses and land’s ends in our histories), but simply intermingled with them and imposed their rule and language upon them.

Footnote 179: In practice these three peoples can hardly be kept altogether distinct. The terms have rather a local-sentimental than a clearly racial value. Intermarriage has gone on steadily for centuries and it is only in certain outlying regions that we get relatively pure types, e.g., the Highland Scotch of the Hebrides. In America, English, Scotch, and Irish strands have become inextricably interwoven.

Footnote 180: The High German now spoken in northern Germany is not of great age, but is due to the spread of standardized German, based on Upper Saxon, a High German dialect, at the expense of “Plattdeutsch.”

Footnote 181: “Dolichocephalic.”

Footnote 182: “Brachycephalic.”

Footnote 183: By working back from such data as we possess we can make it probable that these languages were originally confined to a comparatively small area in northern Germany and Scandinavia. This area is clearly marginal to the total area of distribution of the Indo-European-speaking peoples. Their center of gravity, say 1000 B.C., seems to have lain in southern Russia.

Footnote 184: While this is only a theory, the technical evidence for it is stronger than one might suppose. There are a surprising number of common and characteristic Germanic words which cannot be connected with known Indo-European radical elements and which may well be survivals of the hypothetical pre-Germanic language; such are house, stone, sea, wife (German Haus, Stein, See, Weib).

Footnote 185: Only the easternmost part of this island is occupied by Melanesian-speaking Papuans.

Footnote 186: A “nationality” is a major, sentimentally unified, group. The historical factors that lead to the feeling of national unity are various—political, cultural, linguistic, geographic, sometimes specifically religious. True racial factors also may enter in, though the accent on “race” has generally a psychological rather than a strictly biological value. In an area dominated by the national sentiment there is a tendency for language and culture to become uniform and specific, so that linguistic and cultural boundaries at least tend to coincide. Even at best, however, the linguistic unification is never absolute, while the cultural unity is apt to be superficial, of a quasi-political nature, rather than deep and far-reaching.

Footnote 187: The Semitic languages, idiosyncratic as they are, are no more definitely ear-marked.

Footnote 188: See page 209.

Footnote 189: The Fijians, for instance, while of Papuan (negroid) race, are Polynesian rather than Melanesian in their cultural and linguistic affinities.

Footnote 190: Though even here there is some significant overlapping. The southernmost Eskimo of Alaska were assimilated in culture to their Tlingit neighbors. In northeastern Siberia, too, there is no sharp cultural line between the Eskimo and the Chukchi.

Footnote 191: The supersession of one language by another is of course not truly a matter of linguistic assimilation.

Footnote 192: “Temperament” is a difficult term to work with. A great deal of what is loosely charged to national “temperament” is really nothing but customary behavior, the effect of traditional ideals of conduct. In a culture, for instance, that does not look kindly upon demonstrativeness, the natural tendency to the display of emotion becomes more than normally inhibited. It would be quite misleading to argue from the customary inhibition, a cultural fact, to the native temperament. But ordinarily we can get at human conduct only as it is culturally modified. Temperament in the raw is a highly elusive thing.

Footnote 193: See pages 39, 40.

Footnote 194: I can hardly stop to define just what kind of expression is “significant” enough to be called art or literature. Besides, I do not exactly know. We shall have to take literature for granted.

Footnote 195: This “intuitive surrender” has nothing to do with subservience to artistic convention. More than one revolt in modern art has been dominated by the desire to get out of the material just what it is really capable of. The impressionist wants light and color because paint can give him just these; “literature” in painting, the sentimental suggestion of a “story,” is offensive to him because he does not want the virtue of his particular form to be dimmed by shadows from another medium. Similarly, the poet, as never before, insists that words mean just what they really mean.

Footnote 196: See Benedetto Croce, “Aesthetic.”

Footnote 197: The question of the transferability of art productions seems to me to be of genuine theoretic interest. For all that we speak of the sacrosanct uniqueness of a given art work, we know very well, though we do not always admit it, that not all productions are equally intractable to transference. A Chopin étude is inviolate; it moves altogether in the world of piano tone. A Bach fugue is transferable into another set of musical timbres without serious loss of esthetic significance. Chopin plays with the language of the piano as though no other language existed (the medium “disappears”); Bach speaks the language of the piano as a handy means of giving outward expression to a conception wrought in the generalized language of tone.

Footnote 198: Provided, of course, Chinese is careful to provide itself with the necessary scientific vocabulary. Like any other language, it can do so without serious difficulty if the need arises.

Footnote 199: Aside from individual peculiarities of diction, the selection and evaluation of particular words as such.

Footnote 200: Not by any means a great poem, merely a bit of occasional verse written by a young Chinese friend of mine when he left Shanghai for Canada.

Footnote 201: The old name of the country about the mouth of the Yangtsze.

Footnote 202: A province of Manchuria.

Footnote 203: I.e., China.

Footnote 204: Poetry everywhere is inseparable in its origins from the singing voice and the measure of the dance. Yet accentual and syllabic types of verse, rather than quantitative verse, seem to be the prevailing norms.

Footnote 205: Quantitative distinctions exist as an objective fact. They have not the same inner, psychological value that they had in Greek.

Footnote 206: Verhaeren was no slave to the Alexandrine, yet he remarked to Symons, à propos of the translation of Les Aubes, that while he approved of the use of rhymeless verse in the English version, he found it “meaningless” in French.

Download ePUB

If you like this ebook, consider a donation!

Contents