Speech production visualized by Real-time MRI

Linguistics
Part of a series on
Outline History Index
General linguistics Diachronic Lexicography Morphology Phonology Pragmatics Semantics Syntax Syntax–semantics interface Typology
Applied linguistics Acquisition Anthropological Applied Computational Conversation analysis Corpus linguistics Discourse analysis Distance Documentation Ethnography of communication Ethnomethodology Forensic History of linguistics Interlinguistics Neurolinguistics Philology Philosophy of language Phonetics Psycholinguistics Sociolinguistics Text Translating and interpreting Writing systems
Theoretical frameworks Formalist Constituency Dependency Distributionalism Generative Glossematics Functional Cognitive Construction grammar Functional discourse grammar Grammaticalization Interactional linguistics Prague circle Systemic functional Usage-based Structuralism
Topics Autonomy of syntax Compositionality Conservative and innovative language Descriptivism Etymology Iconicity Internationalism Internet linguistics LGBT linguistics Origin of language Orthography Philosophy of linguistics Prescriptivism Second-language acquisition Theory of language
Portal
v t e

Speech is the use of the human voice as a medium for language. Spoken language combines vowel and consonant sounds to form units of meaning like words, which belong to a language's lexicon. There are many different intentional speech acts, such as informing, declaring, asking, persuading, directing; acts may vary in various aspects like enunciation, intonation, loudness, and tempo to convey meaning. Individuals may also unintentionally communicate aspects of their social position through speech, such as sex, age, place of origin, physiological and mental condition, education, and experiences.

While normally used to facilitate communication with others, people may also use speech without the intent to communicate. Speech may nevertheless express emotions or desires; people talk to themselves sometimes in acts that are a development of what some psychologists (e.g., Lev Vygotsky) have maintained is the use of silent speech in an interior monologue to vivify and organize cognition, sometimes in the momentary adoption of a dual persona as self addressing self as though addressing another person. Solo speech can be used to memorize or to test one's memorization of things, and in prayer or in meditation.

Researchers study many different aspects of speech: speech production and speech perception of the sounds used in a language, speech repetition, speech errors, the ability to map heard spoken words onto the vocalizations needed to recreate them, which plays a key role in children's enlargement of their vocabulary, and what different areas of the human brain, such as Broca's area and Wernicke's area, underlie speech. Speech is the subject of study for linguistics, cognitive science, communication studies, psychology, computer science, speech pathology, otolaryngology, and acoustics. Speech compares with written language,^[1] which may differ in its vocabulary, syntax, and phonetics from the spoken language, a situation called diglossia.

The evolutionary origin of speech is subject to debate and speculation. While animals also communicate using vocalizations, and trained apes such as Washoe and Kanzi can use simple sign language, no animals' vocalizations are articulated phonemically and syntactically, and do not constitute speech.

Evolution

Main article: Origin of speech

Although related to the more general problem of the origin of language, the evolution of distinctively human speech capacities has become a distinct and in many ways separate area of scientific research.^[2]^[3]^[4]^[5]^[6] The topic is a separate one because language is not necessarily spoken: it can equally be written or signed. Speech is in this sense optional, although it is the default modality for language.

Monkeys, non-human apes and humans, like many other animals, have evolved specialised mechanisms for producing sound for purposes of social communication.^[7] On the other hand, no monkey or ape uses its tongue for such purposes.^[8]^[9] The human species' unprecedented use of the tongue, lips and other moveable parts seems to place speech in a quite separate category, making its evolutionary emergence an intriguing theoretical challenge in the eyes of many scholars.^[10]

Determining the timeline of human speech evolution is made additionally challenging by the lack of data in the fossil record. The human vocal tract does not fossilize, and indirect evidence of vocal tract changes in hominid fossils has proven inconclusive.^[10]

Production

Main articles: Speech production and Linguistics

Speech production is an unconscious multi-step process by which thoughts are generated into spoken utterances. Production involves the unconscious mind selecting appropriate words and the appropriate form of those words from the lexicon and morphology, and the organization of those words through the syntax. Then, the phonetic properties of the words are retrieved and the sentence is articulated through the articulations associated with those phonetic properties.^[11]

In linguistics, articulatory phonetics is the study of how the tongue, lips, jaw, vocal cords, and other speech organs are used to make sounds. Speech sounds are categorized by manner of articulation and place of articulation. Place of articulation refers to where in the neck or mouth the airstream is constricted. Manner of articulation refers to the manner in which the speech organs interact, such as how closely the air is restricted, what form of airstream is used (e.g. pulmonic, implosive, ejectives, and clicks), whether or not the vocal cords are vibrating, and whether the nasal cavity is opened to the airstream.^[12] The concept is primarily used for the production of consonants, but can be used for vowels in qualities such as voicing and nasalization. For any place of articulation, there may be several manners of articulation, and therefore several homorganic consonants.

Normal human speech is pulmonic, produced with pressure from the lungs, which creates phonation in the glottis in the larynx, which is then modified by the vocal tract and mouth into different vowels and consonants. However humans can pronounce words without the use of the lungs and glottis in alaryngeal speech, of which there are three types: esophageal speech, pharyngeal speech and buccal speech (better known as Donald Duck talk).

Errors

Main article: Speech error

Speech production is a complex activity, and as a consequence errors are common, especially in children. Speech errors come in many forms and are used to provide evidence to support hypotheses about the nature of speech.^[13] As a result, speech errors are often used in the construction of models for language production and child language acquisition. For example, the fact that children often make the error of over-regularizing the -ed past tense suffix in English (e.g. saying 'singed' instead of 'sang') shows that the regular forms are acquired earlier.^[14]^[15] Speech errors associated with certain kinds of aphasia have been used to map certain components of speech onto the brain and see the relation between different aspects of production; for example, the difficulty of expressive aphasia patients in producing regular past-tense verbs, but not irregulars like 'sing-sang' has been used to demonstrate that regular inflected forms of a word are not individually stored in the lexicon, but produced from affixation to the base form.^[16]

Perception

Main article: Speech perception

Speech perception refers to the processes by which humans can interpret and understand the sounds used in language. The study of speech perception is closely linked to the fields of phonetics and phonology in linguistics and cognitive psychology and perception in psychology. Research in speech perception seeks to understand how listeners recognize speech sounds and use this information to understand spoken language. Research into speech perception also has applications in building computer systems that can recognize speech, as well as improving speech recognition for hearing- and language-impaired listeners.^[17]

Speech perception is categorical, in that people put the sounds they hear into categories rather than perceiving them as a spectrum. People are more likely to be able to hear differences in sounds across categorical boundaries than within them. A good example of this is voice onset time (VOT), one aspect of the phonetic production of consonant sounds. For example, Hebrew speakers, who distinguish voiced /b/ from voiceless /p/, will more easily detect a change in VOT from -10 ( perceived as /b/ ) to 0 ( perceived as /p/ ) than a change in VOT from +10 to +20, or -10 to -20, despite this being an equally large change on the VOT spectrum.^[18]

Development

Main article: Language development

Most human children develop proto-speech babbling behaviors when they are four to six months old. Most will begin saying their first words at some point during the first year of life. Typical children progress through two or three word phrases before three years of age followed by short sentences by four years of age.^[19]

Repetition

Main article: Speech repetition

In speech repetition, speech being heard is quickly turned from sensory input into motor instructions needed for its immediate or delayed vocal imitation (in phonological memory). This type of mapping plays a key role in enabling children to expand their spoken vocabulary. Masur (1995) found that how often children repeat novel words versus those they already have in their lexicon is related to the size of their lexicon later on, with young children who repeat more novel words having a larger lexicon later in development. Speech repetition could help facilitate the acquisition of this larger lexicon.^[20]

Problems

Treatment

Main article: Speech–language pathology

Speech-related diseases, disorders, and conditions can be treated by a speech-language pathologist (SLP) or speech therapist. SLPs assess levels of speech needs, make diagnoses based on the assessments, and then treat the diagnoses or address the needs.^[29]

Brain physiology

Classical model

The classical or Wernicke-Geschwind model of the language system in the brain focuses on Broca's area in the inferior prefrontal cortex, and Wernicke's area in the posterior superior temporal gyrus on the dominant hemisphere of the brain (typically the left hemisphere for language). In this model, a linguistic auditory signal is first sent from the auditory cortex to Wernicke's area. The lexicon is accessed in Wernicke's area, and these words are sent via the arcuate fasciculus to Broca's area, where morphology, syntax, and instructions for articulation are generated. This is then sent from Broca's area to the motor cortex for articulation.^[30]

Paul Broca identified an approximate region of the brain in 1861 which, when damaged in two of his patients, caused severe deficits in speech production, where his patients were unable to speak beyond a few monosyllabic words. This deficit, known as Broca's or expressive aphasia, is characterized by difficulty in speech production where speech is slow and labored, function words are absent, and syntax is severely impaired, as in telegraphic speech. In expressive aphasia, speech comprehension is generally less affected except in the comprehension of grammatically complex sentences.^[31] Wernicke's area is named after Carl Wernicke, who in 1874 proposed a connection between damage to the posterior area of the left superior temporal gyrus and aphasia, as he noted that not all aphasic patients had had damage to the prefrontal cortex.^[32] Damage to Wernicke's area produces Wernicke's or receptive aphasia, which is characterized by relatively normal syntax and prosody but severe impairment in lexical access, resulting in poor comprehension and nonsensical or jargon speech.^[31]

Modern research

Modern models of the neurological systems behind linguistic comprehension and production recognize the importance of Broca's and Wernicke's areas, but are not limited to them nor solely to the left hemisphere.^[33] Instead, multiple streams are involved in speech production and comprehension. Damage to the left lateral sulcus has been connected with difficulty in processing and producing morphology and syntax, while lexical access and comprehension of irregular forms (e.g. eat-ate) remain unaffected.^[34] Moreover, the circuits involved in human speech comprehension dynamically adapt with learning, for example, by becoming more efficient in terms of processing time when listening to familiar messages such as learned verses.^[35]

Animal communication

Main article: Talking animals

Some non-human animals can produce sounds or gestures resembling those of a human language.^[36] Several species or groups of animals have developed forms of communication which superficially resemble verbal language, however, these usually are not considered a language because they lack one or more of the defining characteristics, e.g. grammar, syntax, recursion, and displacement. Researchers have been successful in teaching some animals to make gestures similar to sign language,^[37]^[38] although whether this should be considered a language has been disputed.^[39]

References

External links

More

Communication studies

Communication studies
History Outline
Topics and terminology	Biocommunication Broadcasting Communication Computer-mediated communication Conversation History of communication Information Intercultural Interpersonal Intrapersonal Journalism Mass media Meaning Media Media ecology Meta-communication Models of communication New media Nonverbal communication Nonviolent communication Propaganda Reading Speech Symbol list Telecommunication Text and conversation theory Writing
Subfields	Closed-loop Communication design Communication theory Communicology Crisis Climate Cross-cultural Developmental Discourse analysis Environmental Global Health International Mass Media studies Mediated cross-border Organizational Political Risk Science Technical Visual
Scholars	Adorno Barthes Bateson Benjamin Burke Castells Chomsky Craig Ellul Fisher Flusser Gasset Gerbner Goffman Habermas Horkheimer Huxley Innis Jakobson Janis Johnson Kincaid Lippman Luhmann Marcuse McLuhan Mead Morgan Ong Packard Peirce Postman Quebral Richards Rogers Schramm Shannon Tankard Tannen Wertheimer
Category

Authority control databases: National

Nonverbal communication

Modalities

Physical	Blushing Body language / Kinesics Body-to-body communication Facial expression Facial Action Coding System Microexpression Subtle expression Gesture List Speech-independent gestures Haptic communication Imitation Interpersonal synchrony Laughter Oculesics Eye contact Pupil dilation Olfaction Posture Proxemics
Speech	Affect Emotional prosody Paralanguage Intonation Loudness Prosody Rhythm Stress Tone Voice quality
Social context	Chronemics Conventions Display rules Habitus High-context and low-context cultures Interpersonal relationship Social norm
Other	Emoticon / Smiley One-bit message Missed call Silent service code
Unconscious	Microexpression Non-verbal leakage
Multi-faceted	Affect display Deception Emotion recognition First impression Intimacy

Broader concepts
Basic interpersonal communicative skills Communication Emotional intelligence Nunchi People skills Semiotics Social behavior Social cue Social competence Social skills Unsaid

Further information

Disorders

Neuroanatomy

Applications

Technology

Key people

Animal communication Behavioral communication Aggressive Assertive Passive Passive-aggressive Impression management Meta-communication Monastic sign lexicons Verbal communication
Non-verbal language	Sign language Tactile signing Tadoma
Art and literature	Mime Mimoplastic art Subtext