Vocabulary

A vocabulary (also known as a lexicon) is a set of words, typically the set in a language or the set known to an individual. The word vocabulary originated from the Latin vocabulum, meaning "a word, name". It forms an essential component of language and communication, helping convey thoughts, ideas, emotions, and information. Vocabulary can be oral, written, or signed and can be categorized into two main types: active vocabulary (words one uses regularly) and passive vocabulary (words one recognizes but does not use often). An individual's vocabulary continually evolves through various methods, including direct instruction, independent reading, and natural language exposure, but it can also shrink due to forgetting, trauma, or disease. Furthermore, vocabulary is a significant focus of study across various disciplines, like linguistics, education, psychology, and artificial intelligence. Vocabulary is not limited to single words; it also encompasses multi-word units known as collocations, idioms, and other types of phraseology. Acquiring an adequate vocabulary is one of the largest challenges in learning a second language.

Terminology
A vocabulary is the set of words in a given language that an individual knows and uses. In the context of linguistics, a vocabulary may refer more broadly to any set of words. Types of vocabularies have been further defined: a lexis is a vocabulary comprising all words used in a language or other linguistic context or in a person's lexical repertoire. An individual person's vocabulary includes an passive vocabulary of words they can recognize or understand, as well as an active vocabulary of words they regularly use in speech and writing. In semiotics, vocabulary refers to the complete set of symbols and signs in a sign system or a text, extending the definition beyond purely verbal communication to encompass other forms of symbolic communication.

Vocabulary acquisition is a central aspect of language education, as it directly impacts reading comprehension, expressive and receptive language skills, and academic achievement. Vocabulary is examined in psychology as a measure of language processing and cognitive development. It can serve as an indicator of intellectual ability or cognitive status, with vocabulary tests often forming part of intelligence and neuropsychological assessments.

Definition of "word"
Word has a variety of meanings, and our understand of ideas such as vocabulary size differ depending on the definition used.

The most common definition equates words with lemmas (the inflected or dictionary form; this includes walk, but not walks, walked or walking). Most of the time lemmas do not include proper nouns (names of people, places, companies, etc.). Another definition often used in research of vocabulary size is that of word family. These are all the words that can be derived from a ground word (e.g., the words effortless, effortlessly, effortful, effortfully are all part of the word family effort). Estimates of vocabulary size range from as high as 200 thousand to as low as 10 thousand, depending on the definition used.

Productive and receptive knowledge
The first major change distinction that must be made when evaluating word knowledge is whether the knowledge is productive (also called achieve or active) or receptive (also called receive or passive); even within those opposing categories, there is often no clear distinction. Words that are generally understood when heard or read or seen constitute a person's receptive vocabulary. These words may range from well known to barely known (see degree of knowledge below). A person's receptive vocabulary is usually the larger of the two. For example, although a young child may not yet be able to speak, write, or sign, they may be able to follow simple commands and appear to understand a good portion of the language to which they are exposed. In this case, the child's receptive vocabulary is likely tens, if not hundreds of words, but their active vocabulary is zero. When that child learns to speak or sign, however, the child's active vocabulary begins to increase. It is also possible for the productive vocabulary to be larger than the receptive vocabulary, for example in a second-language learner who has learned words through study rather than exposure, and can produce them, but has difficulty recognizing them in conversation.

Productive vocabulary, therefore, generally refers to words that can be produced within an appropriate context and match the intended meaning of the speaker or signer. As with receptive vocabulary, however, there are many degrees at which a particular word may be considered part of an active vocabulary. Knowing how to pronounce, sign, or write a word does not necessarily mean that the word that has been used correctly or accurately reflects the intended message; but it does reflect a minimal amount of productive knowledge.

Degree of knowledge
Within the receptive–productive distinction lies a range of abilities that are often referred to as degree of knowledge. This simply indicates that a word gradually enters a person's vocabulary over a period of time as more aspects of word knowledge are learnt. Roughly, these stages could be described as:


 * 1) Never encountered the word.
 * 2) Heard the word, but cannot define it.
 * 3) Recognizes the word due to context or tone of voice.
 * 4) Able to use the word and understand the general and/or intended meaning, but cannot clearly explain it.
 * 5) Fluent with the word – its use and definition.

Depth of knowledge
The differing degrees of word knowledge imply a greater depth of knowledge, but the process is more complex than that. There are many facets to knowing a word, some of which are not hierarchical so their acquisition does not necessarily follow a linear progression suggested by degree of knowledge. Several frameworks of word knowledge have been proposed to better operationalise this concept. One such framework includes nine facets:


 * 1) orthography – written form
 * 2) phonology – spoken form
 * 3) reference – concept and reference
 * 4) semantics – meaning
 * 5) register – appropriacy of use or register
 * 6) collocation – lexical neighbours
 * 7) word associations
 * 8) syntax – grammatical function
 * 9) morphology – word parts

Types of vocabulary
Listed in order of most ample to most limited:

Reading vocabulary
A person's reading vocabulary is all the words recognized when reading. This class of vocabulary is generally the most ample, as new words are more commonly encountered when reading than when listening.

Listening vocabulary
A person's listening vocabulary comprises the words recognized when listening to speech. Cues such as the speaker's tone and gestures, the topic of discussion, and the conversation's social context may convey the meaning of an unfamiliar word.

Speaking vocabulary
A person's speaking vocabulary comprises the words used in speech and is generally a subset of the listening vocabulary. Due to the spontaneous nature of speech, words are often misused slightly and unintentionally, but facial expressions and tone of voice can compensate for this misuse.

Writing vocabulary
The written word appears in registers as different as formal essays and social media feeds. While many written words rarely appear in speech, a person's written vocabulary is generally limited by preference and context: a writer may prefer one synonym over another, and they will be unlikely to use technical vocabulary relating to a subject in which they have no interest or knowledge.

Final vocabulary
The American philosopher Richard Rorty characterized a person's "final vocabulary" as follows:

"All human beings carry about a set of words which they employ to justify their actions, their beliefs, and their lives. These are the words in which we formulate praise of our friends and contempt for our enemies, our long-term projects, our deepest self-doubts and our highest hopes... I shall call these words a person's 'final vocabulary'. Those words are as far as he can go with language; beyond them is only helpless passivity or a resort to force. (Contingency, Irony, and Solidarity p. 73)"

Vocabulary growth
During its infancy, a child instinctively builds a vocabulary. Infants imitate words that they hear and then associate those words with objects and actions. This is the listening vocabulary. The speaking vocabulary follows, as a child's thoughts become more reliant on their ability to self-express without relying on gestures or babbling. Once the reading and writing vocabularies start to develop, through questions and education, the child starts to discover the anomalies and irregularities of language.

In first grade, a child who can read learns about twice as many words as one who cannot. Generally, this gap does not narrow later. This results in a wide range of vocabulary by age five or six, when an English-speaking child will have learned about 1500 words.

Vocabulary grows throughout one's life. Between the ages of 20 and 60, people learn about 6,000 more lemmas, or one every other day. An average 20-year-old knows 42,000 lemmas coming from 11,100 word families. People expand their vocabularies by e.g. reading, playing word games, and participating in vocabulary-related programs. Exposure to traditional print media teaches correct spelling and vocabulary, while exposure to text messaging leads to more relaxed word acceptability constraints.

Importance

 * An extensive vocabulary aids expression and communication.
 * Vocabulary size has been directly linked to reading comprehension.
 * Linguistic vocabulary is synonymous with thinking vocabulary.
 * A person may be judged by others based on their vocabulary.
 * Wilkins (1972) said, "Without grammar, very little can be conveyed; without vocabulary, nothing can be conveyed."

Native-language vocabulary
Estimating average vocabulary size poses various difficulties and limitations due to the different definitions and methods employed such as what is the word, what is to know a word, what sample dictionaries were used, how tests were conducted, and so on. Native speakers' vocabularies also vary widely within a language, and are dependent on the level of the speaker's education.

As a result, estimates vary from 10,000 to 17,000 word families or 17,000-42,000 dictionary words for young adult native speakers of English.

A 2016 study shows that 20-year-old English native speakers recognize on average 42,000 lemmas, ranging from 27,100 for the lowest 5% of the population to 51,700 lemmas for the highest 5%. These lemmas come from 6,100 word families in the lowest 5% of the population and 14,900 word families in the highest 5%. 60-year-olds know on average 6,000 lemmas more.

According to another, earlier 1995 study junior-high students would be able to recognize the meanings of about 10,000–12,000 words, whereas for college students this number grows up to about 12,000–17,000 and for elderly adults up to about 17,000 or more.

For native speakers of German, average absolute vocabulary sizes range from 5,900 lemmas in first grade to 73,000 for adults.

The effects of vocabulary size on language comprehension
The knowledge of the 3000 most frequent English word families or the 5000 most frequent words provides 95% vocabulary coverage of spoken discourse. For minimal reading comprehension a threshold of 3,000 word families (5,000 lexical items) was suggested and for reading for pleasure 5,000 word families (8,000 lexical items) are required. An "optimal" threshold of 8,000 word families yields the coverage of 98% (including proper nouns).

Second language vocabulary acquisition
Learning vocabulary is one of the first steps in learning a second language, but a learner never finishes vocabulary acquisition. Whether in one's native language or a second language, the acquisition of new vocabulary is an ongoing process. There are many techniques that help one acquire new vocabulary.

Memorization
Although memorization can be seen as tedious or boring, associating one word in the native language with the corresponding word in the second language until memorized is considered one of the best methods of vocabulary acquisition. By the time students reach adulthood, they generally have gathered a number of personalized memorization methods. Although many argue that memorization does not typically require the complex cognitive processing that increases retention (Sagarra and Alba, 2006), it does typically require a large amount of repetition, and spaced repetition with flashcards is an established method for memorization, particularly used for vocabulary acquisition in computer-assisted language learning. Other methods typically require more time and longer to recall.

Some words cannot be easily linked through association or other methods. When a word in the second language is phonologically or visually similar to a word in the native language, one often assumes they also share similar meanings. Though this is frequently the case, it is not always true. When faced with a false friend, memorization and repetition are the keys to mastery. If a second language learner relies solely on word associations to learn new vocabulary, that person will have a very difficult time mastering false friends. When large amounts of vocabulary must be acquired in a limited amount of time, when the learner needs to recall information quickly, when words represent abstract concepts or are difficult to picture in a mental image, or when discriminating between false friends, rote memorization is the method to use. A neural network model of novel word learning across orthographies, accounting for L1-specific memorization abilities of L2-learners has recently been introduced (Hadzibeganovic and Cannas, 2009).

The keyword method
One way of learning vocabulary is to use mnemonic devices or to create associations between words, this is known as the "keyword method" (Sagarra and Alba, 2006). It also takes a long time to implement — and takes a long time to recollect — but because it makes a few new strange ideas connect it may help in learning. Also it presumably does not conflict with Paivio's dual coding system because it uses visual and verbal mental faculties. However, this is still best used for words that represent concrete things, as abstract concepts are more difficult to remember.

Word lists
Several word lists have been developed to provide people with a limited vocabulary for rapid language proficiency or for effective communication. These include Basic English (850 words), Special English (1,500 words), General Service List (2,000 words), and Academic Word List. Some learner's dictionaries have developed defining vocabularies which contain only most common and basic words. As a result, word definitions in such dictionaries can be understood even by learners with a limited vocabulary. Some publishers produce dictionaries based on word frequency or thematic groups.

The Swadesh list was made for investigation in linguistics.

Focal vocabulary
Focal vocabulary is a specialized set of terms and distinctions that is particularly important to a certain group: those with a particular focus of experience or activity. A lexicon, or vocabulary, is a language's dictionary: its set of names for things, events, and ideas. Some linguists believe that lexicon influences people's perception of things, the Sapir–Whorf hypothesis. For example, the Nuer of Sudan have an elaborate vocabulary to describe cattle. The Nuer have dozens of names for cattle because of the cattle's particular histories, economies, and environments. This kind of comparison has elicited some linguistic controversy, as with the number of "Eskimo words for snow". English speakers with relevant specialised knowledge can also display elaborate and precise vocabularies for snow and cattle when the need arises.