Linguistic performance

The term linguistic performance was used by Noam Chomsky in 1960 to describe "the actual use of language in concrete situations". It is used to describe both the production, sometimes called  parole, as well as the comprehension of language. Performance is defined in opposition to "competence"; the latter describes the mental knowledge that a speaker or listener has of language.

Part of the motivation for the distinction between performance and competence comes from speech errors: despite having a perfect understanding of the correct forms, a speaker of a language may unintentionally produce incorrect forms. This is because performance occurs in real situations, and so is subject to many non-linguistic influences. For example, distractions or memory limitations can affect lexical retrieval (Chomsky 1965:3), and give rise to errors in both production and perception. Such non-linguistic factors are completely independent of the actual knowledge of language, and establish that speakers' knowledge of language (their competence) is distinct from their actual use of language (their performance).

Competence versus performance
Competence is the collection of subconscious rules that one knows when one knows a language; performance is the system which puts these rules to use. This distinction is related to the broader notion of Marr's levels used in other cognitive sciences, with competence corresponding to Marr's computational level.

For example, many linguistic theories, particularly in generative grammar, would propose competence-based explanations for why English speakers would judge the sentence in (1) as odd. In these explanations, the sentence would be ungrammatical because the rules of English only generate sentences where demonstratives agree with the grammatical number of their associated noun.


 * (1) *That cats is eating the mouse.

By contrast, generative theories generally provide performance-based explanations for the oddness of center embedding sentences like one in (2). According to such explanations, the grammar of English could in principle generate such sentences, but doing so in practice is so taxing on working memory that the sentence ends up being unparsable.


 * (2) *The cat that the dog that the man fed chased meowed.

In general, performance-based explanations deliver a simpler theory of grammar at the cost of additional assumptions about memory and parsing. As a result, the choice between a competence-based explanation and a performance-based explanation for a given phenomenon is not always obvious and can require investigating whether the additional assumptions are supported by independent evidence. For example, while many generative models of syntax explain island effects by positing constraints within the grammar, it has also been argued that some or all of these constraints are in fact the result of limitations on performance.

Related distinctions
Published in 1916, Ferdinand de Saussure's Course in General Linguistics describes language as "a system of signs that express ideas". de Saussure describes two components of language: langue and parole. Langue consists of the structural relations that define a language, which includes grammar, syntax and phonology. Parole is the physical manifestation of signs; in particular the concrete manifestation of langue as speech or writing. While langue can be viewed strictly as a system of rules, it is not an absolute system such that parole must utterly conform to langue. Drawing an analogy to chess, de Saussure compares langue to the rules of chess that define how the game should be played, and parole to the individual choices of a player given the possible moves allowed within the system of rules.

In 1986, Noam Chomsky proposed a distinction similar to the competence/performance distinction, entertaining the notion of an I-Language (internal language) which is the intrinsic linguistic knowledge within a native speaker and E-Language (external language) which is the observable linguistic output of a speaker.

Performance-grammar correspondence hypothesis
John A. Hawkins's Performance-Grammar Correspondence Hypothesis (PGCH) states that the syntactic structures of grammars are conventionalized based on whether and how much the structures are preferred in performance. Performance preference is related to structure complexity and processing, or comprehension, efficiency. Specifically, a complex structure refers to a structure containing more linguistic elements or words at the end of the structure than at the beginning. It is this structural complexity that results in decreased processing efficiency since more structure requires additional processing. This model seeks to explain word order across languages based on avoidance of unnecessary complexity in favour of increased processing efficiency. Speakers make an automatic calculation of the Immediate Constituent(IC)-to-word order ratio and produce the structure with the highest ratio. Structures with a high IC-to-word order are structures that contain the fewest words required for the listener to parse the structure into constituents which results in more efficient processing.

Head-initial structures
In head-initial structures, which includes example SVO and VSO word order, the speaker's goal is to order the sentence constituents from least to most complex.

SVO word order
SVO word order can be exemplified with English; consider the example sentences in (1). In (1a) three immediate constituents (ICs) are present in the verb phrase, namely VP, PP1 and PP2, and there are four words (went, to, London, in) required to parse the VP into its constituents. Therefore, the IC-to-word ratio is 3/4=75%. In contrast, in (1b) the VP is still composed of three ICs but there are now six words that are required to determine the constituent structure of the VP (went, in, the, late, afternoon, to). Thus, the ratio for (1b) is 3/6 = 50%. Hawkins proposes that speakers prefer to produce (1a) since it has a higher IC-to-word ratio and this leads to faster and more efficient processing.

1a. John [VP went [PP1 to London] PP2 in the late afternoon 1b. John [VP went [PP2 in the late afternoon]] PP1 to London

Hawkins supports the above analysis by providing performance data to demonstrate the preference speakers have for ordering short phrases before long phrases when producing head-initial structures. The table based on English data, below, illustrates that the short prepositional phrase (PP1) is preferentially ordered before the long PP (PP2) and that this preference increases as the size differential between the two PPs increases. For example, 60% of the sentences are ordered short (PP1) to long (PP2) when PP2 was longer than PP1 by 1 word. In contrast, 99% of the sentences are ordered short to long when PP2 is longer than PP1 by 7+ words.

English prepositional phrase orderings by relative weight  PP2 = longer PP; PP1=shorter PP. Proportion of short-long to long-short as a percentage; actual numbers of sequences in parentheses. An additional 71 sequences had PPs of equal length (total n=394)

VSO word order
Hawkins argues that the preference for short followed by long phrases applies to all languages that have head-initial structuring. This includes languages with VSO word order such as from Hungarian. By calculating the IC-to-word ratio for the Hungarian sentences in the same way as was done for the English sentences, 2a. emerges as having a higher ratio than 2b.

2a. VP[Döngetik NP[facipöink NP[az utcakat] ] batter  wooden shoes-1PL  the streets-ACC Our wooden shoes batter the streets 2b. VP[Döngetik NP[az utcakat] NP[[ facipöink ] ]

The Hungarian performance data (below) show the same preference pattern as the English data. This study looked at the ordering of two successive noun phrases (NPs) and found that the shorter NP followed by the longer NP is preferred in performance, and that this preference increases as the size differential between NP1 and NP2 increases.

Hungarian noun phrase orderings by relative weight  mNP = any NP constructed on its left periphery. NP2 = longer NP; NP1 = shorter NP. Proportion of short-long to long-short given as a percentage; actual numbers of sequences given in parentheses. An additional 21 sequences had NPs of equal length (total n = 16).

Head-final structures
Hawkins' explanation of performance and word order extends to head-final structures. For example, since Japanese is a SOV language the head (V) is at the end of the sentence. This theory predicts that speakers will prefer to order the phrases in head-final sentences from long phrases to short, as opposed to short to long as seen in head-initial languages. This reversal of ordering preference is due to the fact that in head-final sentences it is the long followed by short phrasal ordering that has the higher IC-to-word ratio.

3a. Tanaka ga vp[pp[Hanako kara]np[sono hon o] katta] Tanaka NOM Hanako from that book ACC bought Tanako bought that book from Hanako 3b. Tanaka ga vp[np[sono hon o] pp[Hanako kara] [katta]

The VP and its constituents in 4. are constructed from their heads on the right. This means that the number of words used to calculate the ratio is counted from the head of the first phrase (PP in 3a. and NP in 3b.) to the verb (as indicated in bold above). The IC-to-word ratio for the VP in 3a. is 3/5=60% while the ratio for the VP in 3b. is 3/4=75%. Therefore, 3b. should be preferred by Japanese speakers since it has a higher IC-to-word ratio which leads to faster parsing of sentences by the listener.

The performance preference for long to short phrase ordering in SVO languages is supported by performance data. The table below shows that production of long to short phrases is preferred and that this preference increases as the size of the differential between the two phrases increases. For example, ordering of the longer 2ICm (where ICm is either a direct object NP with an accusative case particle or a PP constructed from the right periphery) before the shorter 1ICm is more frequent, and the frequency increases to 91% if the 2ICm is longer than the 1ICm by 9+ words.

Japanese NPo and PPm orderings by relative weight  Npo = direct object NP with accusative case particle. PPm = PP constructed on its right periphery by a P(ostposition). ICm= either NPo or PPm. 2IC=longer IC; 1IC = shorter IC. Proportion of long-to short to short-long orders given as a percentage; actual numbers of sequences in parentheses. an additional 91 sequences had ICs of equal length (total n=244)

Utterance planning hypothesis
Tom Wasow proposes that word order arises as a result of utterance planning benefiting the speaker. He introduces the concepts of early versus late commitment, where commitment is the point in the utterance where it becomes possible to predict subsequent structure. Specifically, early commitment refers to the commitment point present earlier in the utterance and late commitment refers to the commitment point present later in the utterance. He explains that early commitment will favour the listener since early prediction of subsequent structure enables faster processing. Comparatively, late commitment will favour the speaker by postponing decision making, giving the speaker more time to plan the utterance. Wasow illustrates how utterance planning influences syntactic word order by testing early versus late commitment in heavy-NP shifted (HNPS) sentences. The idea is to examine the patterns of HNPS to determine if the performance data show sentences that are structured to favour the speaker or the listener.

Examples of early/late commitment and heavy-NP shift
The following examples illustrate what is meant by early versus late commitment and how heavy-NP shift applies to these sentences. Wasow looked at two types of verbs:

Vt (transitive verbs): require NP objects.

4a. Pat VP[brought NP[a box with a ribbon around it] PP[ [to the party] ]  4b. Pat VP[brought PP[to the party] NP[ [a box with a ribbon around it] ] 

In 4a. no heavy-NP shift has been applied. The NP is available early but does not provide any additional information about the sentence structure – the "to" appearing late in the sentence is an example of late commitment. In contrast, in 4b., where heavy-NP shift has shifted the NP to the right, as soon as "to" is uttered the listener knows that the VP must contain the NP and a PP. In other words, when "to" is uttered it allows the listener to predict the remaining structure of the sentence early on. Thus for transitive verbs HNPS results in early commitment and favors the listener.

Vp (prepositional verbs): can take an NP object or an immediately following PP with no NP object

5a. Pat VP[wrote NP[something about Chris] PP[ [on the blackboard]]. 5b. Pat VP[wrote PP[on the blackboard] NP[ [something about Chris.]] 

No HNPS has been applied to 5a. In 5b. the listener needs to hear the word "something" in order to know that the utterance contains a PP and an NP since the object NP is optional but "something" has been shifted to later in the sentence. Thus for prepositional verbs HNPS results in late commitment and favours the speaker.

Predictions and findings
Based on the above information Wasow predicted that if sentences are constructed from the speaker's perspective then heavy-NP shift would rarely apply to sentences containing a transitive verb but would apply frequently to sentences containing a prepositional verb. The opposite prediction was made if sentences are constructed from the listener's perspective.

To test his predictions Wasow analyzed performance data (from corpora data) for the rates of occurrence of HNPS for Vt and Vp and found HNPS occurred twice as frequently in Vp than in Vt, therefore supporting the predictions made from the speaker's perspective. In contrast, he did not find evidence in support of the predictions made based on the listener's perspective. In other words, given the data above, when HNPS is applied to sentences containing a transitive verb the result favors the listener. Wasow found that HNPS applied to transitive verb sentences is rare in performance data thus supporting the speaker's perspective. Additionally, when HNPS is applied to prepositional verb structures the result favors the speaker. In his study of the performance data, Wasow found evidence of HNPS frequently applied to prepositional verb structures further supporting the speaker's perspective. Based on these findings Wasow concludes that HNPS is correlated with the speaker's preference for late commitment thereby demonstrating how speaker performance preference can influence word order.

Alternative grammar models
While the dominant views of grammar are largely oriented towards competence, many, including Chomsky himself, have argued that a complete model of grammar should be able to account for performance data. But while Chomsky argues that competence should be studied first, thereby allowing further study of performance, some systems, such as constraint grammars are built with performance as a starting point (comprehension, in the case of constraint grammars While traditional models of generative grammar have had a great deal of success in describing the structure of languages, they have been less successful in describing how language is interpreted in real situations. For example, traditional grammar describes a sentence as having an "underlying structure" which is different from the "surface structure" which speakers actually produce.  In a real conversation, however, a listener interprets the meaning of a sentence in real time, as the surface structure goes by. This kind of on-line processing, which accounts for phenomena such as finishing another person's sentence, and starting a sentence without knowing how it is going to finish, is not directly accounted for in traditional generative models of grammar. Several alternative grammar models exist which may be better able to capture this surface-based aspect of linguistic performance, including Constraint Grammar, Lexical Functional Grammar, and Head-driven phrase structure grammar.

Errors in linguistic performance
Errors in linguistic performance not only occur in children newly acquiring their native language, second language learners, those with a disability or an acquired brain injury but among competent speakers as well. Types of performance errors that will be of focus here are those that involve errors in syntax, other types of errors can occur in the phonological, semantic features of words, for further information see speech errors. Phonological and semantic errors can be due to the repetition of words, mispronunciations, limitations in verbal working memory, and length of the utterance. Slips of the tongue are most common in spoken languages and occur when the speaker either: says something they did not mean to; produces the incorrect order of sounds or words; or uses the incorrect word. Other instances of errors in linguistic performance are slips of the hand in signed languages, slips of the ear which are errors in comprehension of utterances and slips of the pen which occur while writing. Errors of linguistic performance are perceived by both the speaker and the listener and can therefore have many interpretations depending on the persons judgement and the context in which the sentence was spoken.

It is proposed that there is a close relation between the linguistic units of grammar and the psychological units of speech which implies that there is a relation between linguistic rules and the psychological processes that create utterances. Errors in performance can occur at any level of these psychological processes. Lise Menn proposes that there are five levels of processing in speech production, each with its own possible error that could occur. According to the proposed speech processing structure by Menn an error in the syntactic properties of an utterance occurs at the positional level.
 * 1) Message Level
 * 2) Functional Level
 * 3) Positional Level
 * 4) Phonological Encoding
 * 5) Speech Gesture

Another proposal for the levels of speech processing is made by Willem J. M. Levelt to be structured as so: Levelt (1993) states that we as speakers are unaware of most of these levels of performance such as articulation, which includes the movement and placement of the articulators, the formulation of the utterance which includes the words selected and their pronunciation and the rules which must be followed for the utterance to be grammatical. The levels speakers are consciously aware is the intent of the message which occurs at the level of conceptualization and then again at self-monitoring which is when the speaker would become aware of any errors that may have occurred and correct themselves.
 * 1) Conceptualization
 * 2) Formulation
 * 3) Articulation
 * 4) Self-Monitoring

Slips of the tongue
One type of slip of the tongue which cause an error in the syntax of the utterance are called transformational errors. Transformational errors are a mental operation proposed by Chomsky in his Transformational Hypothesis and it has three parts which errors in performance can occur. These transformations are applied at the level of the underlying structures and predict the ways in which an error can occur.
 * Structural analysis
 * Structural Change
 * Conditions

Structural Analysis errors can occur due to the application of (a) the rule misanalyzing the tense marker causing the rule to apply incorrectly, (b) the rule not being applied when it should or (c) a rule being applied when it should not.

This example from Fromkin (1980) demonstrates a rule misanalyzing the tense marker and for subject-auxiliary inversion to be incorrectly applied. The subject-auxiliary inversion is misanalyzed as to which structure it applies, applying without the verb be in the tense as it moves to the C position. This causes "do-support" to occur and the verb to lack tense causing the syntactic error.

6a. Error: Why do you be an oaf sometimes? 6b. Target: Why are you an oaf sometimes?

The following example from Fromkin (1980) demonstrates how a rule is being applied when it should not. The subject-auxiliary inversion rule is omitted in the error utterance, causing affix-hopping to occur and putting the tense onto the verb "say" creating the syntactic error. In the target the subject-auxiliary rule and then do-support applies creating the grammatically correct structure. 7a. Error: And what he said? 7b. Target: And what did he say?



This example from Fromkin (1980) shows how a rule is being applied when it should not. The subject-auxiliary inversion and do-support has applied to an idiomatic expression causing the insertion of "do" when it should not be applied in the ungrammatical utterance. 8a. Error: How do we go!! 8b. Target: How we go!!



Structural Change Errors can occur in the carrying out of rules, even though the analysis of the phrase marker is done correctly. This can occur when the analysis requires multiple rules to occur.

The following example from Fromkin (1980) shows the relative clause rule copies the determiner phrase "a boy" within the clause and this causes front attaching to the Wh-marker. Deletion is then skipped, leaving the determiner phrase in the clause in the error utterance causing it to be ungrammatical. 9a. Error: A boy who I know a boy has hair down to here. 9b. Target: A boy who I know has hair down to here.



Conditions errors restrict when the rule can and cannot be applied.

This last example from Fromkin (1980) shows that a rule was applied under a certain condition in which it is restricted. The subject-auxiliary inversion rule cannot apply to embedded clauses. In the case of this example it has causing for the syntactic error. 10a. Error: I know where is a top for it. 10b. Target: I know where a top for it is.

A study of deaf Italians found that the second person singular of indicatives would extend to corresponding forms in imperatives and negative imperatives.

The following is an example taken from Dutch data in which there is verb omission in the embedded clause of the utterance (which is not allowed in Dutch), resulting in a performance error.

A study done with Zulu speaking children with a language delay displayed errors in linguistic performance of lacking proper passive verb morphology.

Slips of the hand
The linguistic components of American Sign Language (ASL) can be broken down into four parts; the hand configuration, place of articulation, movement and other minor parameters. Hand configuration is determined by the shape of the hand, fingers and thumbs and is specific to the sign that is being used. It allows the signer to articulate what they are wanting to communicate by extending, flexing, bending or spreading the digits; the position of the thumb to the fingers; or the curvature of the hand. However, there are not an infinite amount of possible hand configurations, there are 19 classes of hand configuration primes as listed by the Dictionary of American Sign Language. Place of articulation is the particular location that the sign is being performed known as the "signing place". The "signing place" can be the whole face or a particular part of it, the eyes, nose, cheek, ear, neck, trunk, any part of the arm, or the neutral area in front of the signers head and body. Movement is the most complex as it can be difficult to analyze. Movement is restricted to directional, rotations of the wrist, local movements of the hand and interactions of the hands. These movements can occur singularly, in sequence, or simultaneously. Minor parameters in ASL include contacting region, orientation and hand arrangement. They are subclasses of hand configuration. Performance errors resulting in ungrammatical signs can result due to processes that change the hand configuration, place, movement or other parameter of the sign. These processes can be anticipation, preservation, or metathesis. Anticipation is caused when some characteristic of the next sign is incorporated into the sign that is presently being performed. Preservation is the opposite of anticipation where some characteristic of the preceding sign is carried over into the performance of the next sign. Metathesis occurs when two characteristics of adjacent signs are combined into one in the performance of both signs. Each of these errors will result in an incorrect sign being performed. This could result in either a different sign being performed instead of the intended one, or nonexistent signs which forms are possible and those which forms are not possible due to the structural rules. These are the main types of performance errors in sign language however on the rare occasion there is also the possibility of errors in the order of the signs performed resulting in a different meaning than what the signer intended.

Other types of errors
Unacceptable Sentences are ones which, although are grammatical, are not considered proper utterances. They are considered unacceptable due to the lack of our cognitive systems to process them. Speakers and listeners can be aided in the performance and processing of these sentences by eliminating time and memory constraints, increasing motivation to process these utterances and using pen and paper. In English there are three types of sentences that are grammatical but are considered unacceptable by speakers and listeners. When a speaker makes an utterance they must translate their ideas into words, then syntactically proper phrases with proper pronunciation. The speaker must have prior world knowledge and an understanding of the grammatical rules that their language enforces. When learning a second language or with children acquiring their first language, speakers usually have this knowledge before they are able to produce them. Their speech is usually slow and deliberate, using phrases they have already mastered, and with practice their skills increase. Errors of linguistic performance are judged by the listener giving many interpretations if an utterance is well-formed or ungrammatical depending on the individual. As well the context in which an utterance is used can determine if the error would be considered or not. When comparing "Who must telephone her?" and "Who need telephone her?" the former would be considered the ungrammatical phrase. However, when comparing it to "Who want telephone her?" it would be considered the grammatical phrase. The listener may also be the speaker. When repeating sentences with errors if the error is not comprehended then it is performed. As well if the speaker does notice the error in the sentence they are supposed to repeat they are unaware of the difference between their well-formed sentence and the ungrammatical sentence. An unacceptable utterance can also be performed due to a brain injury. Three types of brain injuries that could cause errors in performance were studied by Fromkin are dysarthria, apraxia and literal paraphasia. Dysarthria is a defect in the neuromuscular connection that involves speech movement. The speech organs involved can be paralyzed or weakened, making it difficult or impossible for the speaker to produce a target utterance. Apraxia is when there is damage to the ability to initiate speech sounds with no paralysis or weakening of the articulators. Literal paraphasia causes disorganization of linguistic properties, resulting in errors of word order of phonemes. Having a brain injury and being unable to perform proper linguistic utterances, some individuals are still able to process complex sentences and formulate syntactically well formed sentences in their mind. Child productions when they are acquiring language are full of errors of linguistic performance. Children must go from imitating adult speech to create new phrases of their own. They will need to use their cognitive operations of the knowledge of their language they are learning to determine the rules and properties of that language. The following are examples of errors in English speaking children's productions.
 * 1) Repeated self-embedded clauses: The cheese that the rat that the cat chased ate is on the table.
 * 2) Multi Right Branching: This is the cat that caught the rat that ate the cheese that is on the table.
 * 3) Ambiguity or Garden Path Sentences: The horse raced past the barn fell
 * "I goed"
 * "He runned"

In an elicited production experiment a child, Adam, was prompted to ask questions to an Old Lady

Mean length of utterance
The most commonly used measure of syntax complexity is the mean length of utterance, also known as MLU. This measure is independent from how often children talk and focuses on the complexity and development of their grammatical systems, including morphological and syntactic development. The number representing a person's MLU corresponds to the complexity of the syntax being used. In general, as the MLU increases, the syntactic complexity also increases. Typically, the average MLU corresponds to a child's age due to their increase in working memory, which allows for sentences to be of greater syntactic complexity. For example, the average MLU of a 7-year-old child is 7 words. However, children show more individual variability of syntactic performance with more complex syntax. Complex syntax have a higher number of phrases and clause levels, therefore adding more words to the overall syntactic structure. Seeing as there are more individual differences in MLU and syntactic development as children get older, MLU is particularly used to measure grammatical complexity among school-aged children. Other types of segmentation strategies for discourse are the T-unit and C-unit (communicative unit). If these two measurements are used to account for discourse, the average length of the sentence will be lower than if MLU is used alone. Both the T-units and C-units count each clause as a new unit, hence a lower number of units.

Typical MLU per age group can be found in the following table, according to Roger Brown's five stages of syntactic and morphological development:

Here are the steps for calculating MLU:


 * 1) Acquire a language sample of about 50-100 utterances
 * 2) Count the number of morphemes said by the child, then divide by the number of utterances
 * 3) The investigator can assess what stage of syntactic development the child is at, based on their MLU

Here's an example of how to calculate MLU:

In total there are 17 morphemes in this data set. In order to find the MLU, we divide the total number of morphemes (17) by the total number of utterances (4). In this particular data set, the mean length of utterance is 17/4 = 4.25.

Clause density
Clause density refers to the degree to which utterances contain dependent clauses. This density is calculated as a ratio of the total number of clauses across sentences, divide by the number of sentences in a discourse sample. For example, if the clause density is 2.0, the ratio would indicate that the sentence being analyzed has 2 clauses on average: one main clause and one subordinate clause.

Here is an example of how clause density is measured, using T-units, adapted from Silliman & Wilkinson 2007:

Indices of syntactic performance
Indices track structures to show a more comprehensive picture of a person's syntactic complexity. Some examples of indices are Development Sentence Scoring, the Index of Productive Syntax and the Syntactic Complexity Measure.

Developmental sentence scoring
Developmental Sentence Scoring is another method to measure syntactic performance as a clinical tool. In this indice, each consecutive utterance, or sentence, elicited from a child is scored. This is a commonly applied measurement of syntax for first and second language learners, with samples gathered from both elicited and spontaneous oral discourse. Methods for eliciting speech for these samples come in many forms, such having the participant answering questions or re-telling a story. These elicited conversations are commonly tape-recorded for playback during analysis to see how well the person can incorporate syntax among other linguistic cues. For every utterance elicited, the utterance will receive one point if it is a correct form used in adult speech. A score of 1 indicates the least complex syntactic form in the category, whereas a higher score reflects higher level grammaticality. Points are specifically awarded to an utterance based on whether or not it contains any of the eight categories outlined below.

Syntactic categories measured by developmental sentence scoring with examples:

Indefinite pronouns 11a. Score of 1: it, this, that 11b. Score of 6: both, many, several, most, least

Personal pronouns 12a. Score of 1: I, me, my, mine, you, your(s) 12b. Score of 6: Wh-pronouns (i.e. who, which, what, how) and wh-word + infinitive (i.e. I know what to do)

Main verb 13a. Score of 1: Uninflected verb (i.e. I "see" you) and copula, is or 's (i.e. It's red) 13b. Score of 6: Must, shall + verb (i.e. He "must come" or We "shall see"), have + verb + '-en' (i.e. I have eaten)

Secondary verb 14a. Score of 1: Infinitival complements (i.e. I wan"na see" = I want to see) 14b. Score of 6: Gerund (i.e. Swinging is fun)

Negatives 15a. Score of 1: it, this or that + copula or auxiliary 'is' or 's + not (i.e. It's "not" mine) 15b. Score of 5: Uncontracted negative with 'have' (i.e. I have "not" eaten it), auxiliary'have'-negative contraction (i.e. I had"n't" eaten        it), pronoun auxiliary 'have' contraction (i.e. I've "not" eaten it)

Conjunctions 16a. Score of 1: and 16b. Score of 6: where, than, how

Interrogative reversals 17a. Score of 1: Reversal of copula (i.e. "Is it" red?) 17b. Score of 5: Reversal with three auxiliaries (i.e. "Could he" have been going?)

Wh-questions 18a. Score of 1: who or what (i.e. "What" do you mean?), what + noun (i.e. "What book" are you reading?) 18b. Score of 5: whose or which (i.e. "Which" do you want?), which + noun (i.e. "Which book" do you want?)

In particular, those categories that appear the earliest in speech receive a lower score, whereas later-appearing categories receive a higher score. If an entire sentence is correct according to adult-like forms, then the utterance would receive an extra point. The eight categories above are the most commonly used structures in syntactic formation, thus structures such as possessives, articles, plurals, prepositional phrases, adverbs and descriptive adjectives were omitted and not scored. Additionally, the scoring system is arbitrary when applied to certain structures. For example, there is no indication as to why "if" would receive four points rather than five. The scores of all the utterances are totalled in the end of the analysis and then averaged to get a final score. This means that the individual's final score reflects their entire syntactic complexity level, rather than syntactic level in a specific category. The main advantage of development sentence scoring is that the final score represents the individual's general syntactic development and allows for easier tracking of changes in language development, making this tool effective for longitudinal studies.

Index of productive syntax
Similar to Development Sentence Scoring, the Index of Productive Syntax evaluates the grammatical complexity of spontaneous language samples. After age 3, Index of Productive Syntax becomes more widely used than MLU to measure syntactic complexity in children. This is because at around age 3, MLU does not distinguish between children of similar language competency as well as Index of Productive Syntax does. For this reason, MLU is initially used in early childhood development to track syntactic ability, then Index of Productive Syntax is used to maintain validity. Individual utterances in a discourse sample are scored based on the presence of 60 different syntactic forms, placed more generally under four subscales: noun phrase, verb phrase, question/negation and sentence structure forms. After a sample is recorded, a corpus is then formed based on 100 utterance transcriptions with 60 different language structures being measured in each utterance. Not included in the corpus are imitations, self-repetitions and routines, which constitute language that does not represent productive language usage. In each of the four sub-scales previously mentioned, the first two unique occurrences of a form are scored. After this, occurrences of a sub-scale are not scored. However, if a child has mastered a complex syntax structure earlier than expected, they will receive extra points.

Standardized tests
The six main tasks in standardized testing for syntax:
 * What is the level of syntactic complexity?
 * What specific syntactic structures are found? (a syntactic content analysis)
 * Are specific structures representative of what is known about syntactic development within the age range of standardization sample?
 * What are the processing requirements of the test format? (a task analysis)
 * Are processing requirements similar to or different from language processing in more naturalistic contexts?
 * Is syntactic ability in naturalistic language predicted by performance on the test?

Some of the common standardized tests for measuring syntactic performance are the TOLD-2 Intermediate (Test of Language Development), the TOAL-2 (Test of Adolescent Language) and the CELF-R (Clinical Evaluation of Language Fundamentals, Revised Screening Test).