Urdu

Urdu (, ; ALA-LC: ) is an Indo-Aryan language spoken chiefly in South Asia. It is the national language and lingua franca of Pakistan, where it is also an official language alongside English. In India, Urdu is an Eighth Schedule language, the status and cultural heritage of which are recognised by the Constitution of India; and it also has an official status in several Indian states. In Nepal, Urdu is a registered regional dialect and in South Africa it is a protected language in the constitution. It is also spoken as a minority language in Afghanistan and Bangladesh, with no official status.

Urdu has been described as a Persianised register of the Hindustani language; Urdu and Hindi share a common Sanskrit- and Prakrit-derived vocabulary base, phonology, syntax, and grammar, making them mutually intelligible during colloquial communication. While formal Urdu draws literary, political, and technical vocabulary from Persian, formal Hindi draws these aspects from Sanskrit; consequently, the two languages' mutual intelligibility effectively decreases as the factor of formality increases.

In 1837, Urdu became an official language of the British East India Company, replacing Persian across northern India during Company rule; Persian had until this point served as the court language of various Indo-Islamic empires. Religious, social, and political factors arose during the European colonial period that advocated a distinction between Urdu and Hindi, leading to the Hindi–Urdu controversy.

According to 2022 estimates by Ethnologue and The World Factbook, produced by the Central Intelligence Agency (CIA), Urdu is the 10th-most widely spoken language in the world, with million total speakers, including those who speak it as a second language.

Etymology
The name Urdu was first used by the poet Ghulam Hamadani Mushafi around 1780 for Hindustani language even though he himself also used Hindavi term in his poetry to define the language. Ordu means army in the Turkic languages. In late 18th century, it was known as Zaban-e-Urdu-e-Mualla  means language of the exalted camp. Earlier it was known as Hindvi, Hindi and Hindustani.

History
Urdu, like Hindi, is a form of Hindustani language. Some linguists have suggested that the earliest forms of Urdu evolved from the medieval (6th to 13th century) Apabhraṃśa register of the preceding Shauraseni language, a Middle Indo-Aryan language that is also the ancestor of other modern Indo-Aryan languages.

Origins
In the Delhi region of India the native language was Khariboli, whose earliest form is known as Old Hindi (or Hindavi).  It belongs to the Western Hindi group of the Central Indo-Aryan languages. The contact of Hindu and Muslim cultures during the period of Islamic conquests in the Indian subcontinent (12th to 16th centuries) led to the development of Hindustani as a product of a composite Ganga-Jamuni tehzeeb.

In cities such as Delhi, the ancient language Old Hindi began to acquire many Persian loanwords and continued to be called "Hindi" and later, also "Hindustani". An early literary tradition of Hindavi was founded by Amir Khusrau in the late 13th century. After the conquest of the Deccan, and a subsequent immigration of noble Muslim families into the south, a form of the language flourished in medieval India as a vehicle of poetry, (especially under the Bahmanids), and is known as Dakhini, which contains loanwords from Telugu and Marathi.

From the 13th century until the end of the 18th century; the language now known as Urdu was called Hindi, Hindavi, Hindustani, Dehlavi, Dihlawi, Lahori, and Lashkari. The Delhi Sultanate established Persian as its official language in India, a policy continued by the Mughal Empire, which extended over most of northern South Asia from the 16th to 18th centuries and cemented Persian influence on Hindustani.

According to the Navadirul Alfaz by Khan-i Arzu, the "Zaban-e Urdu-e Shahi" [language of the Imperial Camp] had attained special importance in the time of Alamgir". By the end of the reign of Aurangzeb in the early 1700s, the common language around Delhi began to be referred to as Zaban-e-Urdu, a name derived from the Turkic word ordu (army) or orda and is said to have arisen as the "language of the camp", or "Zaban-i-Ordu" means "Language of High camps" or natively "Lashkari Zaban" means "Language of Army" even though term Urdu held different meanings at that time. It is recorded that Aurangzeb spoke in Hindvi, which was most likely Persianized, as there are substantial evidence that Hindvi was written in the Persian script in this period.

During this time period Urdu was referred to as "Moors", which simply meant Muslim, by European writers. John Ovington wrote in 1689:

"The language of the Moors is different from that of the ancient original inhabitants of India but is obliged to these Gentiles for its characters. For though the Moors dialect is peculiar to themselves, yet it is destitute of Letters to express it; and therefore, in all their Writings in their Mother Tongue, they borrow their letters from the Heathens, or from the Persians, or other Nations."

In 1715, a complete literary Diwan in Rekhta was written by Nawab Sadruddin Khan. An Urdu-Persian dictionary was written by Khan-i Arzu in 1751 in the reign of Ahmad Shah Bahadur. The name Urdu was first introduced by the poet Ghulam Hamadani Mushafi around 1780. As a literary language, Urdu took shape in courtly, elite settings. While Urdu retained the grammar and core Indo-Aryan vocabulary of the local Indian dialect Khariboli, it adopted the Nastaleeq writing system  – which was developed as a style of Persian calligraphy.

Other historical names
Throughout the history of the language, Urdu has been referred to by several other names: Hindi, Hindavi, Rekhta, Urdu-e-Muallah, Dakhini, Moors and Dehlavi.

In 1773, the Swiss French soldier Antoine Polier notes that the English liked to use the name "Moors" for Urdu:

I have a deep knowledge [je possède à fond] of the common tongue of India, called Moors by the English, and Ourdouzebain by the natives of the land.

Several works of Sufi writers like Ashraf Jahangir Semnani used similar names for the Urdu language. Shah Abdul Qadir Raipuri was the first person who translated The Quran into Urdu.

During Shahjahan's time, the Capital was relocated to Delhi and named Shahjahanabad and the Bazar of the town was named Urdu e Muallah.

In the Akbar era the word Rekhta was used to describe Urdu for the first time. It was originally a Persian word that meant "to create a mixture". Amir Khusrau was the first person to use the same word for Poetry.

Colonial period
Before the standardisation of Urdu into colonial administration, British officers often referred to the language as "Moors" or "Moorish jargon". John Gilchrist was the first in British India to begin a systematic study on Urdu and began to use the term "Hindustani" what the majority of Europeans called "Moors", authoring the book The Strangers's East Indian Guide to the Hindoostanee or Grand Popular Language of India (improperly Called Moors).

Urdu was then promoted in colonial India by British policies to counter the previous emphasis on Persian. In colonial India, "ordinary Muslims and Hindus alike spoke the same language in the United Provinces in the nineteenth century, namely Hindustani, whether called by that name or whether called Hindi, Urdu, or one of the regional dialects such as Braj or Awadhi." Elites from Muslim communities, as well as a minority of Hindu elites, such as Munshis of Hindu origin, wrote the language in the Perso-Arabic script in courts and government offices, though Hindus continued to employ the Devanagari script in certain literary and religious contexts. Through the late 19th century, people did not view Urdu and Hindi as being two distinct languages, though in urban areas, the standardised Hindustani language was increasingly being referred to as Urdu and written in the Perso-Arabic script. Urdu and English replaced Persian as the official languages in northern parts of India in 1837. In colonial Indian Islamic schools, Muslims were taught Persian and Arabic as the languages of Indo-Islamic civilisation; the British, in order to promote literacy among Indian Muslims and attract them to attend government schools, started to teach Urdu written in the Perso-Arabic script in these governmental educational institutions and after this time, Urdu began to be seen by Indian Muslims as a symbol of their religious identity. Hindus in northwestern India, under the Arya Samaj agitated against the sole use of the Perso-Arabic script and argued that the language should be written in the native Devanagari script, which triggered a backlash against the use of Hindi written in Devanagari by the Anjuman-e-Islamia of Lahore. Hindi in the Devanagari script and Urdu written in the Perso-Arabic script established a sectarian divide of "Urdu" for Muslims and "Hindi" for Hindus, a divide that was formalised with the partition of colonial India into the Dominion of India and the Dominion of Pakistan after independence (though there are Hindu poets who continue to write in Urdu, including Gopi Chand Narang and Gulzar).

Urdu had been used as a literary medium for British colonial Indian writers from the Bombay, Bengal, Orissa, and Hyderabad State as well.

Post-Partition
Before independence, Muslim League leader Muhammad Ali Jinnah advocated the use of Urdu, which he used as a symbol of national cohesion in Pakistan. After the Bengali language movement and the separation of former East Pakistan, Urdu was recognised as the sole national language of Pakistan in 1973, although English and regional languages were also granted official recognition. Following the 1979 Soviet Invasion of Afghanistan and subsequent arrival of millions of Afghan refugees who have lived in Pakistan for many decades, many Afghans, including those who moved back to Afghanistan, have also become fluent in Hindi-Urdu, an occurrence aided by exposure to the Indian media, chiefly Hindi-Urdu Bollywood films and songs.

There have been attempts to purge Urdu of native Prakrit and Sanskrit words, and Hindi of Persian loanwords – new vocabulary draws primarily from Persian and Arabic for Urdu and from Sanskrit for Hindi. English has exerted a heavy influence on both as a co-official language. According to Bruce (2021), Urdu has adapted English words since the eighteenth century. A movement towards the hyper-Persianisation of an Urdu emerged in Pakistan since its independence in 1947 which is "as artificial as" the hyper-Sanskritised Hindi that has emerged in India; hyper-Persianisation of Urdu was prompted in part by the increasing Sanskritisation of Hindi. However, the style of Urdu spoken on a day-to-day basis in Pakistan is akin to neutral Hindustani that serves as the lingua franca of the northern Indian subcontinent.

Since at least 1977, some commentators such as journalist Khushwant Singh have characterised Urdu as a "dying language", though others, such as Indian poet and writer Gulzar (who is popular in both countries and both language communities, but writes only in Urdu (script) and has difficulties reading Devanagari, so he lets others 'transcribe' his work) have disagreed with this assessment and state that Urdu "is the most alive language and moving ahead with times" in India. This phenomenon pertains to the decrease in relative and absolute numbers of native Urdu speakers as opposed to speakers of other languages; declining (advanced) knowledge of Urdu's Perso-Arabic script, Urdu vocabulary and grammar; the role of translation and transliteration of literature from and into Urdu; the shifting cultural image of Urdu and socio-economic status associated with Urdu speakers (which negatively impacts especially their employment opportunities in both countries),  the de jure legal status and de facto political status of Urdu, how much Urdu is used as language of instruction and chosen by students in higher education,   and how the maintenance and development of Urdu is financially and institutionally supported by governments and NGOs. In India, although Urdu is not and never was used exclusively by Muslims (and Hindi never exclusively by Hindus), the ongoing Hindi–Urdu controversy and modern cultural association of each language with the two religions has led to fewer Hindus using Urdu. In the 20th century, Indian Muslims gradually began to collectively embrace Urdu (for example, 'post-independence Muslim politics of Bihar saw a mobilisation around the Urdu language as tool of empowerment for minorities especially coming from weaker socio-economic backgrounds' ), but in the early 21st century an increasing percentage of Indian Muslims began switching to Hindi due to socio-economic factors, such as Urdu being abandoned as the language of instruction in much of India, and having limited employment opportunities compared to Hindi, English and regional languages. The number of Urdu speakers in India fell 1.5% between 2001 and 2011 (then 5.08 million Urdu speakers), especially in the most Urdu-speaking states of Uttar Pradesh (c. 8% to 5%) and Bihar (c. 11.5% to 8.5%), even though the number of Muslims in these two states grew in the same period. Although Urdu is still very prominent in early 21st-century Indian pop culture, ranging from Bollywood to social media, knowledge of the Urdu script and the publication of books in Urdu have steadily declined, while policies of the Indian government do not actively support the preservation of Urdu in professional and official spaces. Because the Pakistani government proclaimed Urdu the national language at Partition, the Indian state and some religious nationalists began in part to regard Urdu as a 'foreign' language, to be viewed with suspicion. Urdu advocates in India disagree whether it should be allowed to write Urdu in the Devanagari and Latin script (Roman Urdu) to allow its survival, or whether this will only hasten its demise and that the language can only be preserved if expressed in the Perso-Arabic script.

For Pakistan, Willoughby & Aftab (2020) argued that Urdu originally had the image of a refined elite language of the Enlightenment, progress and emancipation, which contributed to the success of the independence movement. But after the 1947 Partition, when it was chosen as the national language of Pakistan to unite all inhabitants with one linguistic identity, it faced serious competition primarily from Bengali (spoken by 56% of the total population, mostly in East Pakistan until that attained independence in 1971 as Bangladesh), and after 1971 from English. Both pro-independence elites that formed the leadership of the Muslim League in Pakistan and the Hindu-dominated Congress Party in India had been educated in English during the British colonial period, and continued to operate in English and send their children to English-medium schools as they continued dominate both countries' post-Partition politics. Although the Anglicized elite in Pakistan has made attempts at Urduisation of education with varying degrees of success, no successful attempts were ever made to Urduise politics, the legal system, the army, or the economy, all of which remained solidly Anglophone. Even the regime of general Zia-ul-Haq (1977–1988), who came from a middle-class Punjabi family and initially fervently supported a rapid and complete Urduisation of Pakistani society (earning him the honorary title of the 'Patron of Urdu' in 1981), failed to make significant achievements, and by 1987 had abandoned most of his efforts in favour of pro-English policies. Since the 1960s, the Urdu lobby and eventually the Urdu language in Pakistan has been associated with religious Islamism and political national conservatism (and eventually the lower and lower-middle classes, alongside regional languages such as Punjabi, Sindhi, and Balochi), while English has been associated with the internationally oriented secular and progressive left (and eventually the upper and upper-middle classes). Despite governmental attempts at Urduisation of Pakistan, the position and prestige of English only grew stronger in the meantime.

Demographics and geographic distribution
There are over 100 million native speakers of Urdu in India and Pakistan together: there were 50.8 million Urdu speakers in India (4.34% of the total population) as per the 2011 census; and approximately 16 million in Pakistan in 2006. There are several hundred thousand in the United Kingdom, Saudi Arabia, United States, and Bangladesh. However, Hindustani, of which Urdu is one variety, is spoken much more widely, forming the third most commonly spoken language in the world, after Mandarin and English. The syntax (grammar), morphology, and the core vocabulary of Urdu and Hindi are essentially identical – thus linguists usually count them as one single language, while some contend that they are considered as two different languages for socio-political reasons.

Owing to interaction with other languages, Urdu has become localised wherever it is spoken, including in Pakistan. Urdu in Pakistan has undergone changes and has incorporated and borrowed many words from regional languages, thus allowing speakers of the language in Pakistan to distinguish themselves more easily and giving the language a decidedly Pakistani flavor. Similarly, the Urdu spoken in India can also be distinguished into many dialects such as the Standard Urdu of Lucknow and Delhi, as well as the Dakhni (Deccan) of South India. Because of Urdu's similarity to Hindi, speakers of the two languages can easily understand one another if both sides refrain from using literary vocabulary.

Pakistan
Although Urdu is widely spoken and understood throughout all of Pakistan, only 7% of Pakistan's population spoke Urdu as their native language around 1992. Most of the nearly three million Afghan refugees of different ethnic origins (such as Pashtun, Tajik, Uzbek, Hazarvi, and Turkmen) who stayed in Pakistan for over twenty-five years have also become fluent in Urdu. Muhajirs since 1947 have historically formed the majority population in the city of Karachi, however. Many newspapers are published in Urdu in Pakistan, including the Daily Jang, Nawa-i-Waqt, and Millat.

No region in Pakistan uses Urdu as its mother tongue, though it is spoken as the first language of Muslim migrants (known as Muhajirs) in Pakistan who left India after independence in 1947. Other communities, most notably the Punjabi elite of Pakistan, have adopted Urdu as a mother tongue and identify with both an Urdu speaker as well as Punjabi identity. Urdu was chosen as a symbol of unity for the new state of Pakistan in 1947, because it had already served as a lingua franca among Muslims in north and northwest British India. It is written, spoken and used in all provinces/territories of Pakistan, and together with English as the main languages of instruction, although the people from differing provinces may have different native languages.

Urdu is taught as a compulsory subject up to higher secondary school in both English and Urdu medium school systems, which has produced millions of second-language Urdu speakers among people whose native language is one of the other languages of Pakistan – which in turn has led to the absorption of vocabulary from various regional Pakistani languages, while some Urdu vocabularies has also been assimilated by Pakistan's regional languages. Some who are from a non-Urdu background now can read and write only Urdu. With such a large number of people(s) speaking Urdu, the language has acquired a peculiar Pakistani flavor further distinguishing it from the Urdu spoken by native speakers, resulting in more diversity within the language.

India
In India, Urdu is spoken in places where there are large Muslim minorities or cities that were bases for Muslim empires in the past. These include parts of Uttar Pradesh, Madhya Pradesh, Bihar, Telangana, Andhra Pradesh, Maharashtra (Marathwada and Konkanis), Karnataka and cities such as Hyderabad, Lucknow, Delhi, Malerkotla, Bareilly, Meerut, Saharanpur, Muzaffarnagar, Roorkee, Deoband, Moradabad, Azamgarh, Bijnor, Najibabad, Rampur, Aligarh, Allahabad, Gorakhpur, Agra, Firozabad, Kanpur, Badaun, Bhopal, Hyderabad, Aurangabad, Bangalore, Kolkata, Mysore, Patna, Darbhanga, Gaya, Madhubani, Samastipur, Siwan, Saharsa, Supaul, Muzaffarpur, Nalanda, Munger, Bhagalpur, Araria, Gulbarga, Parbhani, Nanded, Malegaon, Bidar, Ajmer, and Ahmedabad. In a very significant amount among the nearly 800 districts of India, there is a small Urdu-speaking minority at least. In Araria district, Bihar, there is a plurality of Urdu speakers and near-plurality in Hyderabad district, Telangana (43.35% Telugu speakers and 43.24% Urdu speakers).

Some Indian Muslim schools (Madrasa) teach Urdu as a first language and have their own syllabi and exams. In fact, the language of Bollywood films tend to contain a large number of Persian and Arabic words and thus considered to be "Urdu" in a sense, especially in songs.

India has more than 3,000 Urdu publications, including 405 daily Urdu newspapers. Newspapers such as Neshat News Urdu, Sahara Urdu, Daily Salar, Hindustan Express, Daily Pasban, Siasat Daily, The Munsif Daily and Inqilab are published and distributed in Bangalore, Malegaon, Mysore, Hyderabad, and Mumbai.

Elsewhere


Outside South Asia, it is spoken by large numbers of migrant South Asian workers in the major urban centres of the Persian Gulf countries. Urdu is also spoken by large numbers of immigrants and their children in the major urban centres of the United Kingdom, the United States, Canada, Germany, New Zealand, Norway, and Australia. Along with Arabic, Urdu is among the immigrant languages with the most speakers in Catalonia.

Colonial India
Religious and social atmospheres in early nineteenth century India played a significant role in the development of the Urdu register. Hindi became the distinct register spoken by those who sought to construct a Hindu identity in the face of colonial rule. As Hindi separated from Hindustani to create a distinct spiritual identity, Urdu was employed to create a definitive Islamic identity for the Muslim population in India. Urdu's use was not confined only to northern India – it had been used as a literary medium for Indian writers from the Bombay Presidency, Bengal, Orissa Province, and Tamil Nadu as well.

As Urdu and Hindi became means of religious and social construction for Muslims and Hindus respectively, each register developed its own script. According to Islamic tradition, Arabic, the language of Muhammad and the Qur'an, holds spiritual significance and power. Because Urdu was intentioned as means of unification for Muslims in Northern India and later Pakistan, it adopted a modified Perso-Arabic script.

Pakistan
Urdu continued its role in developing a Pakistani identity as the Islamic Republic of Pakistan was established with the intent to construct a homeland for the Muslims of Colonial India. Several languages and dialects spoken throughout the regions of Pakistan produced an imminent need for a uniting language. Urdu was chosen as a symbol of unity for the new Dominion of Pakistan in 1947, because it had already served as a lingua franca among Muslims in north and northwest of British Indian Empire. Urdu is also seen as a repertory for the cultural and social heritage of Pakistan.

While Urdu and Islam together played important roles in developing the national identity of Pakistan, disputes in the 1950s (particularly those in East Pakistan, where Bengali was the dominant language), challenged the idea of Urdu as a national symbol and its practicality as the lingua franca. The significance of Urdu as a national symbol was downplayed by these disputes when English and Bengali were also accepted as official languages in the former East Pakistan (now Bangladesh).

Pakistan
Urdu is the sole national, and one of the two official languages of Pakistan (along with English). It is spoken and understood throughout the country, whereas the state-by-state languages (languages spoken throughout various regions) are the provincial languages, although only 7.57% of Pakistanis speak Urdu as their first language. Its official status has meant that Urdu is understood and spoken widely throughout Pakistan as a second or third language. It is used in education, literature, office and court business, although in practice, English is used instead of Urdu in the higher echelons of government. Article 251(1) of the Pakistani Constitution mandates that Urdu be implemented as the sole language of government, though English continues to be the most widely used language at the higher echelons of Pakistani government.

India
Urdu is also one of the officially recognised languages in India and also has the status of "additional official language" in the Indian states of Andhra Pradesh, Uttar Pradesh, Bihar, Jharkhand, West Bengal, Telangana and the national capital territory Delhi. Also as one of the five official languages of Jammu and Kashmir.

India established the governmental Bureau for the Promotion of Urdu in 1969, although the Central Hindi Directorate was established earlier in 1960, and the promotion of Hindi is better funded and more advanced, while the status of Urdu has been undermined by the promotion of Hindi. Private Indian organisations such as the Anjuman-e-Tariqqi Urdu, Deeni Talimi Council and Urdu Mushafiz Dasta promote the use and preservation of Urdu, with the Anjuman successfully launching a campaign that reintroduced Urdu as an official language of Bihar in the 1970s. In the former Jammu and Kashmir state, section 145 of the Kashmir Constitution stated: "The official language of the State shall be Urdu but the English language shall unless the Legislature by law otherwise provides, continue to be used for all the official purposes of the State for which it was being used immediately before the commencement of the Constitution."

Dialects
Urdu became a literary language in the 18th century and two similar standard forms came into existence in Delhi and Lucknow. Since the partition of India in 1947, a third standard has arisen in the Pakistani city of Karachi. Deccani, an older form used in southern India, became a court language of the Deccan sultanates by the 16th century. Urdu has a few recognised dialects, including Dakhni, Dhakaiya, Rekhta, and Modern Vernacular Urdu (based on the Khariboli dialect of the Delhi region). Dakhni (also known as Dakani, Deccani, Desia, Mirgan) is spoken in Deccan region of southern India. It is distinct by its mixture of vocabulary from Marathi and Konkani, as well as some vocabulary from Arabic, Persian and Chagatai that are not found in the standard dialect of Urdu. Dakhini is widely spoken in all parts of Maharashtra, Telangana, Andhra Pradesh and Karnataka. Urdu is read and written as in other parts of India. A number of daily newspapers and several monthly magazines in Urdu are published in these states.

Dhakaiya Urdu is a dialect native to the city of Old Dhaka in Bangladesh, dating back to the Mughal era. However, its popularity, even amongst native speakers, has been gradually declining since the Bengali Language Movement in the 20th century. It is not officially recognised by the Government of Bangladesh. The Urdu spoken by Stranded Pakistanis in Bangladesh is different from this dialect.

Code switching
Many bilingual or multi-lingual Urdu speakers, being familiar with both Urdu and English, display code-switching (referred to as "Urdish") in certain localities and between certain social groups. On 14 August 2015, the Government of Pakistan launched the Ilm Pakistan movement, with a uniform curriculum in Urdish. Ahsan Iqbal, Federal Minister of Pakistan, said "Now the government is working on a new curriculum to provide a new medium to the students which will be the combination of both Urdu and English and will name it Urdish."

Comparison with Modern Standard Hindi


Standard Urdu is often compared with Standard Hindi. Both Urdu and Hindi, which are considered standard registers of the same language, Hindustani (or Hindi-Urdu), share a core vocabulary and grammar.

Apart from religious associations, the differences are largely restricted to the standard forms: Standard Urdu is conventionally written in the Nastaliq style of the Persian alphabet and relies heavily on Persian and Arabic as a source for technical and literary vocabulary, whereas Standard Hindi is conventionally written in Devanāgarī and draws on Sanskrit. However, both share a core vocabulary of native Sanskrit and Prakrit derived words and a significant amount of Arabic and Persian loanwords, with a consensus of linguists considering them to be two standardised forms of the same language and consider the differences to be sociolinguistic; a few classify them separately. The two languages are often considered to be a single language (Hindustani or Hindi-Urdu) on a dialect continuum ranging from Persianised to Sanskritised vocabulary, but now they are more and more different in words due to politics. Old Urdu dictionaries also contain most of the Sanskrit words now present in Hindi.

Mutual intelligibility decreases in literary and specialised contexts that rely on academic or technical vocabulary. In a longer conversation, differences in formal vocabulary and pronunciation of some Urdu phonemes are noticeable, though many native Hindi speakers also pronounce these phonemes. At a phonological level, speakers of both languages are frequently aware of the Perso-Arabic or Sanskrit origins of their word choice, which affects the pronunciation of those words. Urdu speakers will often insert vowels to break up consonant clusters found in words of Sanskritic origin, but will pronounce them correctly in Arabic and Persian loanwords. As a result of religious nationalism since the partition of British India and continued communal tensions, native speakers of both Hindi and Urdu frequently assert that they are distinct languages.

The grammar of Hindi and Urdu is shared, though formal Urdu makes more use of the Persian "-e-" izafat grammatical construct (as in Hammam-e-Qadimi, or Nishan-e-Haider) than does Hindi.

Urdu speakers by country
The following table shows the number of Urdu speakers in some countries.

Consonants

 * Notes


 * Marginal and non-universal phonemes are in parentheses.
 * is post-velar.

Vowels

 * Notes


 * This table contains a list of phones, not phonemes. In particular, [ɛ] is an allophone of /ə/ near /h/, and the short nasal vowels are not phonemic either.
 * Marginal and non-universal vowels are in parentheses.

Vocabulary
Syed Ahmed Dehlavi, a 19th-century lexicographer who compiled the Farhang-e-Asifiya Urdu dictionary, estimated that 75% of Urdu words have their etymological roots in Sanskrit and Prakrit,  and approximately 99% of Urdu verbs have their roots in Sanskrit and Prakrit. Urdu has borrowed words from Persian and to a lesser extent, Arabic through Persian, to the extent of about 25%  to 30% of Urdu's vocabulary. A table illustrated by the linguist Afroz Taj of the University of North Carolina at Chapel Hill likewise illustrates the amount of Persian loanwords to native Sanskrit-derived words in literary Urdu as comprising a 1:3 ratio.



The "trend towards Persianisation" started in the 18th century by the Delhi school of Urdu poets, though other writers, such as Meeraji, wrote in a Sanskritised form of the language. There has been a move towards hyper Persianisation in Pakistan since 1947, which has been adopted by much of the country's writers; as such, some Urdu texts can be composed of 70% Perso-Arabic loanwords just as some Persian texts can have 70% Arabic vocabulary. Some Pakistani Urdu speakers have incorporated Hindi vocabulary into their speech as a result of exposure to Indian entertainment. In India, Urdu has not diverged from Hindi as much as it has in Pakistan.

Most borrowed words in Urdu are nouns and adjectives. Many of the words of Arabic origin have been adopted through Persian, and have different pronunciations and nuances of meaning and usage than they do in Arabic. There are also a smaller number of borrowings from Portuguese. Some examples for Portuguese words borrowed into Urdu are chabi ("chave": key), girja ("igreja": church), kamra ("cámara": room), qamīz ("camisa": shirt).

Although the word Urdu is derived from the Turkic word ordu (army) or orda, from which English horde is also derived, Turkic borrowings in Urdu are minimal and Urdu is also not genetically related to the Turkic languages. Urdu words originating from Chagatai and Arabic were borrowed through Persian and hence are Persianised versions of the original words. For instance, the Arabic ta' marbuta ( ة ) changes to he or te. Nevertheless, contrary to popular belief, Urdu did not borrow from the Turkish language, but from Chagatai, a Turkic language from Central Asia. Urdu and Turkish both borrowed from Arabic and Persian, hence the similarity in pronunciation of many Urdu and Turkish words.

Formality
Urdu in its less formalised register is known as rekhta (rek̤h̤tah, ); the more formal register is sometimes referred to as zabān-i Urdū-yi muʿallá or lashkari zabān, referring to the Imperial army or simply Lashkari. The etymology of the word used in Urdu, for the most part, decides how polite or refined one's speech is. For example, Urdu speakers distinguish between pānī and āb, both meaning water. The former is used colloquially and has older Sanskrit origins; the latter is used formally and poetically, being of Persian origin.

If a word is of Persian or Arabic origin, the level of speech is considered to be more formal and grander. Similarly, if Persian or Arabic grammar constructs, such as the izafat, are used in Urdu, the level of speech is also considered more formal. If a word is inherited from Sanskrit, the level of speech is considered more colloquial and personal.

Writing system
Urdu is written right-to left in an extension of the Persian alphabet, which is itself an extension of the Arabic alphabet. Urdu is associated with the Nastaʿlīq style of Persian calligraphy, whereas Arabic is generally written in the Naskh or Ruq'ah styles. Because of its thousands of ligatures, Nasta’liq is notoriously difficult to typeset, so Urdu newspapers were hand-written by masters of calligraphy, known as kātib or  kh ush-nawīs, until the late 1980s. One handwritten Urdu newspaper, The Musalman, is still published daily in Chennai. InPage, a widely used desktop publishing tool for Urdu, has over 20,000 ligatures in its Nastaʿliq computer fonts.

A highly Persianised and technical form of Urdu was the lingua franca of the law courts of the British administration in Bengal and the North-West Provinces & Oudh. Until the late 19th century, all proceedings and court transactions in this register of Urdu were written officially in the Persian script. In 1880, Sir Ashley Eden, the Lieutenant-Governor of Bengal in colonial India abolished the use of the Persian alphabet in the law courts of Bengal and ordered the exclusive use of Kaithi, a popular script used for both Urdu and Hindi; in the Bihar Province, the court language was Urdu written in the Kaithi script. Kaithi's association with Urdu and Hindi was ultimately eliminated by the political contest between these languages and their scripts, in which the Persian script was definitively linked to Urdu. More recently in India, Urdu speakers have adopted Devanagari for publishing Urdu periodicals and have innovated new strategies to mark Urdu in Devanagari as distinct from Hindi in Devanagari. Such publishers have introduced new orthographic features into Devanagari for the purpose of representing the Perso-Arabic etymology of Urdu words. One example is the use of अ (Devanagari a) with vowel signs to mimic contexts of (‘ain), in violation of Hindi orthographic rules. For Urdu publishers, the use of Devanagari gives them a greater audience, whereas the orthographic changes help them preserve a distinct identity of Urdu.

Some poets from Bengal, namely Qazi Nazrul Islam, have historically used the Bengali script to write Urdu poetry like Prem Nagar Ka Thikana Karle and Mera Beti Ki Khela, as well as bilingual Bengali-Urdu poems like Alga Koro Go Khõpar Bãdhon, Juboker Chholona and Mera Dil Betab Kiya. Dhakaiya Urdu is a colloquial non-standard dialect of Urdu which was typically not written. However, organisations seeking to preserve the dialect have begun transcribing the dialect in the Bengali script.