Phono-semantic matching

Phono-semantic matching (PSM) is the incorporation of a word into one language from another, often creating a neologism, where the word's non-native quality is hidden by replacing it with phonetically and semantically similar words or roots from the adopting language. Thus the approximate sound and meaning of the original expression in the source language are preserved, though the new expression (the PSM – the phono-semantic match) in the target language may sound native.

Phono-semantic matching is distinct from calquing, which includes (semantic) translation but does not include phonetic matching (i.e., retention of the approximate sound of the borrowed word through matching it with a similar-sounding pre-existent word or morpheme in the target language).

Phono-semantic matching is also distinct from homophonic translation, which retains the sound of a word but not the meaning.

History
The term "phono-semantic matching" was introduced by linguist and revivalist Ghil'ad Zuckermann. It challenged Einar Haugen's classic typology of lexical borrowing (loanwords). While Haugen categorized borrowing into either substitution or importation, camouflaged borrowing in the form of PSM is a case of "simultaneous substitution and importation." Zuckermann proposed a new classification of multisourced neologisms, words deriving from two or more sources at the same time. Examples of such mechanisms are phonetic matching, semanticized phonetic matching and phono-semantic matching.

Zuckermann concludes that language planners, for example members of the Academy of the Hebrew Language, employ the very same techniques used in folk etymology by laymen, as well as by religious leaders. He urges lexicographers and etymologists to recognize the widespread phenomena of camouflaged borrowing and multisourced neologization and not to force one source on multi-parental lexical items.

Arabic
Zuckermann analyses the evolution of the word artichoke. Beginning in Arabic الخرشوف "the artichoke", it was adapted into Andalusian Arabic alxarshofa, then Old Spanish alcarchofa, then Italian alcarcioffo, then Northern Italian arcicioffo > arciciocco > articiocco, then phonetically realised in English as artichoke. The word was eventually phono-semantically matched back into colloquial Levantine Arabic (for example in Syria, Lebanon and Israel) as أرضي شوكي, consisting of أرضي "earthly" and شوكي  "thorny".

Arabic has made use of phono-semantic matching to replace blatantly imported new terminology with a word derived from an existing triliteral root. Examples are:

Dutch
A number of PSMs exist in Dutch as well. One notable example is hangmat ("hammock"), which is a modification of Spanish hamaca, also the source of the English word. Natively, the word is transparently analysed as a "hang-mat", which aptly describes the object. Similarly:
 * In ansjovis ("anchovy"), the second part was modified to resemble vis ("fish"), although the word originates in Spanish anchova;
 * In scheurbuik ("scurvy"), the word parts were modified to resemble scheur- (stem of scheuren, tear open) and buik ("belly, stomach"), although the word originates in Middle Low German schorbuck;
 * In sprokkelmaand (an alternative name for februari, "February"), the first part was modified to resemble sprokkelen ("gather wood"), although the word originates in Latin spurcalia;
 * In zijdenhemdje (a variety of apple with a very soft, thin, yellow skin), the word parts were modified to resemble zijden ("silken") and hemdje ("shirt; small shirt; vest"), although the word actually denotes the place Sydenham where the apple originates.
 * Dutch dictionary Van Dale describes balkenbrij as a particularly notable example.
 * Other examples are angstvallig, dukdalf, geeuwhonger, hagedis, hondsdraf, penthouse, rederijker, rendier and zondvloed.

English
A few PSMs exist in English. The French word ("Carthusian monastery") was translated to the English charterhouse. The French word, itself an adaptation of the Choctaw name for the bowfin, has likewise been Anglicized as , although it is unrelated to the pikes. The French name for the Osage orange, bois d'arc (lit. "bow-wood"), is sometimes rendered as "bowdark".

The second part of the word muskrat was altered to match rat, replacing the original form , which derives from an Algonquian (possibly Powhatan ) word, muscascus (literally "it is red"), or from the Abenaki native word mòskwas.

The use of runagates in Psalm 68 of the Anglican Book of Common Prayer derives from phono-semantic matching between Latin renegatus and English  .

Finnish
The Finnish compound word for "jealous," mustasukkainen, literally means "black-socked" (musta "black" and sukka "sock"). However, the word is a case of a misunderstood loan translation from Swedish svartsjuk "black-sick". The Finnish word sukka fit with a close phonological equivalent to the Swedish sjuk. Similar cases are työmyyrä "hardworking person", literally "work mole", from arbetsmyra "work ant", matching myra "ant" to myyrä "mole"; and liikavarvas "clavus", literally "extra toe", from liktå < liktorn "dead thorn", matching liika "extra" to lik "dead (archaic)" and varvas "toe" to tå < torn "thorn".

German
"applies the concepts of multisourced neologisation and, more generally, camouflaged borrowing, as established by to Modern German, pursuing a twofold aim, namely to underline the significance of multisourced neologisation for language contact theory and secondly to demonstrate that together with other forms of camouflaged borrowing it remains an important borrowing mechanism in contemporary German."

Icelandic
demonstrate how Icelandic camouflages many English words by means of phono-semantic matching. For example, the Icelandic-looking word eyðni, meaning "AIDS", is a PSM of the English acronym AIDS, using the pre-existent Icelandic verb eyða, meaning "to destroy", and the Icelandic nominal suffix -ni. Similarly, the Icelandic word tækni, meaning "technology, technique", derives from tæki, meaning "tool", combined with the nominal suffix -ni, but is, in fact, a PSM of the Danish teknik (or of another derivative of Greek τεχνικός ), meaning "technology, technique". Tækni was coined in 1912 by Dr Björn Bjarnarson from Viðfjörður in the East of Iceland. It had been in little use until the 1940s, but has since become common, as a lexeme and as an element in new formations, such as raftækni, lit. "electrical technics", i.e. "electronics", tæknilegur "technical" and tæknir "technician". Other PSMs discussed in the article are beygla, bifra – bifrari, brokkál, dapur –  dapurleiki - depurð, fjárfesta - fjárfesting, heila, guðspjall, ímynd, júgurð, korréttur, Létt og laggott, musl, pallborð –  pallborðsumræður, páfagaukur, ratsjá, setur, staða, staðall –  staðla –  stöðlun, toga –  togari, uppi and veira.

Japanese
In modern Japanese, loanwords are generally represented phonetically via katakana. However, in earlier times loanwords were often represented by kanji (Chinese characters), a process called when used for phonetic matching, or  when used for semantic matching. Some of these continue to be used; the characters chosen may correspond to the sound, the meaning, or both.

In most cases the characters used were chosen only for their matching sound or only for their matching meaning. For example, in the word 寿司 (sushi), the two characters are respectively read as and, but the character 寿 means "one's natural life span" and 司 means "to administer", neither of which has anything to do with the food –  this is. Conversely, in the word 煙草 for "tobacco", the individual kanji respectively mean "smoke" and "herb", which corresponds to the meaning, while none of their possible readings have a phonetic relationship to the word  –  this is.

In some cases, however, the kanji were chosen for both their semantic and phonetic values, a form of phono-semantic matching. A stock example is 倶楽部 for "club", where the characters can be interpreted loosely in sequence as "together-fun-place" (which has since been borrowed into Chinese during the early 20th century with the same meaning, including the individual characters, but with a pronunciation that differs considerably from the original English and the Japanese, ). Another example is 合羽 for the Portuguese capa, a kind of raincoat. The characters can mean "wings coming together", as the pointed capa resembles a bird with wings folded together.

Mandarin Chinese
PSM is frequently used in Mandarin borrowings. An example is the Taiwanese Mandarin word 威而剛, which literally means "powerful and hard" and refers to Viagra, the drug for treating erectile dysfunction in men, manufactured by Pfizer.

Another example is the Mandarin form of World Wide Web, which is , which satisfies "www" and literally means "myriad dimensional net". The English word hacker has been borrowed into Mandarin as (, "dark/wicked visitor").

Modern Standard Chinese 声纳/聲納 "sonar" uses the characters 声/聲  "sound" and 纳/納  "receive, accept". The pronunciations and  are phonetically somewhat similar to the two syllables of the English word. Chinese has a large number of homo/heterotonal homophonous morphemes, which would have been a better phonetic fit than, but not nearly as good semantically – consider the syllable song (cf. 送  'deliver, carry, give (as a present)', 松  'pine; loose, slack', 耸/聳  'tower; alarm, attract' etc.), sou (cf. 搜  'search', 叟  'old man', 馊/餿  'sour, spoiled' and many others) or shou (cf. 收  'receive, accept', 受  'receive, accept', 手  'hand', 首  'head', 兽/獸  'beast', 瘦  'thin' and so forth).

According to Zuckermann, PSM in Mandarin is common in:


 * brand names, e.g., 可口可乐/可口可樂, "Coca-Cola" translates to "tasty [and] entertaining", 可乐/可樂 itself genericised to refer to any cola.
 * computer jargon, e.g., the aforementioned word for "World Wide Web".
 * technological terms, e.g., the aforementioned word for "sonar".
 * toponyms, e.g., the name 白俄罗斯/白俄羅斯, "Belarus" combines the word 白 , "White" with the name 俄罗斯/俄羅斯 , "Russia", therefore meaning "White Russia" just like the endonym "Белару́сь".

From a monolingual Chinese view, Mandarin PSM is the 'lesser evil' compared with Latin script (in digraphic writing) or code-switching (in speech). Zuckermann's exploration of PSM in Standard Chinese and Meiji-period Japanese concludes that the Chinese writing system is multifunctional: pleremic ("full" of meaning, e.g., logographic), cenemic ("empty" of meaning, e.g., phonographic - like a syllabary), and phono-logographic (simultaneously cenemic and pleremic). Zuckermann argues that Leonard Bloomfield's assertion that "a language is the same no matter what system of writing may be used" is inaccurate. "If Chinese had been written using roman letters, thousands of Chinese words would not have been coined, or would have been coined with completely different forms". Evidence of this can be seen in the Dungan language, a Chinese language that is closely related to Mandarin, but written phonetically in Cyrillic, where words are directly borrowed, often from Russian, without PSM.

A related practice is the translation of Western names into Chinese characters.

Modern Hebrew
Often in phono-semantic matching, the source language determines both the root word and the noun-pattern. This makes it difficult to determine the source language's influence on the target language morphology. For example, "the phono-semantic matcher of English dock with Israeli Hebrew mivdók could have used – after deliberately choosing the phonetically and semantically suitable root   meaning 'check' (Rabbinic) or 'repair' (Biblical) –  the noun-patterns mi⌂⌂a⌂á, ma⌂⌂e⌂á, mi⌂⌂é⌂et, mi⌂⌂a⌂áim etc. (each ⌂ represents a slot where a radical is inserted). Instead, mi⌂⌂ó⌂, which was not highly productive, was chosen because its [o] makes the final syllable of  mivdók sound like English dock."

Miscellaneous
The Hebrew name יְרוּשָׁלַיִם (Yərūšālayim) for Jerusalem is rendered as Ἱεροσόλυμα (Hierosóluma) in, e.g. Matthew 2:1. The first part corresponds to the Ancient Greek prefix ἱερo- (hiero-), meaning "sacred, holy".

Old High German widarlōn ("repayment of a loan") was rendered as widerdonum ("reward") in Medieval Latin. The last part corresponds to the Latin donum ("gift").

Viagra, a brand name which was suggested by Interbrand Wood (the consultancy firm hired by Pfizer), is itself a multisourced neologism, based on Sanskrit व्याघ्र  ("tiger") but enhanced by the words vigour (i.e. strength) and Niagara (i.e. free/forceful flow).

Other than through Sinoxenic borrowings, Vietnamese employs phono-semantic matching less commonly than Chinese. Examples include ma trận ("matrix", from the words for "magic" and "battle array"), áp dụng ("apply", from the words for "press down" and "use"), and (Huey P. Long, from "yellow flying dragon", evoking the Huey P. Long Bridge).

Motivations
According to Zuckermann, PSM has various advantages from the point of view of a puristic language planner:


 * recycling obsolete lexical items
 * camouflaging foreign influence (for the native speaker in the future)
 * facilitating initial learning (mnemonics) (for the contemporary learner/speaker)

Other motivations for PSM include the following:


 * playfulness (cf. midrashic tradition of homiletic commentary, cf. the Jewish pilpul)
 * Apollonianism (the wish to create order/meaningfulness, cf. folk etymology, etymythology, paronymic attraction)
 * iconicity (the belief that there is something intrinsic about the sound of names; cf. phonaesthetics)
 * political correctness / rejective lexical engineering
 * attracting customers (in the case of brand names)

Expressive loan
An expressive loan is a loanword incorporated into the expressive system of the borrowing language, making it resemble native words or onomatopoeia. Expressive loanwords are hard to identify, and by definition, they follow the common phonetic sound change patterns poorly. Likewise, there is a continuum between "pure" loanwords and "expressive" loanwords. The difference to a folk etymology (or an eggcorn) is that a folk etymology is based on misunderstanding, whereas an expressive loan is changed on purpose, the speaker taking the loanword knowing full well that the descriptive quality is different from the original sound and meaning.

South-eastern Finnish, for example, has many expressive loans. The main source language, Russian, does not use the vowels 'y', 'ä' or 'ö' [y æ ø]. Thus, it is common to add these to redescriptivized loans to remove the degree of foreignness that the loanword would otherwise have. For example, tytinä "brawn" means "wobblyness", and superficially it looks like a native construction, originating from the verb tutista "to wobble" added with a front vowel sound in the vowel harmony. However, it is expressivized from tyyteni (which is a confusing word as -ni is a possessive suffix), which in turn is a loanword from Russian stúden' . A somewhat more obvious example is tökötti "sticky, tarry goo", which could be mistaken as a derivation from the onomatopoetic word tök (cf. the verb tökkiä "to poke"). However, it is an expressive loan of Russian d'ogot' "tar".