Swadesh list

The Swadesh list is a compilation of tentatively universal concepts for the purposes of lexicostatistics. Translations of the Swadesh list into a set of languages allow researchers to quantify the interrelatedness of those languages. The Swadesh list is named after linguist Morris Swadesh. It is used in lexicostatistics (the quantitative assessment of the genealogical relatedness of languages) and glottochronology (the dating of language divergence). Because there are several different lists, some authors also refer to "Swadesh lists".

Versions and authors
Morris Swadesh created several versions of his list. He started with a list of 215 meanings (falsely introduced as a list of 225 meanings in the paper due to a spelling error ), which he reduced to 165 words for the Salish-Spokane-Kalispel language. In 1952, he published a list of 215 meanings, of which he suggested the removal of 16 for being unclear or not universal, with one added to arrive at 200 words. In 1955, he wrote, "The only solution appears to be a drastic weeding out of the list, in the realization that quality is at least as important as quantity. Even the new list has defects, but they are relatively mild and few in number." After minor corrections, the final 100-word list was published posthumously in 1971 and 1972.

Other versions of lexicostatistical test lists were published e.g. by Robert Lees (1953), John A. Rea (1958:145f), Dell Hymes (1960:6), E. Cross (1964 with 241 concepts), W. J. Samarin (1967:220f), D. Wilson (1969 with 57 meanings), Lionel Bender (1969), R. L. Oswald (1971), Winfred P. Lehmann (1984:35f), D. Ringe (1992, passim, different versions), Sergei Starostin (1984, passim, different versions), William S-Y. Wang (1994), M. Lohr (2000, 128 meanings in 18 languages). B. Kessler (2002), and many others. The Concepticon, a project hosted at the Cross-Linguistic Linked Data (CLLD) project, collects various concept lists (including classical Swadesh lists) across different linguistic areas and times, currently listing 240 different concept lists.

Frequently used and widely available on the internet, is the version by Isidore Dyen (1992, 200 meanings of 95 language variants). Since 2010, a team around Michael Dunn has tried to update and enhance that list.

Principle
In origin, the words in the Swadesh lists were chosen for their universal, culturally independent availability in as many languages as possible, regardless of their "stability". Nevertheless, the stability of the resulting list of "universal" vocabulary under language change and the potential use of this fact for purposes of glottochronology have been analyzed by numerous authors, including Marisa Lohr 1999, 2000.

The Swadesh list was put together by Morris Swadesh on the basis of his intuition. Similar more recent lists, such as the Dolgopolsky list (1964) or the Leipzig–Jakarta list (2009), are based on systematic data from many different languages, but they are not yet as widely known nor as widely used as the Swadesh list.

Usage in lexicostatistics and glottochronology
Lexicostatistical test lists are used in lexicostatistics to define subgroupings of languages, and in glottochronology to "provide dates for branching points in the tree". The task of defining (and counting the number) of cognate words in the list is far from trivial, and often is subject to dispute, because cognates do not necessarily look similar, and recognition of cognates presupposes knowledge of the sound laws of the respective languages.

Swadesh 100 original final list
Swadesh's final list, published in 1971, contains 100 terms. Explanations of the terms can be found in Swadesh 1952 or, where noted by a dagger (†), in Swadesh 1955. Note that only this original sequence clarifies the correct meaning which is lost in an alphabetical order, e.g., in the case "27. bark" (originally without the specification here added). • # 	I (first person singular pronoun)

• # 	you (second person singular pronoun; 1952 thou & ye)

• #	we (1955: inclusive)

• #	this

• #	that

• #	who? (“?” not 1971)

• # 	what? (“?” not 1971)

• #	not

• #	all (of a number)

• # 	many

• # 	one

• #	two

• #	big

• # 	long (not wide)

• # 	small

• #	woman

• # 	man (adult male human)

• #	person (individual human)

• # 	fish (noun)

• #	bird

• #	dog

• #	louse

• # 	tree (not log)

• # 	seed (noun)

• #	leaf (botanics)

• #	root (botanics)

• #	bark (of tree)

• #	skin (1952: person’s)

• #	flesh (1952 meat, flesh)

• #	blood

• #	bone

• # 	grease (1952: fat, organic substance)

• #	egg

• #	horn (of bull etc., not 1952)†

• #	tail

• #	feather (large, not down)

• #	hair (on head of humans)

• #	head (anatomic)

• #	ear

• #	eye

• #	nose

• #	mouth

• #	tooth (front, rather than molar)

• # 	tongue (anatomical)

• #	claw (not in 1952)†

• #	foot (not leg)

• # 	knee (not 1952)†

• #	hand

• #	belly (lower part of body, abdomen)

• # 	neck (not nape)

• #	breasts (female; 1955 breast)†

• # 	heart

• # 	liver

• #	drink (verb)

• # 	eat (verb)

• # 	bite (verb)

• #	see (verb)

• # 	hear (verb)

• #	know (facts)

• # 	sleep (verb)

• #	die (verb)

• # 	kill (verb)

• # 	swim (verb)

• #	fly (verb)

• #	walk (verb)

• #	come (verb)

• #	lie (on side, recline)

• #	sit (verb)

• # 	stand (verb)

• # 	give (verb)

• # 	say (verb)†

• # 	sun

• #	moon (not 1952)†

• # 	star

• # 	water (noun)

• #	rain (noun, 1952 verb)

• #	stone

• #	sand

• # 	earth (soil)

• #	cloud (not fog)

• # 	smoke (noun, of fire)

• #	fire

• # 	ash(es)

• # 	burn (verb intransitive)

• # 	path (1952 road, trail; not street)

• #	mountain (not hill)

• # 	red (color)

• #	green (color)

• # 	yellow (color)

• # 	white (color)

• #	black (color)

• # 	night

• #	hot (adjective; 1952 warm, of weather)

• #	cold (of weather)

• # 	full†

• #	new

• #	good

• #	round (not 1952)†

• #	dry (substance)

• #	name "Claw" was only added in 1955, but again replaced by many well-known specialists with (finger)nail, because expressions for "claw" are not available in many old, extinct, or lesser known languages.

The 110-item Global Lexicostatistical Database list uses the original 100-item Swadesh list, in addition to 10 other words from the Swadesh–Yakhontov list.

Swadesh 207 list
The most used list nowadays is the Swadesh 207-word list, adapted from Swadesh 1952.

In Wiktionary ("Swadesh lists by language"), Panlex and in Palisto's "Swadesh Word List of Indo-European languages", hundreds of Swadesh lists in this form can be found.

• # 	I

• # 	you (singular)

• # 	they (singular)

• # 	we

• # 	you (plural)

• # 	they (plural)

• # 	this

• # 	that

• # 	here

• # 	there

• # 	who

• # 	what

• # 	where

• # 	when

• # 	how

• # 	not

• # 	all

• # 	many

• # 	some

• # 	few

• # 	other

• # 	one

• # 	two

• # 	three

• # 	four

• # 	five

• # 	big

• # 	long

• # 	wide

• # 	thick

• # 	heavy

• # 	small

• # 	short

• # 	narrow

• # 	thin

• # 	woman

• # 	man (adult male)

• # 	man (human being)

• # 	child

• # 	wife

• # 	husband

• # 	mother

• # 	father

• # 	animal

• # 	fish

• # 	bird

• # 	dog

• # 	louse

• # 	snake

• # 	worm

• # 	tree

• # 	forest

• # 	stick

• # 	fruit

• # 	seed

• # 	leaf

• # 	root

• # 	bark (of a tree)

• # 	flower

• # 	grass

• # 	rope

• # 	skin

• # 	meat

• # 	blood

• # 	bone

• # 	fat (noun)

• # 	egg

• # 	horn

• # 	tail

• # 	feather

• # 	hair

• # 	head

• # 	ear

• # 	eye

• # 	nose

• # 	mouth

• # 	tooth

• # 	tongue (organ)

• # 	fingernail

• # 	foot

• # 	leg

• # 	knee

• # 	hand

• # 	wing

• # 	belly

• # 	guts

• # 	neck

• # 	back

• # 	breast

• # 	heart

• # 	liver

• # 	to drink

• # 	to eat

• # 	to bite

• # 	to suck

• # 	to spit

• # 	to vomit

• # 	to blow

• # 	to breathe

• # 	to laugh

• # 	to see

• # 	to hear

• # 	to know

• # 	to think

• # 	to smell

• # 	to fear

• # 	to sleep

• # 	to live

• # 	to die

• # 	to kill

• # 	to fight

• # 	to hunt

• # 	to hit

• # 	to cut

• # 	to split

• # 	to stab

• # 	to scratch

• # 	to dig

• # 	to swim

• # 	to fly

• # 	to walk

• # 	to come

• # 	to lie (as in a bed)

• # 	to sit

• # 	to stand

• # 	to turn (intransitive)

• # 	to fall

• # 	to give

• # 	to hold

• # 	to squeeze

• # 	to rub

• # 	to wash

• # 	to wipe

• # 	to pull

• # 	to push

• # 	to throw

• # 	to tie

• # 	to sew

• # 	to count

• # 	to say

• # 	to sing

• # 	to play

• # 	to float

• # 	to flow

• # 	to freeze

• # 	to swell

• # 	sun

• # 	moon

• # 	star

• # 	water

• # 	rain

• # 	river

• # 	lake

• # 	sea

• # 	salt

• # 	stone

• # 	sand

• # 	dust

• # 	earth

• # 	cloud

• # 	fog

• # 	sky

• # 	wind

• # 	snow

• # 	ice

• # 	smoke

• # 	fire

• # 	ash

• # 	to burn

• # 	road

• # 	mountain

• # 	red

• # 	green

• # 	yellow

• # 	white

• # 	black

• # 	night

• # 	day

• # 	year

• # 	warm

• # 	cold

• # 	full

• # 	new

• # 	old

• # 	good

• # 	bad

• # 	rotten

• # 	dirty

• # 	straight

• # 	round

• # 	sharp (as a knife)

• # 	dull (as a knife)

• # 	smooth

• # 	wet

• # 	dry

• # 	correct

• # 	near

• # 	far

• # 	right

• # 	left

• # 	at

• # 	in

• # 	with

• # 	and

• # 	if

• # 	because

• # 	name

Shorter lists
The Swadesh–Yakhontov list is a 35-word subset of the Swadesh list posited as especially stable by Russian linguist Sergei Yakhontov around the 1960s, although the list was only officially published in 1991. It has been used in lexicostatistics by linguists such as Sergei Starostin. With their Swadesh numbers, they are:

• 1. I

• 2. you (singular)

• 3. this

• 4. who

• 5. what

• 6. one

• 7. two

• 8. fish

• 9. dog

• 10. louse

• 11. blood

• 12. bone

• 13. egg

• 14. horn

• 15. tail

• 16. ear

• 17. eye

• 18. nose

• 19. tooth

• 20. tongue

• 21. hand

• 22. know

• 23. die

• 24. give

• 25. sun

• 26. moon

• 27. water

• 28. salt

• 29. stone

• 30. wind

• 31. fire

• 32. year

• 33. full

• 34. new

• 35. name

Holman et al. (2008) found that in identifying the relationships between Chinese dialects the Swadesh–Yakhontov list was less accurate than the original Swadesh-100 list. Further they found that a different (40-word) list (also known as the ASJP list) was just as accurate as the Swadesh-100 list. However, they calculated the relative stability of the words by comparing retentions between languages in established language families. They found no statistically significant difference in the correlations in the families of the Old versus the New World.

The ranked Swadesh-100 list, with Swadesh numbers and relative stability, is as follows (Holman et al., Appendix. Asterisked words appear on the 40-word list):

• # 22 louse (42.8)

• # 12 two (39.8)

• # 75 water (37.4)

• # 39 ear (37.2)

• # 61 die (36.3)

• # 1 I (35.9)

• # 53 liver (35.7)

• # 40 eye (35.4)

• # 48 hand (34.9)

• # 58 hear (33.8)

• # 23 tree (33.6)

• # 19 fish (33.4)

• # 100 name (32.4)

• # 77 stone (32.1)

• # 43 tooth (30.7)

• # 51 breasts (30.7)

• # 2 you (30.6)

• # 85 path (30.2)

• # 31 bone (30.1)

• # 44 tongue (30.1)

• # 28 skin (29.6)

• # 92 night (29.6)

• # 25 leaf (29.4)

• # 76 rain (29.3)

• # 62 kill (29.2)

• # 30 blood (29.0)

• # 34 horn (28.8)

• # 18 person (28.7)

• # 47 knee (28.0)

• # 11 one (27.4)

• # 41 nose (27.3)

• # 95 full (26.9)

• # 66 come (26.8)

• # 74 star (26.6)

• # 86 mountain (26.2)

• # 82 fire (25.7)

• # 3 we (25.4)

• # 54 drink (25.0)

• # 57 see (24.7)

• # 27 bark (24.5)

• # 96 new (24.3)

• # 21 dog (24.2)

• # 72 sun (24.2)

• # 64 fly (24.1)

• # 32 grease (23.4)

• # 73 moon (23.4)

• # 70 give (23.3)

• # 52 heart (23.2)

• # 36 feather  (23.1)

• # 90 white (22.7)

• # 89 yellow (22.5)

• # 20 bird (21.8)

• # 38 head (21.7)

• # 79 earth (21.7)

• # 46 foot (21.6)

• # 91 black (21.6)

• # 42 mouth (21.5)

• # 88 green (21.1)

• # 60 sleep (21.0)

• # 7 what (20.7)

• # 26 root (20.5)

• # 45 claw (20.5)

• # 56 bite (20.5)

• # 83 ash (20.3)

• # 87 red (20.2)

• # 55 eat (20.0)

• # 33 egg (19.8)

• # 6 who (19.0)

• # 99 dry (18.9)

• # 37 hair (18.6)

• # 81 smoke (18.5)

• # 8 not (18.3)

• # 4 this (18.2)

• # 24 seed (18.2)

• # 16 woman (17.9)

• # 98 round (17.9)

• # 14 long (17.4)

• # 69 stand (17.1)

• # 97 good (16.9)

• # 17 man (16.7)

• # 94 cold (16.6)

• # 29 flesh (16.4)

• # 50 neck (16.0)

• # 71 say (16.0)

• # 84 burn (15.5)

• # 35 tail (14.9)

• # 78 sand (14.9)

• # 5 that (14.7)

• # 65 walk (14.4)

• # 68 sit (14.3)

• # 10 many (14.2)

• # 9 all (14.1)

• # 59 know (14.1)

• # 80 cloud (13.9)

• # 63 swim (13.6)

• # 49 belly (13.5)

• # 13 big (13.4)

• # 93 hot (11.6)

• # 67 lie (11.2)

• # 15 small (6.3)

Sign languages
In studying the sign languages of Vietnam and Thailand, linguist James Woodward noted that the traditional Swadesh list applied to spoken languages was unsuited for sign languages. The Swadesh list results in overestimation of the relationships between sign languages, due to indexical signs such as pronouns and parts of the body. The modified list is as follows, in largely alphabetical order:

• # 	all

• # 	animal

• #	bad

• #	because

• #	bird

• #	black

• # 	blood

• #	child

• #	count

• # 	day

• # 	die

• #	dirty

• #	dog

• # 	dry

• # 	dull

• #	dust

• # 	earth

• #	egg

• # 	grease

• #	father

• #	feather

• #	fire

• # 	fish

• # 	flower

• #	good

• #	grass

• #	green

• #	heavy

• #	how

• #	hunt

• #	husband

• # 	ice

• #	if

• #	kill

• #	laugh

• #	leaf

• #	lie

• #	live

• #	long

• #	louse

• #	man

• #	meat

• #	mother

• # 	mountain

• #	name

• #	narrow

• # 	new

• #     night

• #      not

• #	old

• #	other

• #      person

• #	play

• # 	rain

• # 	red

• #	correct

• # 	river

• # 	rope

• #	salt

• # 	sea

• #	sharp

• # 	short

• #	sing

• # 	sit

• # 	smooth

• #	snake

• #	snow

• #	stand

• #	star

• #	stone

• # 	sun

• # 	tail

• # 	thin

• # 	tree

• #	vomit

• # 	warm

• # 	water

• #	wet

• #	what

• #	when

• # 	where

• #	white

• # 	who

• #	wide

• # 	wife

• # 	wind

• # 	with

• #	woman

• # 	wood

• #	worm

• # 	year

• # 	yellow

• #	full

• # 	moon

• #	brother

• #	cat

• # 	dance

• #	pig

• #	sister

• #	work