User:Isabella Liberty/sandbox

@StatisticalMT --- PUBLISHED

Challenges(...) BOLD NEW

[word alignment

'''One of the problems presented is function words that have no clear equivalent in the target language. For example, when translating from English to German the sentence "John does not live here," the word "does" doesn't have a clear alignment in the translated sentence "John wohnt hier nicht." Through logical reasoning, it may be aligned with the words "wohnt" (as in English it contains grammatical information for the word "live") or "nicht" (as it only appears in the sentence because it is negated) or it may be unaligned.''']

[sentence alignment (bold new)

In parallel corpora single sentences in one language can be found translated into several sentences in the other and vice versa.[14]

'''Long sentences may be broken up, short sentences may be merged. There are even some languages that use writing systems without clear indication of a sentence end (for example, Thai).'''

Sentence aligning can be performed through the Gale-Church alignment algorithm.

Through this and other mathematical models efficient search and retrieval of the highest scoring sentence alignment is possible.]

[idioms

Depending on the corpora used, idioms may not translate "idiomatically". For example, using Canadian Hansard as the bilingual corpus, "hear" may almost invariably be translated to "Bravo!" since in Parliament "Hear, Hear!" becomes "Bravo!". [15]

'''This problem is connected with word alignment, as in very specific contexts the idiomatic expression may align with words that result in an idiomatic expression of the same meaning in the target language. However, it is unlikely, as the alignment usually doesn't work in any other contexts. For that reason, idioms should only be subjected to phrasal alignment, as they cannot be decomposed further without losing their meaning. This problem is therefore specific for word-based translation.''']