Talk:WWWJDIC

Merge with EDICT
They are two quite different projects. See my comments on the Talk page for EDICT. JimBreen (talk) 06:03, 11 September 2012 (UTC)

Tanaka Corpus
Just a note on the use of "highly modified". About 60,000 sentence pairs have been deleted from the original corpus (mostly duplicates and near-duplicates). Over 2,600 new sentence pairs have been added to improve vocabulary coverage. Sentences are fully indexed to EDICT entries (proper nouns, punctuation, etc. are excluded from indexing). More details are available from the linked page.

Rewrite
This article is rather muddled and could do with a rewrite. Samatarou (talk) 01:33, 14 January 2016 (UTC)