Talk:Comparison of machine translation applications

First post
(added heading to 1st post)--Canoe1967 (talk) 19:40, 23 April 2012 (UTC)

I started this comparison article, which I think Wikipedia needs. I have to pause my work on it for now due to other obligations. If anyone here thinks the table should look different, please correct it, or help me fill it in if you feel like it, before I get back to it later today or tomorrow. --Acidburn24m 22:11, 20 October 2007 (UTC)


 * How do you recommend ordering the languages on the right? Alphabetically? By number of speakers? - Francis Tyers · 22:58, 20 October 2007 (UTC)


 * Also, I've added a free software line, but it would also be ok to split the page into two tables "Free software" first then "Proprietary software" second. - Francis Tyers · 23:08, 20 October 2007 (UTC)


 * That's a good idea! I added it to a table of its own. Acidburn24m 05:39, 21 October 2007 (UTC)


 * Ok, but none of the other systems apart from Apertium are free software. - Francis Tyers · 11:38, 21 October 2007 (UTC)

For the last hour I've been trying to work out how to get this whole table into shape. I want people to see clearly in the table which options they have for each language, instead of searching through every combination of languages in a huge list. The problem is that I am not sure what such a table would look like. I tried sketching something in Excel, but it didn't work out: I realized it would require a lot of duplicated rows, which would only complicate the list.

If anyone has any bright ideas for a nice, simple, small table that shows all the combinations without any duplicate rows, please share them. Acidburn24m 05:44, 21 October 2007 (UTC)


 * There are literally hundreds of systems; how should we decide which are included on this page? E.g. Gramtrans would be one I'd immediately think of for translation between the Scandinavian languages. - Francis Tyers · 11:38, 21 October 2007 (UTC)


 * It might be a good idea to merge "English - French" and "French - English" into one row, "English - French", and then mark it as unidirectional or bidirectional? - Francis Tyers · 11:39, 21 October 2007 (UTC)


 * Sounds interesting. Could you try to create the table as you envision it? Acidburn24m 15:55, 21 October 2007 (UTC)


 * Ouf, I'll try if I have some time, but it would require some significant restructuring. - Francis Tyers · 10:30, 22 October 2007 (UTC)


 * Try creating your idea in Excel and post a screen capture here of how you think it should look (just like I did). If you do so, I could work on the significant restructuring part of building the Wikipedia tables themselves. Acidburn24m 14:04, 22 October 2007 (UTC)


 * I don't use non-free software, but I could knock up something in Gnumeric. - Francis Tyers · 11:55, 28 October 2007 (UTC)

Another idea I had: maybe make different tables for the available MT web applications versus the actual MT programs? --Acidburn24m 19:20, 21 October 2007 (UTC)


 * Aye, considering that the big three "Google", "Babelfish" and "Windows Live" all use pretty much the same software (SYSTRAN). - Francis Tyers · 10:30, 22 October 2007 (UTC)

I am wondering: which other software-based MT applications are there besides Babylon? Acidburn24m 20:44, 23 October 2007 (UTC)

New Table
After giving it some thought, I redesigned the whole table to focus only on the languages that can be translated to English and the languages that can be translated from English. I decided to sort the list according to this list, to get the results most relevant to the languages actually used on the internet. I still need help with coding the table itself and filling it in. --Acidburn24m 06:02, 25 October 2007 (UTC)

Alternative
I was thinking something like this, using arrows after the yes/no to depict the directions, e.g. →, ← and ⇄ - Francis Tyers · 12:00, 28 October 2007 (UTC)
 * Looks nice. Acidburn24m 18:38, 28 October 2007 (UTC)
 * Now that I have finished fixing the table, it is time to think about how to implement your idea in it. Please let me know what exactly you think should be changed before I start editing it again. Acidburn24m 17:31, 1 November 2007 (UTC)
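
For illustration only, the merged-row idea from this thread (one row per language pair, with →, ← or ⇄ marking the supported direction) could be sketched roughly as below. This is a minimal Python sketch, not anything used for the article: the function name merge_directions and the sample pairs are hypothetical, and it assumes each supported translation is given as a (source, target) tuple.

```python
def merge_directions(pairs):
    """Collapse directed (source, target) pairs into one row per language pair.

    Each row is (language_a, language_b, arrow), where the two languages are
    in alphabetical order and the arrow is "→" (a to b only), "←" (b to a
    only) or "⇄" (both directions).
    """
    rows = {}
    for source, target in pairs:
        # Use the alphabetically sorted pair as the row key, so that
        # "English - French" and "French - English" land in the same row.
        key = tuple(sorted((source, target)))
        forward = (source, target) == key
        rows.setdefault(key, set()).add("→" if forward else "←")

    merged = []
    for (lang_a, lang_b), arrows in sorted(rows.items()):
        arrow = "⇄" if len(arrows) == 2 else next(iter(arrows))
        merged.append((lang_a, lang_b, arrow))
    return merged


# Hypothetical sample data, not taken from the article's tables.
sample = [
    ("English", "French"),
    ("French", "English"),
    ("English", "Hebrew"),
    ("Spanish", "English"),
]
for lang_a, lang_b, arrow in merge_directions(sample):
    print(f"{lang_a} {arrow} {lang_b}")
```

Keying each row on the sorted pair is what avoids the duplicated rows mentioned earlier: a pair appears once no matter which directions are supported.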

Table fixed and data has been entered
I know there is still much data to enter. At this point I need to take a break from entering the data. Please take your time to go over the data I entered and verify that it's correct. I would also appreciate it if you could enter new data into the empty boxes as well. Acidburn24m 19:01, 29 October 2007 (UTC)
 * OK, I entered what I could find. I can't find any list of the languages that Babylon's machine translation can translate between; if you find one, please fill it in the table. If you have any other ideas for MT websites to add, please add them to the list. --Acidburn24m 00:44, 30 October 2007 (UTC)
 * Francis Tyers, still waiting for your opinion about my latest changes. Acidburn24m 17:29, 1 November 2007 (UTC)

Does anybody know of any MT applications which can translate English to Hebrew other than Babylon and 1-800-translate?
If you do, please let me know. Acidburn24m 05:08, 16 November 2007 (UTC)
 * I believe Moses (and probably OpenLogos as well) can do the job, though you need to provide the parallel corpus and patience while it trains... —Preceding unsigned comment added by 62.226.31.103 (talk) 04:05, 26 February 2009 (UTC)

TRANSLATE.EU
Please add TRANSLATE.EU to the article. —Preceding unsigned comment added by 87.226.25.208 (talk) 20:08, 19 December 2007 (UTC)

Should language pairs that no listed application offers be listed?
The paragraph "Languages features comparison" currently also contains language pairs for which no listed translation application offers a translation, e.g. Thai to English, Finnish to English, English to Slovenian and so on. As this list could be expanded nearly arbitrarily (there are countless "missing" languages), I propose deleting these from the table. Gestumblindi (talk) 02:55, 24 March 2008 (UTC)


 * Agree. - Francis Tyers · 08:53, 8 May 2008 (UTC)


 * Funny question. I mean how many (living) languages exist? Several hundred?
 * But seriously, I'm not sure if it makes sense at all, since for example Moses, and basically all statistical machine translation software, is potentially able to translate any language pair. The problem is the availability of a sufficiently large, high-quality parallel corpus for the respective pair. Even the commercial engines are very likely based on stochastic methods; the only difference is that they bought translated text collections for only a small set of languages and are therefore restricted, whereas the free engines are restricted by the lack of free parallel text collections. That said, one only has to look around to find quite a lot of material that can be aligned. A good starting point might be OPUS.
 * I guess what I want to say is that it should at least be pointed out that often it's not the capability of the engine but rather the availability of parallel text collections that restricts the offer of languages. 62.226.48.30 (talk) 00:18, 27 February 2009 (UTC)

Web application vs Software package
A lot of those translation companies offer software packages at a cost. The license section doesn't say whether it is referring to the web application or the software package. This needs to be corrected --Voidvector (talk) 14:02, 20 June 2008 (UTC)

Yup, that needs a lot of work. It is misleading - SYSTRAN is supposed to be free, web-based software according to this article. FYI, SYSTRAN costs nearly $1000, and the software can do a lot more than the free online service. Actually, the software comes much closer to machine translation than the online thingamajig. Major editing is needed. —Preceding unsigned comment added by 74.15.124.120 (talk) 01:29, 18 July 2008 (UTC)

Citation needed for existence of platform
In "General information" there are no references for the "platform" entries. They should be added or this column should be deleted. -- 84.132.60.151 (talk) 12:35, 1 February 2009 (UTC)

Include only original software
1-800-translate was included and I believe it should be removed, since they do not have machine translation software; if they ever had any, they were probably a reseller of SYSTRAN.

Someone mentioned translate.eu. It is just a mashup of other technologies, such as Google Translate.

It is also interesting to note that Babylon is not available for download. It is simply a plugin, not the full translator, so it is basically online (or needs web access).

Other old software such as "Power Translator" seems to be missing, although I do not know if it is still in development. —Preceding unsigned comment added by 81.184.58.60 (talk) 19:57, 18 March 2009 (UTC)

Web-based and offline
A distinction should be made between browser-based services, internet-connected desktop programs (e.g. Lingoes), and offline programs. Programs such as Lingoes still need an internet connection to function, yet do not use a browser. —Preceding unsigned comment added by 81.246.131.229 (talk) 12:52, 26 March 2009 (UTC)

Slightly biased?
It appears to make Google Translate look good by selecting the languages it's capable of translating and not many others. Maybe the other translation services don't do any more, so maybe it's an accurate representation, but it looks biased. (I personally think Google is the best, but that's beside the point.) Also, the red-linked ones should be removed (their articles got deleted for being non-notable, advertising, or having an expired prod). 71.155.243.128 (talk) 00:26, 15 July 2009 (UTC)

Merge Hindi to Punjabi Machine Translation System with this article Comparison of machine translation applications
I suggest merging the above article with this article. The discussion is being held at Talk:Hindi to Punjabi Machine Translation System. Kindly leave your suggestions. Thanks. ▒ Wirεłεşş ▒ Fidεłitұ ▒ Ćłâşş ▒ Θnε ▒ ―Œ  ♣Łεâvε Ξ мεşşâgε♣  05:20, 14 January 2010 (UTC)
 * I am removing the stale merge tag--Canoe1967 (talk) 21:49, 23 April 2012 (UTC)

Added Bing/Microsoft Translator info
I updated the language support list for languages now supported by Bing Translator (including addition of Haitian Creole system). By way of disclosure - I work for the Microsoft Research team responsible for delivering this service. I have added the appropriate references to verify that the language list does include these languages. Vikram (talk) 21:09, 25 January 2010 (UTC)

Openlogos supported languages
These are the supported languages for OpenLogos:
 * source-language: e[ng] target-languages: g[er] f[re] s[pa] i[ta] p[or]
 * source-language: g[er] target-languages: e[ng] f[re] i[ta]

It is not obvious but it can be read here: http://www.pro-linux.de/NB3/artikel/2/253/3,openlogos-101-installation-und-anwendung.html —Preceding unsigned comment added by 81.184.58.60 (talk) 00:22, 22 March 2010 (UTC)

GoldenDict is not a translator
I think GoldenDict is not a translator; it simply calls other web services which can translate. —Preceding unsigned comment added by 81.184.58.60 (talk) 00:25, 22 March 2010 (UTC)

Websites not in tables yet
Should we add a section mentioning these? I don't have the knowledge to edit them into the tables.--Canoe1967 (talk) 19:58, 23 April 2012 (UTC)

LingoWare
LingoWare should be added to the list. Galzigler (talk) 21:33, 11 December 2012 (UTC)

Links in 3rd column "License"
The "Paid" links point to something that does not say what a "paid" licence is: does it differ from "Commercial"?

The "Commercial" entries either have no link or are linked in one of two different ways.

All these should be unified, as "Commercial" seems to me the best. --SGlad (talk) 15:37, 24 May 2013 (UTC)

Reference cleanup
Currently, the first reference reads "http://imtranslator.net/compare/#window compares translations from four websites (this seems to be VIRUS becareful, this is installed without user approval, not easy to remove)" It would be a good idea to check this website and remove/change the link if the site is indeed malicious, or remove the warning if not.

I would, of course, do it myself but cannot do so from my current location. (Hope I've formatted this talk page entry correctly...) — Preceding unsigned comment added by 13.21.125.8 (talk) 15:07, 30 July 2013 (UTC)
 * For the safety of our users, I'm going to be bold and remove it for now. If someone wishes to test it and re-add it, fine. But until then, I see no explicit reason to keep it. 64.134.100.199 (talk) 19:18, 31 December 2013 (UTC)

Requested edit
Please add Tilde MT (www.tilde.com/mt)
 * If you could supply the needed data for the chart, that would be perfect. -- Diannaa (talk) 22:58, 23 October 2015 (UTC)

Removed some ad copy descriptions from a few of the commercial SMT tools
What users want to know in the notes field is technical information, like whether machine translations are statistical or rule-based.

Instead, what we get is uninformative crap like this:
 * tauyou by tauyou.
 * tauyou machine translation (MT) service allows any company in the translation industry to integrate MT into their workflows, both to reduce costs and delivery times with human post-editing, and to solve new problems thanks to high-quality domain-specific machine translation.
 * KantanMT.com by Xcelerator Machine Translations Ltd.
 * KantanMT.com provides the leading cloud-based Machine Translation platform for the localization industry. It offers everything needed to integrate Machine Translation technology into corporate communication channels and localization workflows.
 * Slate by Precision Translation Tools
 * Package distribution of SMT binaries and scripts from Moses, MGIZA++ and Precision Translation Tools. Slate is the first SMT platform to train, tune and translate on native MS Windows machines without the performance bottleneck of the Cygwin emulation layer.

I removed the bold text from the names and neutralized the descriptions. For KantanMT, I deleted the descriptions altogether. We need someone familiar with those tools to provide neutral descriptions about what distinguishes them from other tools. --50.1.143.219 (talk) 16:22, 4 June 2016 (UTC)

Name
I think it would be better to change the name of the article to List of machine translation applications. Most similar articles use the word "list" for these types of articles. Serchia (talk) 16:03, 11 October 2021 (UTC)
 * We could split the article into two parts: "List of machine translation services" (for Google, MS, Yandex, etc.) and "List of machine translation engines" (Moses, NiuTrans, etc.). Some of these applications are only engines. Moses, for example, according to its website can "train translation models for any language pair". I don't know how someone came up with those language pairs for Moses. --Joseph (talk) 16:32, 13 October 2021 (UTC)

Requested Edit

 * What I think should be changed: Add Textshuttle public machine translation to table
 * Why it should be changed: Textshuttle is used by many people in Switzerland and it would make the table more complete
 * References supporting the possible change in German and French:

Note: I already attempted this edit before and it was reverted; I apologize if I wasn't following the proper process. My undone revision with the relevant data is here: https://en.wikipedia.org/w/index.php?title=Comparison_of_machine_translation_applications&oldid=1207291979

COI: I'm an employee of Textshuttle (not in a marketing position and not paid to do this edit).

Mpoemsl (talk) 13:19, 14 February 2024 (UTC)


 * This is a list of applications with preexisting Wikipedia articles. - MrOllie (talk) 13:29, 14 February 2024 (UTC)


 * I see, I didn't know that. Thanks for looking into it. Mpoemsl (talk) 14:00, 14 February 2024 (UTC)