Talk:Western Latin character sets (computing)

code to generate table
this code was written using delphi but should also build and run with freepascal. readtxt.pas can be obtained from the bewareserv source. the mappings come from the following locations.


 * http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-1.TXT
 * http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-15.TXT
 * http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT
 * http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/PC/CP437.TXT
 * http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/PC/CP850.TXT
 * http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ROMAN.TXT

as raw statements of facts about encodings that a huge amount of software uses i do not belive the raw mappings from theese can be considered eligable for copyright.

program charsetcomparisongen;

uses sysutils,readtxt; //we use our own text reader as the delphi one can't handle //unix format text const maxcharset =5; var buildarray : array[0..65535,0..maxcharset] of smallint; names : array[0..31] of string; procedure processcharset(name:string;number:byte;filename:string); var t: treadtext; line:string; i,j:integer; begin names[number] := name;

readtext_init(t,filename); repeat line := readtext_line(t); if (length(line)>=11) and (line[2]='x') and (line[7]='x') then begin; //writeln('processing line '+line); buildarray[strtoint('

— Preceding unsigned comment added by Plugwash (talk • contribs) 23:13, 7 July 2005 (UTC) +copy(line,8,4)),number] := strtoint('

— Preceding unsigned comment added by Plugwash (talk • contribs) 23:13, 7 July 2005 (UTC) +copy(line,3,2));   end;  until readtext_eof(t); end; var  t: textfile;  i,j : integer;  firstline : boolean;  goodline : boolean;  rowcounter : integer; begin  for i := 0 to 65535 do for j := 0 to maxcharset do buildarray[i,j] := -1;  processcharset('ISO-8859-1',0,'8859-1.txt');  processcharset('ISO-8859-15',1,'8859-15.txt');  processcharset('WINDOWS-1252',2,'CP1252.txt');  processcharset('IBM437',3,'CP437.txt');  processcharset('IBM850',4,'CP850.txt');  processcharset('Mac-Roman encoding {{cob}}

— Preceding unsigned comment added by Plugwash (talk • contribs) 23:13, 7 July 2005 (UTC)

Language represented
Gleaned from Czyborra.com. Can anyone fill in the blanks? —Michael Z. 2005-07-8 20:25 Z 


 * ok i've put a modified version of thas table below

Representation of Western European languages
Please don't edit the introduction to make it sound like ASCII is sufficient to represent non-English languages, and language-specific letters are a bonus. Either an encoding has the letters used in a language, or it doesn't. Imagine if you had to write English without the letters C, Q, and W, and had to substitude S, K, and "UU" for them—you would kuikly deside that this sukked, and undoubtedly kreate your own English-language enkoding by the end of the uueek. —Michael Z. 2005-07-9 07:19 Z 

Company logo?
From the article:
 * MacRoman simply replaced the generic currency sign (¤). This caused significant difficulty because organisations had found other uses for it, such as the company logo.

Just out of curiousity...how many companies were using the generic currency sign as their company logo??? 209.92.136.131 21:43, 19 February 2007 (UTC)


 * I don't know any facts indicating that they did, but the "generic currency sign" has been a singularly useless character, since its intended purpose was almost never needed in practical use... AnonMoos 01:53, 20 February 2007 (UTC)

Title
Shouldn't the title of this article be simply "Latin character sets (computing)"? The word "Western" belongs with "European," but not with "Latin." There's no such thing as "Western Latin," is there? Jim Monty (talk) 05:08, 9 February 2015 (UTC)


 * I think it's supposed to mean "Western character sets among the Latin ones" not "character sets which are Western Latin"... AnonMoos (talk) 04:22, 10 February 2015 (UTC)

typo?
"this has been addressed in HTM 5" --> "this has been addressed in HTML 5"? — Preceding unsigned comment added by 217.110.38.73 (talk) 15:47, 12 December 2016 (UTC)