Tex Texin | 1 Sep 11:43 2003

Unicode Standard version 4.0 published! Conference this week!

        NEWSFLASH: Version 4.0 of the Unicode Standard is here!

The Unicode Consortium and Addison-Wesley announced publication of Version 4.0
of the Unicode Standard last week.

You can read about the impact the standard is having on domain names, worldwide
communication and the digital divide, in the press release on the Unicode
Consortium web site.
http://www.unicode.org/press/press_release-4.0Book.html

You can learn more about this latest version as well as how to employ Unicode
in software and web applications, at the conference in Atlanta this week!
This is the conference to attend if you need to know about internationalization
best practices.
Join us!

************************************************************************
    Twenty-fourth Internationalization and Unicode Conference (IUC24)
     Unicode, Internationalization, the Web: Powering Global Business

                     http://www.unicode.org/iuc/iuc24
                            September 3-5, 2003
                           Atlanta, Georgia, USA
************************************************************************

NEWS

 > Meet industry leaders in Atlanta and discuss the hot topics for
   internationalization and Unicode in 2003! Check out the updated
   Conference program ( http://www.unicode.org/iuc/iuc24/program.html )
(Continue reading)

Tex Texin | 1 Sep 19:43 2003

Re: [OT?] QBCS

Doug,

In most industry usages, MBCS refers to variable width encodings, not fixed
width.

tex

Doug Ewell wrote:

> Paradoxically (at least to me), the term "multi-byte character set"
> refers to a fixed-width encoding, such as UCS-2.  The official name of
> ISO/IEC 10646 is "Universal Multiple-Octet Coded Character Set."

--

-- 
-------------------------------------------------------------
Tex Texin   cell: +1 781 789 1898   mailto:Tex <at> XenCraft.com
Xen Master                          http://www.i18nGuy.com

XenCraft		            http://www.XenCraft.com
Making e-Business Work Around the World
-------------------------------------------------------------

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com
(Continue reading)

Doug Ewell | 2 Sep 05:26 2003
Picon
Picon

Re: QBCS

Tex Texin <tex at i18nguy dot com> wrote:

> In most industry usages, MBCS refers to variable width encodings, not
> fixed width.

Well, if variable-width encodings are referred to as both DBCS (see, for
example, http://czyborra.com/charsets/cjk.html#dbcs) and MBCS, then what
term is used to describe a fixed-width encoding of more than 1 byte?  Or
was the concept not common enough to warrant a name until Unicode?

-Doug Ewell
 Fullerton, California
 http://users.adelphia.net/~dewell/

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/ 

Asmus Freytag | 2 Sep 09:24 2003
Picon

Re: QBCS

At 08:26 PM 9/1/03 -0700, Doug Ewell wrote:
>Tex Texin <tex at i18nguy dot com> wrote:
>
> > In most industry usages, MBCS refers to variable width encodings, not
> > fixed width.
>
>Well, if variable-width encodings are referred to as both DBCS (see, for
>example, http://czyborra.com/charsets/cjk.html#dbcs) and MBCS, then what
>term is used to describe a fixed-width encoding of more than 1 byte?  Or
>was the concept not common enough to warrant a name until Unicode?

The most common 'pure' DBCS was encountered in mainframe environments.
All the other platforms used 'mixed' single and double-byte or other
variable length encodings, so that 'DBCS' could stand in for a variable
lenght encoding with maximum length 2 without confusion (except when
talking to mainframe people).

A./

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html

(Continue reading)

Peter_Constable | 2 Sep 15:48 2003

[OT]Re: Breaking free from UNICODE

Michael Eversion wrote on 08/19/2003 03:14:47 PM:

> Golly, I was able to distinguish Latin and Georgian and Cyrillic on a 
> Mac SE 30 in 1985. Or was it 1987.(Long before Worldscript I admit.) 
> And years before that there was the Osborne with its dot-matrix 
> miracles.

IIRC, the Mac SE did not exist in 1985; I was using a relatively new Fat 
Mac in the summer of that year. 

BTW, the Osborne didn't particularly have dot-matrix miracles. That was 
the domain of printers like the Toshiba P321 and various Epson LQ models, 
and such printers could be connected to CP/M machines like the Osbornes 
and Kaypros, DOS machines like the IBM PC and Sharp PC 5000, and the Macs. 
But in 1985 I think the only dot matrix printers were the 9-pin variety, 
which weren't all that conducive to readable Latin with diacritics, let 
alone Chinese or Arabic typesetting. The P321 was one of the first 24-pin 
models, and I think it came out in 1987 or maybe late 1986.

Peter Constable

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com

(Continue reading)

Peter_Constable | 2 Sep 15:48 2003

Re: Character codes for Egyptian transliteration

Peter Kirk on 08/21/2003 09:33:27 AM:

> As for the requirement for distinct upper and lower case variants of 
> ayin, I understood that there was a similar requirement in some minor 
> Cyrillic languages, at least for apostrophe and double apostrophe. 
> Earlier this year Peter Constable was gathering information for a 
> possible proposal. But I never heard if it was proceeded with.

I was given charts reporting these things being used for various 
languages, but don't think I ever got an explanation of what the purpose 
for them was, and I didn't get any confirmation of actual use let alone 
samples from actual publications. If you can provide samples, that would 
be great.

Peter Constable

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/ 

(Continue reading)

Michael Everson | 2 Sep 16:02 2003

Re: Last Resort Font

At 06:48 -0700 2003-09-02, Peter_Constable <at> sil.org wrote:
>Michael Eversion wrote on 08/19/2003 02:52:55 PM:
>
>>  >p. 63 (Syloti Nagri): both top and bottom read "SILOTI NAGRI".
>
>  > I will look into all of that, and thank you for it; but note that of
>>  those only Thaana can be expected to display, as none of the others
>  > have been encoded. So none of those could EVER be displayed; they are
>>  just extra glyphs in the current font.
>
>Syloti Nagri has been approved by UTC and assigned to A800..A82F, though
>this is yet to be ratified by WG2 (presumably will happen in October) and
>published in a new version of Unicode (will be 4.1) or an amendment to ISO
>10646 (I don't know what timetable is in place for publishing further
>amendments).

And it will be two years before the LR font has to be updated....
--

-- 
Michael Everson * * Everson Typography *  * http://www.evertype.com

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html
(Continue reading)

Peter_Constable | 2 Sep 15:48 2003

Re: Last Resort Font

Michael Eversion wrote on 08/19/2003 02:52:55 PM:

> >p. 63 (Syloti Nagri): both top and bottom read "SILOTI NAGRI".

> I will look into all of that, and thank you for it; but note that of 
> those only Thaana can be expected to display, as none of the others 
> have been encoded. So none of those could EVER be displayed; they are 
> just extra glyphs in the current font.

Syloti Nagri has been approved by UTC and assigned to A800..A82F, though 
this is yet to be ratified by WG2 (presumably will happen in October) and 
published in a new version of Unicode (will be 4.1) or an amendment to ISO 
10646 (I don't know what timetable is in place for publishing further 
amendments).

Peter Constable

------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information
available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare
| Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-unsubscribe <at> yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/ 

(Continue reading)

Philippe Verdy | 3 Sep 01:04 2003
Picon

Re: QBCS

From: "Asmus Freytag" <asmusf <at> ix.netcom.com>
> At 08:26 PM 9/1/03 -0700, Doug Ewell wrote:
> >Tex Texin <tex at i18nguy dot com> wrote:
> >
> > > In most industry usages, MBCS refers to variable width encodings,
not
> > > fixed width.
> >
> >Well, if variable-width encodings are referred to as both DBCS (see,
for
> >example, http://czyborra.com/charsets/cjk.html#dbcs) and MBCS, then
what
> >term is used to describe a fixed-width encoding of more than 1 byte?
Or
> >was the concept not common enough to warrant a name until Unicode?
>
> The most common 'pure' DBCS was encountered in mainframe environments.
> All the other platforms used 'mixed' single and double-byte or other
> variable length encodings, so that 'DBCS' could stand in for a
variable
> lenght encoding with maximum length 2 without confusion (except when
> talking to mainframe people).

In the late 80's, the acronym DBCS was also used to refer to
user-defined characters, that could be assigned in a codepage and
defined by a transferable bitmap, and accessed with an encoding sequence
allowing you to remap the upper-half of the 8-bit character set.

In a 7-bit environment, these 8-bit "characters" (in fact relative
positions in a 7-bit codepage) could be accessed using control sequences
(Continue reading)

Philippe Verdy | 3 Sep 02:07 2003
Picon

Re: Cyrillic character mapping tables, HP MSL to Unicode

First start with this page:
http://www.hp.com/cposupport/printers/support_doc/bpl04568.html
You may want to buy this:
"Refer to the HP PCL5 Technical Reference Bundle. To order, call HP's
driver/software distribution at 661-257-5565. The part number is
5961-0976."

You may also look at:
http://www.hp.com/cposupport/printers/support_doc/bpl02705.html
and refer to this:
"For further information about PCL commands, HP-GL/2, macros, or PJL
commands, use the Technical Reference Manual set, part number 5021-0377.
Order the manual set from HP's Support Materials Organization."

Or you may download this:
http://h20000.www2.hp.com/bc/docs/support/SupportManual/bpl13210/bpl13210.pdf
"PCL 5 Printer Language Technical Reference Manual - ENWW - HP Part No.
5961-0509. Printed in USA. First Edition - October 1992 PCL 5 Printer
LanguageTechnical Reference Manual."
I have the same book, but dated September 1990 (this was really the
first edition), HP part number 33459-90903.

Also:
http://h20000.www2.hp.com/bizsupport/TechSupport/SoftwareDescription.jsp?locBasepartNum=5961-0976&lang=English%20%28US%29
"HP PCL Tech Reference Manual CD-ROM - The HP PCL Tech Reference Bundle
CD-ROM includes, the Technical Quick Reference Guide, Printer Job
Language Technical Reference Manual, PCL 5 Color Technical Reference
Manual, PCL 5 Printer Language Technical Reference Manual. In English in
a PDF. Format."

(Continue reading)


Gmane