8 Jun 19:37
'car' in different ISO 639 parts
Doug Ewell <doug <at> ewellic.org>
2008-06-08 17:37:04 GMT
2008-06-08 17:37:04 GMT
I'm in the process of building a new 4645bis Registry that takes into
account the ISO 639-5 data, which I assume I have the rights to use.
Generally, when two different parts of ISO 639 apply different names to
the same code element, we have a policy of keeping all the names in the
Registry, because it was considered important for traceability. So we
will have things like:
Type: language
Subtag: ms
Description: Malay (macrolanguage)
Description: Malay
Added: 2005-10-16
Suppress-Script: Latn
because ISO 639-2 calls it simply "Malay" while ISO 639-3, which has to
differentiate macrolanguage codes from individual language codes, calls
it "Malay (macrolanguage)". This doesn't pose a problem since the
different names seem to refer to the same linguistic concept.
Now, ISO 639-5 draws a formal distinction between "language groups" and
"remainder groups," concepts used in (but not formally defined by) ISO
639-2. These are identified primarily by the words "languages" or
"(Other)" in the ISO 639-2 name. A code element like 'apa' ("Apache
languages") is considered to cover all Apache languages, whereas a code
element like 'gem' ("Germanic (Other)") covers only those Germanic
languages that do not have an ISO 639-2 code element of their own. Both
the LTRU WG and ietf-languages are familiar with this concept and have
discussed it often.
(Continue reading)
RSS Feed