Re: Data update from ISO 639-5 to draft-4645bis
2009-03-01 22:08:35 GMT
This is all resolved, and I'm not trying to re-open anything. I just thought people might be interested in some background on the term "Dayak" and why it's not sensible to try to infer genetic associations based on that term.
"Dayak" is not a linguonym but rather an ethnonym or, even, a socionym. What was explained to me some years back is that its history is a pejorative term used by the "civilized" Muslim majority on the island of Borneo to refer to the many uncivilized, non-Muslim "tribal" minorities.
I ran into this when doing early prep work for ISO 639-3: trying to figure out how the entry day in 639-2 should relate to entries in Ethnologue. The MARC Language Code List, which was the source for 639-2, used day to encompass languages that cut across major divisions of the Western Malayo-Polynesian language family. At the time, 639-2 treated it as an individual language ("languages" was not part of the name). The options were (a) to specify the denotation to some particular language (in spite of MARC), (b) change the scope to be a genetic collection and find some node in the W M-P language-family hierarchy to identify it with (breaking with past MARC usage, but less severely), (c) define the scope to be some ad-hoc collection (perhaps following MARC documentation), or (d) to deprecate it. The decision taken by the JAC was to treat it as a collection. As ISO 639 does not document the precise denotation of collections, the choice between options (b) and (c) was left undetermined.
If someone wants to try to improve on nailing down this bit of jello, good luck! J
Peter
From:
ltru-bounces <at> ietf.org [mailto:ltru-bounces <at> ietf.org] On Behalf Of CE
Whitehead
Sent: Saturday, February 21, 2009 7:53 AM
To: ltru <at> ietf.org
Cc: iso639-3 <at> sil.org; rgue <at> loc.gov
Subject: Re: [Ltru] Data update from ISO 639-5 to draft-4645bis
Hi, Kent, Doug:
It seems to me that
at ethnologue:
Dayak, Land [dyk] is a subset of the Land Dayak group of languages:
http://www.ethnologue.com/show_family.asp?subid=91242
(There is also a separate group of languages there, Malayic Dayak which is
under the group Malayik and not the group Land Dayak; both Malayik (in
the "Ibanic" group) and Land Dayak are classified as
Malayo-Polynesian but they are not the same--not in Wikipedia either; Wikipedia
classifies both as Dayak it seems.
So it may be just the naming that is different.
That's as near as I can make this classification out.)
Thanks.
--C. E. Whitehead
cewcathar <at> hotmail.com
From: Kent
Karlsson <kent.karlsson14 at
comhem.se>
Date: Sat, 21
Feb 2009 13:30:39 +0100
>Side remark: I have seen translations of this one as just
"Dayak".
> But the Land Dayak languages appears to be a subset of the Dayak
languages. At least according to Wikipedia:
> http://en.wikipedia.org/wiki/Dayak_languages,
> http://en.wikipedia.org/wiki/Bidayuh.
> (In addition there is the retired code dyk for Land Dayak [languages]...)
Recent examples can be found in the
"Action" announcements for approved documents. The approval
announcement contains the following sections:
Technical Summary
Relevant content can frequently be found in the abstract
and/or introduction of the document. If not, this may be
an indication that there are deficiencies in the abstract
or introduction.
Working Group Summary
Was there anything in WG process that is worth noting? For
example, was there controversy about particular points or
were there decisions where the consensus was particularly
rough?
Document Quality
Are there existing implementations of the protocol? Have a
significant number of vendors indicated their plan to
implement the specification? Are there any reviewers that
merit special mention as having done a thorough review,
e.g., one that resulted in important changes or a
conclusion that the document had no substantive issues? If
there was a MIB Doctor, Media Type or other expert review,
what was its course (briefly)? In the case of a Media Type
review, on what date was the request posted?
This document describes the structure, content, construction, and
semantics of language tags for use in cases where it is desirable to
indicate the language used in an information object. It also
describes how to register values for use in language tags and the
creation of user-defined extensions for private interchange.
This document is an update of RFC4646. The main change is the
addition of thousands of three-letter language subtags for languages
for which tagging was not possible up to now. Also, the registry
format and procedures were adjusted to deal with this change,
and to reflect experience from current practice.
The WG process for this document was mostly smooth and revolving
around details. There were some highly contentious issues, but
for all of them, a solution was found that was acceptable to
the involved parties and works for all scenarios identified.
The IANA Language Subtag Registry, and the language tags that can
be formed according to this document and its predecessor, are widely
used across the Internet to identify languages, both in implementations
(code) and in a wide range of data.
Regards, Martin.
#-#-# Martin J. Du"rst, Assoc. Professor, Aoyama Gakuin University
#-#-#
RSS Feed