Re: numbers in prose
Syd Bauman <Syd_Bauman <at> BROWN.EDU>
2003-03-28 13:55:00 GMT
> With the Digital Library of Dutch Letters, Leyden, the
> Netherlands, we digitize paper-editions of older Dutch literary
> text. Sometimes those texts have prose with a kind of line
> numbering in the margin.
How you encode these depends a lot on how you think of them, and a
bit on what you want to do with them. If I am imagining what you have
correctly, I think of the line numbers in the margin are a sort of
referencing system, much like page numbers. In which case, one good
method would be to encode them as <fw>. The rend= attribute of <fw>
would allow you to specify they were in the margin:
<fw type="lineNum" rend="place(left) pre([) post(])">17</fw>
> For us, using TEI-Lite, these numbers are hard to handle,
Oops. TEI Lite does not have <fw>. To me this is a good argument that
you should not be using Lite. There are probably some less desirable
solutions available in Lite, though.
E.g., one might think of the line numbers as sort of special-case
labels, and use
<label rend="place(left) pre([) post(])">17</label>
Or use the n= attribute of each <lb>, or at least each <lb> which
actually had an associated line number in the margin:
<lb n="17"/> Or, I think less desirable: <lb n="[17]"/>
(Note that you need to decide, and describe very clearly in the
<teiHeader>, whether the value of n= on <lb> refers to the preceding
or following lines sequence number. I.e.
| [1] ... rights language used by those in power against those on
| [2] the bottom should be treated with suspicion, while rights
| [3] language used by those on the bottom against those in power
| [4] should be treated with respect
could easily be encoded as either
<p>
<lb n="1"/>… rights language used by those in power against those on
<lb n="2"/>the bottom should be treated with suspicion, while rights
<lb n="3"/>language used by those on the bottom against those in power
<lb n="4"/>should be treated with respect<sic corr="."></sic>
</p>
or
<p>
… rights language used by those in power against those on<lb n="1"/>
the bottom should be treated with suspicion, while rights<lb n="2"/>
language used by those on the bottom against those in power<lb n="3"/>
should be treated with respect<sic corr="."></sic><lb n="4"/>
</p>
(Quote taken from end of the article that started at
http://www.arstechnica.com/wankerdesk/01q3/ip-ethics/ip-ethics-1.html.)
I think most folks would naturally go with the first example
(following), but I have no data to back up that assertion.
Note that the same difficulty applies with the use of <fw> or
<label>, but in those cases we would likely end up encoding what was
on the source page where it occurred on the source page.
> ... also in respect to a more or less functional and acceptable
> HTML-representation.
I will leave this to those who know more about HTML (a language I
know very little about).
> My question is: will TEI provide a simple solution for this kind of
> references?
Solution? Yes. Simple? You be the judge.
BTW, as you consider how to encode this stuff, you may very well want
to think about what you do when there's an error.
<fw type="lineNum" rend="place(right) pre([) post(])">
<sic corr="182">128</sic>
</fw>
(There's lots more to be said on the issue, to be sure, but I just
wanted to pique your interest. Gotta go ...)