Laurent Romary | 1 Aug 2011 08:29
Picon
Picon
Favicon

Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

It just happens that we had to deal with a similar problem of chunking a document in pages to visualize each page one by one in a frame and did this with Solenne Coutagne (student from Ecole des CHartes) by means of two variables that are propagated at each processing step. I have somehow simplified the stylesheet below by just outlining the entry template and providing a simple application for text nodes, "normal" element (p) and empty ones (lb).

    <xsl:template match="/">
       
<xsl:for-each select="//tei:pb">
           
<xsl:apply-templates select="//tei:body">
               
<xsl:with-param name="anfang" select="."/>
               
<xsl:with-param name="ende" select="following::tei:pb[1]"/>
           
</xsl:apply-templates>
       
</xsl:for-each>
    
</xsl:template>

   
<xsl:template match="text()">
       
<xsl:param name="anfang"/>
       
<xsl:param name="ende"/>
       
<xsl:if test="not(following::tei:pb[ <at> n=$anfang/ <at> n]) and not(preceding::tei:pb[ <at> n=$ende/ <at> n])">
           
<xsl:value-of select="."/>
       
</xsl:if>
   
</xsl:template>

   
<xsl:template match="tei:p">
       
<xsl:param name="anfang"/>
       
<xsl:param name="ende"/>
       
<xsl:if test="not(following::tei:pb[ <at> n=$anfang/ <at> n]) and not(preceding::tei:pb[ <at> n=$ende/ <at> n])">
           
<p>
               
<xsl:apply-templates>
                   
<xsl:with-param name="anfang" select="$anfang"/>
                   
<xsl:with-param name="ende" select="$ende"/>
               
</xsl:apply-templates>
           
</p>
       
</xsl:if>
   
</xsl:template>

   
<xsl:template match="tei:lb">
       
<xsl:param name="anfang"/>
       
<xsl:param name="ende"/>
       
<xsl:if test="not(following::tei:pb[ <at> n=$anfang/ <at> n]) and not(preceding::tei:pb[ <at> n=$ende/ <at> n])">
           
<br/>
       
</xsl:if>
   
</xsl:template>



Le 31 juil. 2011 à 05:05, Hayim Lapin a écrit :

I seem to have stepped into a mess. Thanks to those of you who responded. And may I say that this is an amazing group!

HL
Hayim Lapin
Robert H. Smith Professor of Jewish Studies
and Professor of History
Director, Joseph and Rebecca Meyerhoff Center for Jewish Studies.


From: Sebastian Rahtz <sebastian.rahtz <at> OUCS.OX.AC.UK>
Reply-To: Sebastian Rahtz <sebastian.rahtz <at> OUCS.OX.AC.UK>
Date: Sat, 30 Jul 2011 17:20:39 -0400
To: "TEI-L <at> listserv.brown.edu" <TEI-L <at> listserv.brown.edu>
Subject: Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

Trouble is, that means special-casing div. The para split is surely ok?

Sent from my HTC

----- Reply message -----
From: "Lou Burnard" <lou.burnard <at> retired.ox.ac.uk>
Date: Sat, Jul 30, 2011 16:08
Subject: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)
To: "Sebastian Rahtz" <sebastian.rahtz <at> oucs.ox.ac.uk>
Cc: "TEI-L <at> listserv.brown.edu" <TEI-L <at> listserv.brown.edu>


On 30/07/11 15:49, Sebastian Rahtz wrote:

>     1. This may not not be what you actually want to get out
>     2. All hell may break loose later when you end up with many copies of the same element (see the repeated
>         <div>  in the example below)
>
> Whether the use of <at> prev is right, I am wondering with myself.


I would say probably not: it only makes sense in the presence of an
additional attribute <at> part="yes" so that you can distinguish these <div
type="stave">s (which don't actually contain a complete stave) from
those which do.

I am wondering whether it wouldn't be better to introduce some other
intermediate element (a seg would do) to wrap these fragmentary divs.

Laurent Romary
INRIA & HUB-IDSL



Laurent Romary | 1 Aug 2011 08:32
Picon
Picon
Favicon

Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

Forgot to mention that each output page is generated by means of a specific xsl:result-document.

Le 1 août 2011 à 08:29, Laurent Romary a écrit :

It just happens that we had to deal with a similar problem of chunking a document in pages to visualize each page one by one in a frame and did this with Solenne Coutagne (student from Ecole des CHartes) by means of two variables that are propagated at each processing step. I have somehow simplified the stylesheet below by just outlining the entry template and providing a simple application for text nodes, "normal" element (p) and empty ones (lb).

    <xsl:template match="/">
       
<xsl:for-each select="//tei:pb">
           
<xsl:apply-templates select="//tei:body">
               
<xsl:with-param name="anfang" select="."/>
               
<xsl:with-param name="ende" select="following::tei:pb[1]"/>
           
</xsl:apply-templates>
       
</xsl:for-each>
    
</xsl:template>

   
<xsl:template match="text()">
       
<xsl:param name="anfang"/>
       
<xsl:param name="ende"/>
       
<xsl:if test="not(following::tei:pb[ <at> n=$anfang/ <at> n]) and not(preceding::tei:pb[ <at> n=$ende/ <at> n])">
           
<xsl:value-of select="."/>
       
</xsl:if>
   
</xsl:template>

   
<xsl:template match="tei:p">
       
<xsl:param name="anfang"/>
       
<xsl:param name="ende"/>
       
<xsl:if test="not(following::tei:pb[ <at> n=$anfang/ <at> n]) and not(preceding::tei:pb[ <at> n=$ende/ <at> n])">
           
<p>
               
<xsl:apply-templates>
                   
<xsl:with-param name="anfang" select="$anfang"/>
                   
<xsl:with-param name="ende" select="$ende"/>
               
</xsl:apply-templates>
           
</p>
       
</xsl:if>
   
</xsl:template>

   
<xsl:template match="tei:lb">
       
<xsl:param name="anfang"/>
       
<xsl:param name="ende"/>
       
<xsl:if test="not(following::tei:pb[ <at> n=$anfang/ <at> n]) and not(preceding::tei:pb[ <at> n=$ende/ <at> n])">
           
<br/>
       
</xsl:if>
   
</xsl:template>



Le 31 juil. 2011 à 05:05, Hayim Lapin a écrit :

I seem to have stepped into a mess. Thanks to those of you who responded. And may I say that this is an amazing group!

HL
Hayim Lapin
Robert H. Smith Professor of Jewish Studies
and Professor of History
Director, Joseph and Rebecca Meyerhoff Center for Jewish Studies.


From: Sebastian Rahtz <sebastian.rahtz <at> OUCS.OX.AC.UK>
Reply-To: Sebastian Rahtz <sebastian.rahtz <at> OUCS.OX.AC.UK>
Date: Sat, 30 Jul 2011 17:20:39 -0400
To: "TEI-L <at> listserv.brown.edu" <TEI-L <at> listserv.brown.edu>
Subject: Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

Trouble is, that means special-casing div. The para split is surely ok?

Sent from my HTC

----- Reply message -----
From: "Lou Burnard" <lou.burnard <at> retired.ox.ac.uk>
Date: Sat, Jul 30, 2011 16:08
Subject: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)
To: "Sebastian Rahtz" <sebastian.rahtz <at> oucs.ox.ac.uk>
Cc: "TEI-L <at> listserv.brown.edu" <TEI-L <at> listserv.brown.edu>


On 30/07/11 15:49, Sebastian Rahtz wrote:

>     1. This may not not be what you actually want to get out
>     2. All hell may break loose later when you end up with many copies of the same element (see the repeated
>         <div>  in the example below)
>
> Whether the use of <at> prev is right, I am wondering with myself.


I would say probably not: it only makes sense in the presence of an
additional attribute <at> part="yes" so that you can distinguish these <div
type="stave">s (which don't actually contain a complete stave) from
those which do.

I am wondering whether it wouldn't be better to introduce some other
intermediate element (a seg would do) to wrap these fragmentary divs.

Laurent Romary
INRIA & HUB-IDSL




Laurent Romary
INRIA & HUB-IDSL



Sebastian Rahtz | 1 Aug 2011 10:37
Picon
Picon
Favicon

Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

The preceding/following pb method does indeed do the job for some classes of text.

The problem with this type of method is that a) its really quite inefficient, and b) it does not deal with the
markup like

   <p>
       <quote>
           <p>
              <hi>
                <note>
                  <p>
                     <pb/>

which is not uncommon. As ever, the TEI is hoist on the petard of the descriptive generosity of its markup…..
--
Sebastian Rahtz      
Head of Information and Support Group
Oxford University Computing Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431

Sólo le pido a Dios
que el futuro no me sea indiferente

Laurent Romary | 1 Aug 2011 12:51
Picon
Picon
Favicon

Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

I fully agree with the inefficiency but do not grasp the issue of your example below. The recursive call does
go through all such embedded structure (twice, AAMOF, hence the inefficiency).

Le 1 août 2011 à 10:37, Sebastian Rahtz a écrit :

> The preceding/following pb method does indeed do the job for some classes of text.
> 
> The problem with this type of method is that a) its really quite inefficient, and b) it does not deal with the
markup like
> 
>   <p>
>       <quote>
>           <p>
>              <hi>
>                <note>
>                  <p>
>                     <pb/>
> 
> which is not uncommon. As ever, the TEI is hoist on the petard of the descriptive generosity of its markup…..
> --
> Sebastian Rahtz      
> Head of Information and Support Group
> Oxford University Computing Services
> 13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431
> 
> Sólo le pido a Dios
> que el futuro no me sea indiferente
> 

Laurent Romary
INRIA & HUB-IDSL
laurent.romary <at> inria.fr

Sebastian Rahtz | 1 Aug 2011 12:54
Picon
Picon
Favicon

Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

On 1 Aug 2011, at 11:51, Laurent Romary wrote:

> I fully agree with the inefficiency but do not grasp the issue of your example below. The recursive call
does go through all such embedded structure 

ah, in that case I did not read your XSL properly, for which apologies! maybe you could publish the whole thing?
--
Sebastian Rahtz      
Head of Information and Support Group
Oxford University Computing Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431

Sólo le pido a Dios
que el futuro no me sea indiferente

Laurent Romary | 1 Aug 2011 14:21
Picon
Picon
Favicon

Re: <pb/> promotion (was Has anybody tried converting Oxygen track changes to tei:add|del?)

Indeed. Let me ask Solenne (hereby) to upmload it on http://wiki.tei-c.org/index.php/Category:XSLT

Le 1 août 2011 à 12:54, Sebastian Rahtz a écrit :

> 
> On 1 Aug 2011, at 11:51, Laurent Romary wrote:
> 
>> I fully agree with the inefficiency but do not grasp the issue of your example below. The recursive call
does go through all such embedded structure 
> 
> ah, in that case I did not read your XSL properly, for which apologies! maybe you could publish the whole thing?
> --
> Sebastian Rahtz      
> Head of Information and Support Group
> Oxford University Computing Services
> 13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431
> 
> Sólo le pido a Dios
> que el futuro no me sea indiferente
> 

Laurent Romary
INRIA & HUB-IDSL
laurent.romary <at> inria.fr

DCMI Announce | 1 Aug 2011 19:38
Favicon

NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and the Impact on Library Cataloging

NISO/DCMI August webinar announcement

***Please excuse cross-posting***

--------------------------------------------------------------------------------------------
NISO/DCMI Webinar: "International Bibliographic Standards, Linked Data, 
    and the Impact on Library Cataloging"
DATE: 24 August 2011
TIME: 1:00pm - 2:30pm EDT (17:00-19:30 UTC)
--------------------------------------------------------------------------------------------

ABOUT THE WEBINAR

The International Federation of Library Associations and Institutions (IFLA) is responsible for the development and maintenance of International Standard Bibliographic Description (ISBD), UNIMARC, and the "Functional Requirements" family for bibliographic records (FRBR), authority data (FRAD), and subject authority data (FRSAD). ISBD underpins the MARC family of formats used by libraries world-wide for many millions of catalog records, while FRBR is a relatively new model optimized for users and the digital environment. These metadata models, schemas, and content rules are now being expressed in the Resource Description Framework language for use in the Semantic Web.

This webinar provides a general update on the work being undertaken. It describes the development of an Application Profile for ISBD to specify the sequence, repeatability, and mandatory status of its elements. It discusses issues involved in deriving linked data from legacy catalogue records based on monolithic and multi-part schemas following ISBD and FRBR, such as the duplication which arises from copy cataloging and FRBRization. The webinar provides practical examples of deriving high-quality linked data from the vast numbers of records created by libraries, and demonstrates how a shift of focus from records to linked-data triples can provide more efficient and effective user-centered resource discovery services.

SPEAKERS:

Gordon Dunsire is a freelance consultant with 25 years of experience working in academic libraries and ten years in digital library research. He is a member of IFLA's ISBD/XML Study Group and FRBR Review Group, and he chairs the IFLA Namespaces Task Group. He is currently a member of a W3C Incubator Group on Library Linked Data.

Thomas Baker, Chief Information Officer (Communications, Research and Development) of the Dublin Core Metadata Initiative, was recently co-chair of the W3C Semantic Web Deployment Working Group and currently co-chairs a W3C Incubator Group on Library Linked Data.

REGISTRATION:  

For registration and webinar technical information, see http://www.niso.org/news/events/2011/dcmi/linked .  Registration closes at 12:00 pm Eastern on 24 August 2011.  Webinar presentation slides and Q&A will be posted to the site following the live webinar.  Registrants will receive access information to the archived webinar following the event. An e-mail message containing archive access instructions will be sent within 48 hours of the event.
Sebastian Rahtz | 1 Aug 2011 20:14
Picon
Picon
Favicon

practicalities of using <timeline>

We're making some use of <timeline> for a project at Oxford to synchronise text and audio, and miss three things:

 a) the facility to point to an audio file
 b) the facility to have gaps in the timeline without making up redundant <when>s
 c) direct linking to a transcription with a standoff <link>

I'm thinking in my innocence that I could say
   <timeline corresp="foo.m4a">
for the first, and 
  <when corresp="#p45">…</when>
for the third, but do people agree with that usage of  <at> corresp?
should it always point to an XML element? if not, how else to specify the
media file?

for the second, I'd like to add  <at> from and  <at> to attributes to <when>, to complement
 <at> since and  <at> interval. Any views on that?
--
Sebastian Rahtz      
Head of Information and Support Group, Oxford University Computing Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431

Sólo le pido a Dios
que el futuro no me sea indiferente

Julia Flanders | 2 Aug 2011 20:30
Favicon

using an XML database for user-contributed content

Is anyone out there using an XML database to drive an interface for  
gathering user-contributed content (e.g. data entry of some kind)? If  
so, I'd be really interested in knowing:

--what sort of scale? How many users overall? How many simultaneous  
users?
--what kinds of data are being entered?
--what XML database are you using?
--any pitfalls or difficulties you've encountered?

We're planning a fairly substantial data-gathering exercise in which  
users would be using a form to enter bibliographic data and possibly  
also transcriptions of short documents, and we're curious about  
performance and also any back-end design challenges.

Thank you so much for any advice--

best wishes, Julia

Julia Flanders
Women Writers Project
Brown University

Tomaz Erjavec | 4 Aug 2011 18:07
Picon
Picon
Favicon

<at> class problems

Dear all,
we're working on a msDescription catalogue, where we'd like for
msItemStruc's to be given  <at> class, where the values are to one or several
text classes.
But it turns out that things like <msItemStruct class="#nrss_type_sermon
#nrss_author_ecclesial"> are invalid because of the  <at> class value.
Looking at
http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-msItemStruct.html :
class		identifies the text types or classifications applicable to
this item
Status 		Optional
Datatype 	data.code
Values 		One or more codes, each of which is used as the identifier
for a text classification element supplied in the TEI Header textClass
element.
Example	<msItemStruct n="2" defective="false" class="biblComm">

which seems to have two problems:
- it says "text types" and "One or more codes", but the datatype is
data.code (so, not repeatable)
- the example gives the  <at> class without the #, which is dangerously
misleading - for a while there we thought, well, data.code for some reason
does not use #, so we here happily writing  <at> class="nrss_type_sermon
nrss_author_ecclesial", which does validate, but is probably nonsense.
So, would this be an error or two in the Guidelines? 

And a related question: we actually thought to link the  <at> class values to a
taxonomy, but reading the definition, I see that it is meant only for
linking to textClass. 
But http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-textClass.html
defines it 
<textClass> (text classification) groups information which describes the
nature or topic of a text in terms of a standard classification scheme,
thesaurus, etc.
I would then imagine that each text gets only one textClass; in which case,
how could you have more than one value for  <at> class, and what is the point of
having  <at> class anyway, given that all msDesc belong to the same <TEI>? (well,
except for the borderline case where you would have a teiCorpus of
manuscripts).

Any illumination would be much appreciated!

Best,
Tomaž
--

-- 
Tomaž Erjavec, http://nl.ijs.si/et/
Dept. of Knowledge Technologies, Jožef Stefan Institute, Ljubljana