Kevin Hawkins | 1 Nov 2011 16:01

Re: best practices for <l> and manuscript transcription

In my opinion, there are two approaches to assinging IDs ( <at> xml:id).

One is to have a scheme for IDs designed to avoid collisions, much like 
the one you suggest.  There's a similar scheme for divs in Humanities 
E-Book XML titles ( 
http://www.humanitiesebook.org/xml/doc/acls-hebook-doc.html#divs ).  The 
trick here is that if you revise your encoded text later and add or 
remove an element, you have to decide whether to change your IDs to 
clean things up or break the system.  So if you think your text will be 
frequently revised, you might instead assign random ID strings to each 
line.  XSLT can do this easily for you. With such IDs, it won't be 
awkward if you add a line later.

As for numbering elements ( <at> n), you should think about what is the most 
stable in your text.  Will pagination likely change in the future as the 
text is presented in different media?  If so, don't restart numbering on 
each page.  Humanities E-Book numbers sequentially through the text ( 
http://www.humanitiesebook.org/xml/doc/acls-hebook-doc.html#para ).  I 
believe the intention is that users could cite a particular paragraph 
number for the book as a whole, not a number relative to a certain 
chapter or page.  Absolute numbering works well with MLA style, which 
basically recommends citing such numbering when available.

--Kevin

On 7/22/64 2:59 PM, Karen Desmond wrote:
> Hello,
> I'm wondering if anyone has thought through some best practices for assigning
> an xml:id to a transcription of a line of text in a manuscript.
> I am using the<l>  tag at the moment, until the new<ge:line>  element is
(Continue reading)

Sebastiaan Verweij | 2 Nov 2011 13:20
Picon
Picon

oXygen/white space/Format and Indent?

Dear list

I've a quick query regarding oXygen. Sorry if this is slightly TEI off-topic?

I'm working my way through a series of manuscript transcriptions, and when I hit 'Format and Indent', for all XML within my <text> / <body> elements, oXygen picks up on the <p> elements, by which it then organises my xml. I wonder if there's a way for it to organise by <pb> element instead, which to me is the more useful division. I've been through the preferences, but can't easily see a way. 

Having pressed 'format and indent' I also don't seem to be able to undo this, and it does't always give me the most useful text-flow on screen. Are you aware of a way to 'undo' this formatting, so that the XML just appears as a long horizontal text (which before I used to 'toggle line wrap')?

This seems very trivial, but it has some consequences for my html output. Before, I used to give one return after each instance of, say, <pb ="53r" />, and that was automatically transformed as a page reference followed by a single space. Now, since the <pb> elements are no longer getting their returns in 'format and indent' view, the spaces are absent, meaning that sometimes the page break and manuscript text run into each other. Like so: 

<pb n="53r"/>The text

Is rendered *[f. 53r]The text*; but 

<pb n="53r"/>
The text

Is rendered *[f. 53r] The text* (with space). 

I know I can sort this in my XML or XSL easily, but I am just trying to understand how oXygen handles white space. 

Huge thanks! 

Sebastiaan

---
Dr Sebastiaan Verweij || Hardie Postdoctoral Fellow, Lincoln College
Research Associate,The Oxford Edition of the Sermons of John Donne
a: English Faculty, St Cross Building, Manor Road, Oxford OX1 3UL
t: 0044 (0)1865 271931 || skype: seb.macv

George Cristian Bina | 2 Nov 2011 14:22
Favicon

Re: oXygen/white space/Format and Indent?

Hi Sebastian,

If you can post a short sample document, before and after applying 
formatting then I can explain you what happens exactly in that case.

The processing that takes place during format and indent follows the 
rules below:

- if oXygen determines that the space should be preserved for an element 
then it is not changed during format and indent
- otherwise we try to determine if the element has element only content 
or allows text.
     - if the element has element only content then we format and indent 
its elements
     - if the element allows text then we change the whitespace (but we 
never introduce whitespace when there is not or remove completely the 
whitespace) trying to satisfy the line width constraint set in the 
formatting options as well as other formatting options.

Now, for determining if the whitespace should be preserver or not oXygen 
looks into different places:
- inside the formatting options (there are a few lists there with 
different types of elements)
- inside the document, if there is an xml:space attribute
- inside the associated schema, if that defines for example a default 
xml:space attribute
- if the Author mode is used then oXygen looks also in the associated 
CSS to see if the style of the element specifies that the space should 
be preserved

For determining if an element has element only content or not oXygen 
looks again in different places:
- in the oXygen options
- inside the document itself, analyzing the content of the element
- inside the associated schema

I will be very interested to see a case when oXygen removed the space 
from a fragment like

<pb n="53r"/>
The text

resulting in

<pb n="53r"/>The text

because as far as I know this should not happen.

Best Regards,
George
--
George Cristian Bina
<oXygen/> XML Editor, Schema Editor and XSLT Editor/Debugger
http://www.oxygenxml.com

On 11/2/11 2:20 PM, Sebastiaan Verweij wrote:
> Dear list
>
> I've a quick query regarding oXygen. Sorry if this is slightly TEI
> off-topic?
>
> I'm working my way through a series of manuscript transcriptions, and
> when I hit 'Format and Indent', for all XML within my <text> / <body>
> elements, oXygen picks up on the <p> elements, by which it then
> organises my xml. I wonder if there's a way for it to organise by <pb>
> element instead, which to me is the more useful division. I've been
> through the preferences, but can't easily see a way.
>
> Having pressed 'format and indent' I also don't seem to be able to undo
> this, and it does't always give me the most useful text-flow on screen.
> Are you aware of a way to 'undo' this formatting, so that the XML just
> appears as a long horizontal text (which before I used to 'toggle line
> wrap')?
>
> This seems very trivial, but it has some consequences for my html
> output. Before, I used to give one return after each instance of, say,
> <pb ="53r" />, and that was automatically transformed as a page
> reference followed by a single space. Now, since the <pb> elements are
> no longer getting their returns in 'format and indent' view, the spaces
> are absent, meaning that sometimes the page break and manuscript text
> run into each other. Like so:
>
> <pbn="53r"/>The text
>
> Is rendered *[f. 53r]The text*; but
>
> <pbn="53r"/>
> The text
>
> Is rendered *[f. 53r] The text* (with space).
>
> I know I can sort this in my XML or XSL easily, but I am just trying to
> understand how oXygen handles white space.
>
> Huge thanks!
>
> Sebastiaan
>
> ---
> Dr Sebastiaan Verweij || Hardie Postdoctoral Fellow, Lincoln College
> Research Associate,The Oxford Edition of the Sermons of John Donne
> a: English Faculty, St Cross Building, Manor Road, Oxford OX1 3UL
> t: 0044 (0)1865 271931 || skype: seb.macv
>

McAulay, Elizabeth | 2 Nov 2011 23:50
Favicon

"TEI P5, I Presume?" David Livingstone's 1871 Field Diary

Dear TEI friends,

 

The UCLA Digital Library Program is pleased to announce the publication of Scottish explorer David Livingstone’s 1871 Field Diary: A Multispectral Critical Edition (http://livingstone.library.ucla.edu/1871diary/), together with the Livingstone Spectral Image Archive (http://livingstone.library.ucla.edu/livingstone_archive/). These two publications form part of the David Livingstone Spectral Imaging Project, a collaborative, international effort to use spectral imaging and digital publishing to make available a collection of faded, illegible texts written by Livingstone when stranded without ink or writing paper in Central Africa.

 

Project director Professor Adrian Wisnicki (Birkbeck, University of London and Indiana University of Pennsylvania) brought together a team of imaging specialists to create a series of digital images of the manuscript, from which he was able to transcribe the previously lost document. Wisnicki then encoded the 1871 Diary in TEI P5, guided by Lisa McAulay (UCLA), Doug Emery (Emery IT), and James Cummings (University of Oxford). The TEI transcript was initially prepared as a separate XML file for each folio, and the whole series of images, imaging metadata and TEI transcript are packaged as a unit in the Spectral Image Archive. For example, the TEI XML for page 105 of the Diary is available from this page: http://livingstone.library.ucla.edu/livingstone_archive/Data/DLC/DLC297c_105-130_002v/. Subsequently, McAulay created a single TEI file that includes the transcription for the entire 1871 Diary, which will soon be available on the site as well.

 

The transcription is viewable online, side-by-side with page images in an interface powered by XSLT, and a search interface is powered by Solr.  If you are interested in the encoding guidelines developed for this project, please visit: http://livingstone.library.ucla.edu/1871diary/xml_coding.htm.

 

We welcome all comments, including critiques and suggestions for improvement.

 

 

 

Thanks!

 

Elizabeth (Lisa) McAulay

Librarian for Digital Collection Development

UCLA Digital Library Program

http://digital.library.ucla.edu/

Email: emcaulay <at> library.ucla.edu

 

Ron Van den Branden | 3 Nov 2011 10:43
Picon

Re: "TEI P5, I Presume?" David Livingstone's 1871 Field Diary

Elisabeth,

Congratulations on this edition!

Coincidentally, I heard it being discussed on a Belgian national radio 
show yesterday 
(<http://www.radio1.be/programmas/nieuwe-feiten/dagboek-david-livingstone-ontcijferd>)! 
No mention of XML/TEI, though...

Ron

Malte Rehbein | 4 Nov 2011 11:28
Picon
Favicon

Fwd: Open Letter to the European Commission on Socio-economic Sciences and Humanities research in the new FP, 2014-2020

For your consideration, and apologies for cross-posting.

-------- Original-Nachricht -------- Betreff: Datum: Von: An:
Open Letter to the European Commission on Socio-economic Sciences and Humanities research in the new FP, 2014-2020
Wed, 2 Nov 2011 07:47:56 -0400
European Science Foundation <humanities <at> esf.org>
malte.rehbein <at> uni-wuerzburg.de


Open Letter to EC

Open Letter to the European Commission on Socio-economic Sciences and Humanities research in the new FP, 2014-2020

Dear colleagues,

with this message we would like to invite you to sign an Open Letter addressed to the European Commissioner for Research and Innovation (www.eash.eu/openletter2011 ), alerting her to the vital insights that Socio-economic Sciences and Humanities (SSH) contribute to address Europe’s and the world’s Grand Societal Challenges.

In view of legislative decisions to be taken on the next 100-Billion-worth EU Framework Programme Horizon 2020 (2014-2020), the letter stresses the necessity for a varied and strong research programme in the Socio-economic Sciences and Humanities (SSH): it argues that neglecting such potential contributions as SSH research risks undermining the EU strategy to develop innovative, inclusive and sustainable societies. Yet, there still is a distinct danger of insufficient funding in Horizon 2020 for research areas such as cultural change, demography, education, the economy and globalisation, identity politics and social cohesion, and many others. For background information on these matters see: www.eash.eu/openletter2011.

The Open Letter initiative has grown out of deliberations among a number of European umbrella organisations in the area of SSH, and seeks to bring to the attention of the European Commission and national governments the concerns of the largest research community in Europe.

If you agree, that a substantial and independent SSH-centered research programme should be included in all future European Framework Programmes, we invite you to sign the Open Letter online at www.eash.eu/openletter2011. Please also kindly spread this invitation to sign in your institutions and among your networks.

First results of this initiative will be presented to Commissioner Geoghegan-Quinn on 10 November 2011. We hope to be able to point to a high number of signatures as an expression of a groundswell of support and concern among SSH communities. The collection of signatures will, however, continue after this specific date, as the legislative decision process will last for longer.

Thank you in advance for signing and for supporting this initiative. Do not hesitate to contact us if you have any questions: SSH-letter <at> net4society.eu.

On behalf of the Inter-agency Task Group on SSH.

Professor Milena Zic-Fuchs (Chair ESF Standing Committee for the Humanities)
Sir Roderick Floud (Chair ESF Standing Committee for the Social Sciences)

Why did I receive this mail?

This message was sent to you because you have subscribed to the ESF's Humanities mailing list.
If you no longer wish to be a member of the Humanities mailing list, please use the link below to unsubscribe.

Powered by Mad Mimi®

Sebastiaan Verweij | 4 Nov 2011 11:57
Picon
Picon

Re: oXygen/white space/Format and Indent?

Dear George

Excuse delay in my message (teaching got in the way) - thanks for your full and helpful response! I understand the white space handling better now. 

I will be very interested to see a case when oXygen removed the space 
from a fragment like
<pb n="53r"/>
The text
resulting in
<pb n="53r"/>The text
because as far as I know this should not happen.

You are right, this isn't happening. I get the spaces introduced once there's a carriage return. I was just wondering at how it got there, since there is no space typed after the return, or before the text in the next line. But to oXygen these two must mean something different: 

<pbn="53r"/>The text

<pbn="53r"/>
The text

Anyway - I've addressed the issue in my XSLT (also I'm a total XSL newbie and may approach these problems from the wrong end sometimes!). 

Big thanks, 

Sebastiaan

---
Dr Sebastiaan Verweij || Hardie Postdoctoral Fellow, Lincoln College
Research Associate,The Oxford Edition of the Sermons of John Donne
a: English Faculty, St Cross Building, Manor Road, Oxford OX1 3UL
t: 0044 (0)1865 271931 || skype: seb.macv

Dot Porter | 10 Nov 2011 19:12
Picon

Metadata Analyst position open at Indiana University Libraries

Hi everyone,

Please take note of the following position with has just opened in the
Digital Library Program at Indiana University. We welcome applications
from candidates with strong backgrounds in metadata crosswalking and
XML/XSLT. This is a tenure-track librarian position.

Dot Porter
(Associate Dir. for Content & Services, DLP)

(Full description posted here, also available at )

Metadata Analyst

Assistant Librarian/Associate Librarian

Indiana University Bloomington Libraries

The IU Bloomington Libraries are seeking an energetic, innovative and
service-oriented individual for the newly created position of Metadata
Analyst for the Indiana University Digital Library Program (DLP).

The DLP is a system-wide resource that is dedicated to the production,
maintenance, delivery, and preservation of a wide range of
high-quality networked resources. As part of the Library Technologies
and Digital Libraries division, the DLP assumes a leadership role in
the development and delivery of services to preserve and provide
access to digital content from IU’s libraries, archives, museums,
faculty, and academic departments to local users, citizens of the
state of Indiana, and beyond. The DLP provides a focus for the
development of strategies and initiatives that will help guide the
University's libraries through the transformations necessary to
succeed in the digital age.

The Metadata Analyst reports to the Associate Director for Digital
Library Content and Services and will be responsible for:

    Planning long-term metadata strategies for the IU Libraries and DLP
    Advising on appropriate standards and approaches to creating and
managing metadata for digital projects, collections, and systems
    Working with collection managers, subject specialists, and
catalogers to identify appropriate metadata standards in digital
projects covering a variety of media, including text, music, still
images, audio, and video
    Designing and/or adapting XML schemas, XSLT stylesheets,
Schematron assertions, and RDF ontologies for digital systems and
projects
    Translating metadata between formats and integrating metadata from
a variety of sources
    Participating in the design and development of services to support
preservation of and access to digitized and born-digital content from
IU's collections
    Collaborating with local, national, and international partners on
research to advance the state of the art in digital libraries
    Serving as a liaison between the Digital Library Program and
internal and external partners on collaborative metadata projects
    Performing project management activities for individual digital
collection and systems projects

The Metadata Analyst will work on projects across the spectrum of the
Digital Library Program, including those in support of the Lilly
Library, University Archives, Cook Music Library, Archives of
Traditional Music, Wells Library, and other units on the IU
Bloomington campus, as well as in support of other IU campuses. The
Metadata Analyst will also provide metadata planning and analysis
support for IUScholarWorks, IU’s suite of online scholarly
communication services (based on DSpace and Open Journal System
software), and IU’s Fedora-based digital collections repository.

The Metadata Analyst will also contribute to DLP's involvement in IU's
efforts to support data management and curation for its researchers
and in the groundbreaking work on data publishing and curation to be
undertaken as part of an NSF funded DataNet award. Additional work
will overlap with the HathiTrust Digital Library
(http://hathitrust.org) and the HathiTrust Research Center
(http://www.hathitrust-research.org).

QUALIFICATIONS:
Required:

    ALA-accredited master's degree in library science, information
management/science, or related degree
    Minimum 2 years of experience in metadata analysis and support
    Knowledge of the concepts and applications used in the standards
and practices of organizing information
    Experience in recommending, designing and implementing metadata
schemes for digital library projects
    Experience in the transformation of XML documents using XSLT
    Experience with metadata standards including MARC, MARCXML, EAD,
Dublin Core, METS, MODS, FRBR, VRA and others
    Experience with creation and/or knowledge and management of
digital objects in various text, image, sound, and/or video formats
    Experience with digital repository systems such as Fedora or DSpace
    Excellent written and oral communication skills
    Excellent interpersonal skills and ability to work well with
diverse population of faculty, students, and academic colleagues
    Good organization skills and an aptitude for complex analytical
and detailed work
    Ability to meet the requirements of a tenure-track librarian position

Preferred:

    Demonstrated ability to plan, coordinate, manage, and implement projects
    Experience with programming using Perl or Java
    Experience in writing grant proposals and/or managing grants
    Experience cataloging in MARC

SALARY AND BENEFITS:
Salary is competitive and commensurate with experience and education;
benefits include a university healthcare plan, university-funded base
retirement plan, a 100% university paid group life insurance plan, and
a generous paid time off plan. This is a tenure-track academic
appointment that includes eligibility for sabbatical leaves. For a
full list of benefit programs, please refer to the following
resources:

    Website: http://www.indiana.edu/~uhrs/benefits/neweeo-profe.html
    Video: http://www.indiana.edu/~uhrs/benefitsvideo/academic.html
    Brochure: http://hr.iu.edu/enroll/video.html

APPLICATION REVIEW:

Review of applications will begin on January 1, 2012. The position
will remain open until filled. How to apply.

--

-- 
*~*~*~*~*~*~*~*~*~*~*
Dot Porter (MA, MSLS)
Digital Medievalist, Digital Librarian
Email: dot.porter <at> gmail.com
*~*~*~*~*~*~*~*~*~*~*

Dot Porter | 10 Nov 2011 19:22
Picon

Re: Metadata Analyst position open at Indiana University Libraries

And here is the URL for the online job posting! Sorry about that.

http://www.libraries.iub.edu/index.php?pageId=1410

Dot

On Thu, Nov 10, 2011 at 1:12 PM, Dot Porter <dot.porter <at> gmail.com> wrote:
> Hi everyone,
>
> Please take note of the following position with has just opened in the
> Digital Library Program at Indiana University. We welcome applications
> from candidates with strong backgrounds in metadata crosswalking and
> XML/XSLT. This is a tenure-track librarian position.
>
> Dot Porter
> (Associate Dir. for Content & Services, DLP)
>
>
> (Full description posted here, also available at )
>
> Metadata Analyst
>
> Assistant Librarian/Associate Librarian
>
> Indiana University Bloomington Libraries
>
> The IU Bloomington Libraries are seeking an energetic, innovative and
> service-oriented individual for the newly created position of Metadata
> Analyst for the Indiana University Digital Library Program (DLP).
>
> The DLP is a system-wide resource that is dedicated to the production,
> maintenance, delivery, and preservation of a wide range of
> high-quality networked resources. As part of the Library Technologies
> and Digital Libraries division, the DLP assumes a leadership role in
> the development and delivery of services to preserve and provide
> access to digital content from IU’s libraries, archives, museums,
> faculty, and academic departments to local users, citizens of the
> state of Indiana, and beyond. The DLP provides a focus for the
> development of strategies and initiatives that will help guide the
> University's libraries through the transformations necessary to
> succeed in the digital age.
>
> The Metadata Analyst reports to the Associate Director for Digital
> Library Content and Services and will be responsible for:
>
>    Planning long-term metadata strategies for the IU Libraries and DLP
>    Advising on appropriate standards and approaches to creating and
> managing metadata for digital projects, collections, and systems
>    Working with collection managers, subject specialists, and
> catalogers to identify appropriate metadata standards in digital
> projects covering a variety of media, including text, music, still
> images, audio, and video
>    Designing and/or adapting XML schemas, XSLT stylesheets,
> Schematron assertions, and RDF ontologies for digital systems and
> projects
>    Translating metadata between formats and integrating metadata from
> a variety of sources
>    Participating in the design and development of services to support
> preservation of and access to digitized and born-digital content from
> IU's collections
>    Collaborating with local, national, and international partners on
> research to advance the state of the art in digital libraries
>    Serving as a liaison between the Digital Library Program and
> internal and external partners on collaborative metadata projects
>    Performing project management activities for individual digital
> collection and systems projects
>
> The Metadata Analyst will work on projects across the spectrum of the
> Digital Library Program, including those in support of the Lilly
> Library, University Archives, Cook Music Library, Archives of
> Traditional Music, Wells Library, and other units on the IU
> Bloomington campus, as well as in support of other IU campuses. The
> Metadata Analyst will also provide metadata planning and analysis
> support for IUScholarWorks, IU’s suite of online scholarly
> communication services (based on DSpace and Open Journal System
> software), and IU’s Fedora-based digital collections repository.
>
> The Metadata Analyst will also contribute to DLP's involvement in IU's
> efforts to support data management and curation for its researchers
> and in the groundbreaking work on data publishing and curation to be
> undertaken as part of an NSF funded DataNet award. Additional work
> will overlap with the HathiTrust Digital Library
> (http://hathitrust.org) and the HathiTrust Research Center
> (http://www.hathitrust-research.org).
>
> QUALIFICATIONS:
> Required:
>
>    ALA-accredited master's degree in library science, information
> management/science, or related degree
>    Minimum 2 years of experience in metadata analysis and support
>    Knowledge of the concepts and applications used in the standards
> and practices of organizing information
>    Experience in recommending, designing and implementing metadata
> schemes for digital library projects
>    Experience in the transformation of XML documents using XSLT
>    Experience with metadata standards including MARC, MARCXML, EAD,
> Dublin Core, METS, MODS, FRBR, VRA and others
>    Experience with creation and/or knowledge and management of
> digital objects in various text, image, sound, and/or video formats
>    Experience with digital repository systems such as Fedora or DSpace
>    Excellent written and oral communication skills
>    Excellent interpersonal skills and ability to work well with
> diverse population of faculty, students, and academic colleagues
>    Good organization skills and an aptitude for complex analytical
> and detailed work
>    Ability to meet the requirements of a tenure-track librarian position
>
> Preferred:
>
>    Demonstrated ability to plan, coordinate, manage, and implement projects
>    Experience with programming using Perl or Java
>    Experience in writing grant proposals and/or managing grants
>    Experience cataloging in MARC
>
> SALARY AND BENEFITS:
> Salary is competitive and commensurate with experience and education;
> benefits include a university healthcare plan, university-funded base
> retirement plan, a 100% university paid group life insurance plan, and
> a generous paid time off plan. This is a tenure-track academic
> appointment that includes eligibility for sabbatical leaves. For a
> full list of benefit programs, please refer to the following
> resources:
>
>    Website: http://www.indiana.edu/~uhrs/benefits/neweeo-profe.html
>    Video: http://www.indiana.edu/~uhrs/benefitsvideo/academic.html
>    Brochure: http://hr.iu.edu/enroll/video.html
>
> APPLICATION REVIEW:
>
> Review of applications will begin on January 1, 2012. The position
> will remain open until filled. How to apply.
>
> --
> *~*~*~*~*~*~*~*~*~*~*
> Dot Porter (MA, MSLS)
> Digital Medievalist, Digital Librarian
> Email: dot.porter <at> gmail.com
> *~*~*~*~*~*~*~*~*~*~*
>

--

-- 
*~*~*~*~*~*~*~*~*~*~*
Dot Porter (MA, MSLS)
Digital Medievalist, Digital Librarian
Email: dot.porter <at> gmail.com
*~*~*~*~*~*~*~*~*~*~*

Steffen Wolfrum | 10 Nov 2011 21:31
Picon

anchors and targets

Hi,

as far as I have understood there is 
one <anchor xml:id="123"/> with one <ref target="#123"/>

and there is also this:
several <anchor xml:id="123"/> ... <anchor xml:id="456"/> ... <anchor xml:id="789"/> ...  with one <ref
target="#123 #456 #789"/>

But I am looking for a way to encode the opposite in a handy way:

one <head>
<anchor xml:id="123"/>
<anchor xml:id="456/>
<anchor xml:id="123"/>
...
</head>

with several <ref target="#123"/> ... <ref target="#456"/> ... <ref target="#789"/>

Is there for this case some short version like <anchor xml:id="123 456 789"/>?!

Steffen