James Cummings | 3 Jun 2012 18:25
Picon
Picon
Favicon

TEI Technical Council meeting minutes

A colleague pointed out that I had forgotten to announce the TEI 
Technical Council meeting minutes from our face to face meeting 
in Ann Arbor.  These are available at:

http://www.tei-c.org/Activities/Council/Meetings/tcm50.xml

We covered a large number of Bugs and Feature Requests submitted 
on sourceforge as well as discussing a number of larger topics. 
We have since been working through these and resolving as a many 
as possible in preparation for our next release of the TEI 
Guidelines.

-James

--

-- 
Dr James Cummings, InfoDev,
Computing Services, University of Oxford

Marco BÜCHLER | 4 Jun 2012 10:43
Favicon

Re: lucene/Solr import filters

Comes this week. Was on vacation in May.
Best,
Marco

On 05/22/2012 11:59 PM, Stuart A. Yeates wrote:
> On Wed, May 23, 2012 at 1:28 AM, Peter MacDonald<pmacdona <at> hamilton.edu>  wrote:
>> Stuart:
>>
>> We have a new repository here at our small college that uses
>> Islandora/Fedora as its delivery system and one of the collections consists
>> of 456 TEI files. We use Solr to index our TEI files, but it has been a
>> challenge to configure Solr to meet our delivery needs. You can see it at
>> work here:
>>
>> http://cwl.dhinitiative.org/
>>
>> So far we only index roleName, forename, surname, orgName, geogName,
>> geogFeat, and sic. We are also experimenting with full persName and
>> placeName elements, but those are not quite ready for primetime because our
>> TEI files are not yet 100% consistent in their encoding of these elements.
>> We eventually want to use the placeName elements in a mapping interface.
>>
>> Anyway, I wonder if you would be willing to share the file you use for
>> configuring Solr for TEI. I've never seen it done by anyone but us and there
>> might be a more efficient way of doing it. I would be happy to send you mine
>> if you were interested.
> I'm not sure that our particular config will help you very much, but
> I'm happy to share what I learnt.
>
> * We use the same XSLT tool chain for solr as I use for HTML, so we
(Continue reading)

Benjamin Wolff Bohl | 5 Jun 2012 19:36
Picon
Favicon

TEI, XSL-FO, and Glyphs

Hi everybody,
I was wondering, whether anybody has a best practice way of handling 
Glyphs (especially asian ones) when transformin TEI files to PDF using 
oXygen. Of course if I select a font containing the glyphs but that 
would make me check on every glyph beforehand. Is there a way of 
fallback handling glyphs not available in the font my xml-fo file specifies?

Best wishes,
Benjamin

--

-- 
Benjamin Wolff Bohl

***********************************************************
Edirom - Projekt "Digitale Musikedition"
Musikwissenschaftliches Seminar Detmold/Paderborn
Gartenstraße 20
D – 32756 Detmold

Tel. +49 (0) 5231 / 975-669
Fax: +49 (0) 5231 / 975-668

http://www.edirom.de
***********************************************************

Tommie Usdin | 5 Jun 2012 20:14
Favicon

[ANN] Call for Late-breaking News at Balisage 2012

NOTE TO TEI Members: As TEI members you are eligible for discount 
registration at Balisage. To get the discount simply select the appropriate 
discount registration fee and put "TEI Member" in the Discount Category.

==========================================================================

It is not too late for you to speak at Balisage!

The peer-reviewed part of the Balisage 2011 program has been
scheduled (http://www.balisage.net/2012/Program.html) But the Balisage
world moves quickly, so every year some speaking slots are set aside
for late-breaking news, updates, and fast-breaking stories.

The time is now. Proposals for late-breaking presentations at
Balisage are due June 15th. Details here:

<http://www.balisage.net/latebreaking-call.html>http://www.balisage.net/latebreaking-call.html

Your proposal should be either:

a) really late-breaking (it reports on something that happened in
  the last month or two) or

b) a well-developed paper, an extended paper proposal, or a very long
  abstract with references on a topic related to Markup and not
  already on the 2012 conference program.

The competition for late-breaking slots is fierce, so start working
on your late-breaking proposal today!

(Continue reading)

Marcus Bingenheimer | 6 Jun 2012 06:46
Picon

TEI, XSL-FO, and Glyphs

Hi Benjamin,

It is some years ago that I have used XSL-FO with Chinese, so things might have changed, but here are my five cents anyway:
 
I was wondering, whether anybody has a best practice way of handling Glyphs (especially asian ones) when transformin TEI files to PDF using oXygen. Of course if I select a font containing the glyphs but that would make me check on every glyph beforehand.

If with "glyphs" you mean CKJ-Ideographs any large font like PMingLiU ( 新細明體)(on Windows) should work.
If you must assume that your document contains (very) rare characters (Unicode CJK Unified Ideographs Extension B, C or D) then you should see a square in the output.
 
Is there a way of
fallback handling glyphs not available in the font my xml-fo file specifies?

I seem to remember that the default Apache XSL-FO engine (Apache FOP) in oXygen would not work for the PMingLiU Extension B Extension, but you could try with the Han Nom A and B fonts.
Anyway the default engine is relatively weak. I am involved with two projects that include text with lots of rare CJK-characters. We produce PDFs from TEI via Open Office, which provides sufficient formatting options for our purposes, certainly more than XSL-FO, but of course less than LateX or Indesign.

all the best

marcus

--
Dr. Marcus Bingenheimer 馬德偉
Department of Religion, Temple University

Conal Tuohy | 6 Jun 2012 07:33
Picon
Favicon

Re: TEI, XSL-FO, and Glyphs

Hi Benjamin

Years ago I remember generating font-metrics files and building a 
userconfig.xml file for FOP, in order to be able to print special 
glyphs. I don't remember all the details, but this, for instance, might 
be helpful: http://www.firebirdsql.org/manual/fontembed.html

FOP also has a font-substitution feature which may be helpful: you could 
group all your fonts into one substitution group and rely on this 
feature to choose the best font: 
http://xmlgraphics.apache.org/fop/1.0/fonts.html#substitution

In the worst case you may have to explicitly map characters to the 
appropriate fonts:

You could specify in your XSL-FO document which font to use for a 
particular character. You could do this with an XSLT that post-processes 
the XSL-FO, matching particular characters, and wrapping them in 
<fo:inline elements> in order to specify a font with a matching glyph. 
See http://xmlgraphics.apache.org/fop/faq.html#pdf-characters

To identify which fonts contains glyphs with a particular character, you 
could use the FOP font metrics tools, or you could try this (Microsoft 
Windows) utility program: http://wiki.digitalclassicist.org/Find_Glyph

I hope that's helpful!

Conal

On 06/06/12 03:36, Benjamin Wolff Bohl wrote:
> Hi everybody,
> I was wondering, whether anybody has a best practice way of handling 
> Glyphs (especially asian ones) when transformin TEI files to PDF using 
> oXygen. Of course if I select a font containing the glyphs but that 
> would make me check on every glyph beforehand. Is there a way of 
> fallback handling glyphs not available in the font my xml-fo file 
> specifies?
>
> Best wishes,
> Benjamin
>

--

-- 
Conal Tuohy
eResearch Business Analyst
Victorian eResearch Strategic Initiative
+61-466324297

Enrico Cima | 6 Jun 2012 18:06
Picon
Favicon

Authorities Heading

Dear Tei-List,

I would like to know which is the best way to encode the authority heading of a personal or corporate name.
Something like an index with the authorized form, refs and see also...

Thank you,
 
Enrico
David Sewell | 6 Jun 2012 18:38
Favicon

Digital Humanities Congress 2012 (fwd)

[Forwarded by request. See http://www.shef.ac.uk/hri/dhc2012 for more 
information.]

---------- Forwarded message ----------
Date: Wed, 6 Jun 2012 17:23:57 +0100
From: Clare Mills <c.e.mills <at> sheffield.ac.uk>
To: info <at> tei-c.org, news <at> tei-c.org
Subject: Digital Humanities Congress 2012

Dear Colleague,

I'm pleased to let you know that registration has opened for the Digital
Humanities Congress 2012. This is a new conference to promote the sharing of
knowledge, ideas and techniques within the digital humanities.

Hosted by the University of Sheffield's Humanities Research Institute the
conference will take place from 6 - 8 September 2012.

The keynote speakers will be:

*	Professor Andrew Prescott (Head of Department, Department of Digital
Humanities, King's College London)
*	Professor Lorna Hughes (University of Wales, Chair in Digital
Collections at the National Library of Wales)
*	Professor Philip Ethington (Professor of History and Political
Science, University of Southern California and Co-Director of the USC Center
for Transformative Scholarship)

Early bird discounts are available until 30 June.

For further details and registration visit:

http://hridigital.shef.ac.uk/dhc2012

Please consider forwarding this on to any colleagues you think might be
interested in attending.

With best wishes,

Clare

Clare Mills

HRI Coordinator

Humanities Research Institute

University of Sheffield

34 Gell Street

Sheffield

S3 7QY

Tel: 0114 222 9890

Fax: 0114 222 9894

Email: c.e.mills <at> sheffield.ac.uk

Web: http://www.shef.ac.uk/hri

Times Higher Education University of the Year

Benjamin Wolff Bohl | 7 Jun 2012 09:44
Picon
Favicon

Re: TEI, XSL-FO, and Glyphs

Hi Marcus,
hi Conal,

thank you for your advice.
As I read from what you've said I will have to "know", e.g. modify my data.
What I've done so far is generate font-etrics files for all teh fonts I 
want to embed and directly reference them in my fop-config.xml. Moreover 
I decided on Using "Aria Unicode MS" font for anything "foreign". In 
order grasp the glyphs and to automatically tag them with <foreign> I 
used regular expressions searching for certain unicode codepoint ranges 
to search all my TEI files.

Thanks,
Benjamin

Benjamin Wolff Bohl

***********************************************************
Edirom - Projekt "Digitale Musikedition"
Musikwissenschaftliches Seminar Detmold/Paderborn
Gartenstraße 20
D – 32756 Detmold

Tel. +49 (0) 5231 / 975-669
Fax: +49 (0) 5231 / 975-668

http://www.edirom.de
***********************************************************


Am 06.06.2012 07:33, schrieb Conal Tuohy:
> Hi Benjamin
>
> Years ago I remember generating font-metrics files and building a 
> userconfig.xml file for FOP, in order to be able to print special 
> glyphs. I don't remember all the details, but this, for instance, 
> might be helpful: http://www.firebirdsql.org/manual/fontembed.html
>
> FOP also has a font-substitution feature which may be helpful: you 
> could group all your fonts into one substitution group and rely on 
> this feature to choose the best font: 
> http://xmlgraphics.apache.org/fop/1.0/fonts.html#substitution
>
> In the worst case you may have to explicitly map characters to the 
> appropriate fonts:
>
> You could specify in your XSL-FO document which font to use for a 
> particular character. You could do this with an XSLT that 
> post-processes the XSL-FO, matching particular characters, and 
> wrapping them in <fo:inline elements> in order to specify a font with 
> a matching glyph. See 
> http://xmlgraphics.apache.org/fop/faq.html#pdf-characters
>
> To identify which fonts contains glyphs with a particular character, 
> you could use the FOP font metrics tools, or you could try this 
> (Microsoft Windows) utility program: 
> http://wiki.digitalclassicist.org/Find_Glyph
>
> I hope that's helpful!
>
> Conal
>
>
>
> On 06/06/12 03:36, Benjamin Wolff Bohl wrote:
>> Hi everybody,
>> I was wondering, whether anybody has a best practice way of handling 
>> Glyphs (especially asian ones) when transformin TEI files to PDF 
>> using oXygen. Of course if I select a font containing the glyphs but 
>> that would make me check on every glyph beforehand. Is there a way of 
>> fallback handling glyphs not available in the font my xml-fo file 
>> specifies?
>>
>> Best wishes,
>> Benjamin
>>
>
>

Kevin Hawkins | 7 Jun 2012 15:18

Re: Authorities Heading

Enrico,

The Best Practices for TEI in Libraries, a community customization now 
included in the list at http://www.tei-c.org/Guidelines/Customization/ , 
gives examples of personal and corporate authority headings in the TEI 
header:

http://www.tei-c.org/SIG/Libraries/teiinlibraries/main-driver.html#index.xml-body.1_div.4_div.1_div.6

As you can see, these Best Practices simply indicate to use the 
authority form but don't give a way to note that a particular value of 
<author>, <editor>, etc. is from an authority file.

--Kevin

On 2:59 PM, Enrico Cima wrote:
> Dear Tei-List,
>
> I would like to know which is the best way to encode the authority
> heading of a personal or corporate name.
> Something like an index with the authorized form, refs and see also...
>
> Thank you,
> Enrico