3 Mar 2003 21:14
ANN: RefDB 0.9.2
Markus Hoenicka <markus.hoenicka <at> MHOENICKA.DE>
2003-03-03 20:14:28 GMT
2003-03-03 20:14:28 GMT
Hi all, this is an announcement of the latest version of RefDB, sent approx. annually in order not to clutter the mailing list. RefDB is a reference manager and bibliography tool for markup languages. RefDB uses a client/server model to allow easy sharing of reference data between researchers but it works just fine on a single workstation. Data are stored in a SQL database server. MySQL and PostgreSQL are supported as external database servers, but there's a SQLite-based embedded SQL engine available too. DocBook SGML and XML as well as TEI XML are supported out of the box, but other document types can be added as needed. Bibliography and citation styles can be supplied as XML documents according to publisher's specifications. Document processing is Makefile-based, so e.g. "make pdf" is essentially all it takes to turn your TEI document containing RefDB-compatible citations into a PDF document with formatted citations and a bibliography. The RefDB homepage (http://refdb.sourceforge.net) has all the details, an extensive manual (> 200 pages), a short tutorial targeted at new users, and a few example documents. The RefDB sources as well as a starterkit containing the RefDB sources as well as the sources of all required libraries and Perl modules are available at http://sourceforge.net/projects/refdb/. regards, Markus(Continue reading)
Differentiating soft from hard hyphens can be quite the trick. One
(automated) method that has trickled through my brain (the brainchild
of linguist Jacque Russom, if I recall correctly) but I have never
coded is as follows.
* Initially encode the file with "-" for all hyphens and "<lb/>" for
all line-breaks.
* Read the file in and build a word-list of all words that do *not*
have "-<lb/>" in 'em (should use clever TEI parsing software that
knows how to build words from TEI-encoded text, e.g. to extract
SOnum "duck" from "du<sic corr='ck'>kc</sic>"). Note that you might
want to create the word-list from the single file you are operating
on, from the corpus of all your texts, or (as we at the WWP would
RSS Feed