Manuscript and Physical Document Encoding

Julia Flanders

2007-05-13/14

Some philosophical issues

Note that these are really two separate, though closely related issues:

There are some aspects of the encoding of physical document structures which are common to print and MS documents, so we treat them together

Similarly, there are some issues having to do with our perception of the physical exemplar (esp. having to do with legibility and conjecture) that are common to both.

By and large, the TEI is focused, methodologically, on the text as linguistic rather than material information: its encoding provisions for genre, language, content are rich and detailed, while its provisions for material information are fairly minimal. Textual materiality poses some interesting conceptual problems for markup systems:

We raise these here mostly as signposts to issues that may be interesting, rather than trying to give an adequate treatment here; there's a lot of interesting debate on this topic and if you're interested we can provide some pointers

For now, we're just going to cover some practical encoding points.

Encoding the physical document

In TEI, the primary emphasis of the encoding is on the text stream (paragraphs, divisions, and so forth)

An Alternate View of the Universe

Note that the physical structure can also be represented as the primary structure, for instance using div type="page | signature | etc."

Illegibility and unclarity

Additions, deletions

add and del: these contain brief additions or deletions in the text; provide a way to indicate where located, whose handwriting

addSpan and delSpan: these are empty elements which mark the start of an addition or deletion, and point (with the spanTo attribute) to the end of the span, which is marked by an anchor that carries an xml:id.

Note that for spans and handwriting identification, need to declare Chapter 18 (transcr)

Handwriting Identification

To identify the person whose handwriting is present, and also to provide information about the handwriting (what style, what ink, etc.)

Handwriting Identification: in the text