Type of text, general keywords, other taxonomies

There is no specific TEI tag for the category or type of text, so different projects have found different approaches to the encoding of this information. One approach was to use a bespoke <rs> (reference string) element, with type of "textType", around the relevant word in the edition, often in the title:

<title>
 <rs type="textType">Honours</rs> for an emperor
</title>
(InsAph: 4.311)

Another approach is to include the category of text in a list of keywords in <msContents>/<summary>:

<msContents>
 <summary corresp="#doc1 #doc4">
  <seg>Building inscription</seg>
  <seg>Demonstrative</seg>
 </summary>
</msContents>
<!-- example from IOSPE V 2 -->

Perhaps the most TEI-compliant, although least specific, solution would be to include text category information in <textClass>/<keywords>:

<textClass>
 <keywords scheme="hgv">
  <term>Brief (privat)</term>
 </keywords>
</textClass>
(Source)

General keywords or index terms of any other kind can be tagged (by default using <rs>, but any other semantically appropriate element may be captured for this purpose) anywhere in the document where they appear in running text: e.g. in the Greek or Latin edition, translations, commentary, or any of the descriptive fields in the header. Depending on the needs of a particular project and any internal or external taxonomies in use, such a tag might look something like:

<provenance type="observed">
 <p>Outside the <rs type="monuListkey="db934">Cyrene Sculpture Museum</rs>.</p>
</provenance>

The above examples could be indexed using project-specific code for the purpose of generating indices, a search form, or for sorting or filtering texts. If a taxonomy is also in use, the <rs> may carry ref for the purpose of normalizing to a term in this taxonomy. In the example below, normalization is to the EDR vocabulary for materials (see Rei Material GoogleDoc):

<material ref="edr:marmor">Blue-veined white marble.</material>

You can align your definitions by aligning to a local or to an external Controlled Vocabulary.

Subjects or keywords describing text

General keywords that do not belong in any particular part of the text or supporting data of the edition may be gathered conveniently in a <keywords> element in the header. These keywords will generally be used only for purposes of search, indexing, or compatibility within a larger collection, rather than appearing in the human-readable part of the edition.

<textClass>
 <keywords scheme="IRCyr">
  <term>
   <placeName type="ancientRegionkey="Cyrene">Cyrenaica</placeName>
  </term>
  <term>
   <placeName type="modernCountryref="iso3166-1:LY">Libya</placeName>
  </term>
  <term>
   <placeName
     type="modernFindspot"
     ref="http://sws.geonames.org/81584">
Marsa
       Suza</placeName>
  </term>
 </keywords>
</textClass>

For keywords, subject terms or descriptions that apply specifically to the categorization or definition of the text, the <msContents> and <summary> may be used to contain either a list of terms or general prose description, as in the examples below:

<msContents>
 <summary corresp="edr:oper-publ-priv-que">
  <seg xml:lang="ru">Строительная надпись.</seg>
  <seg xml:lang="en">Building inscription.</seg>
 </summary>
</msContents>
<msContents>
 <summary>This is the first 2 lines of P.Lond. IV 1370, a letter (entagion) from the
   governor of Egypt, Kurrah b. Sharik, to the pagarch, Basilius, seeking remedy of
   a deficit in the "embola," or grain tax</summary>
</msContents>
(Source)

Responsibility for this section

  1. Gabriel Bodard, author
  2. Simona Stoyanova, author

EpiDoc version: 8.19

Date: 2014-07-31