TEI/EpiDoc provides mechanisms for encoding languages and scripts (writing systems) as they relate to the contents of an EpiDoc file and to the text(s) described and transcribed therein. In doing so, we make use of Internet standards for the identification of these languages and scripts. This portion of the Guidelines addresses all relevant aspects.
TEI and EpiDoc follow the best current practice outlined in the Network Working Group's RFC 5646: Tags for Identifying Languages, which establishes the norms for same on an Internet-wide basis. The RFC and supporting documents define a syntax for creating short strings of characters (‘language tags’) that function as unique identifiers for any desired combination of language and script. These tags are composed of ‘subtags’ for language qua language, writing sysem (script), and regional and dialectical variation. The RFC also establishes a process for registration and maintenance of these subtags by the Internet Assigned Numbers Authority.
A valid EpiDoc file must make use of subtags recorded in the IANA Language Subtag Registry. Many EpiDoc creators will already be familiar with some of these codes from other digital projects, for example:
When the IANA registry does not provide appropriate codes, then an EpiDoc project may devise "private use subtags", so long as they are internally defined in the EpiDoc file as outlined in the following paragraph and so long as they conform syntactically to the specifications laid out in RFC 5646, sections 2.1: Syntax and 4.6: Considerations for Private Use Subtags. For example, the Campā Inscriptions team determined that the two Cham language subtags (cja = Western Cham and cjm = Eastern Cham) and the associated script subtag (Cham) were substantively different from the ancient Cham language and script represented in the inscriptions. Therefore the private use subtag "x-oldcam-latn-ci" was invented and given the project-specific meaning "Old Cam language in Old Cam script transliterated in Latin characters." Whenever possible, EpiDoc projects and practitioners should undertake to register new subtags with the IANA for the benefit of others. A procedure for same is set out in RFC 5646 Section 3.5.
Describe langUsage here.