TAN Tutorial 1

Preparing a TEI Corpus for the Text Alignment Network

Session 1

Vocabulary

Objectives

  1. Learn how TAN handles vocabulary and semantic-based identification
  2. Create a TAN-voc file
  3. Understand TAN metadata structures and principles
  4. Understand validation phases, and how to interpret validation reports

Discussion

Semantics, Vocabulary, URIs

Exercise 1

Discussion

TAN-voc

Exercise 2

Discussion

TAN-voc body

Exercise 3

Discussion

TAN metadata

Exercise 4

Demonstration
Result of demonstration
Your turn

TAN-voc tips and tricks

  • For help in some attributes, type: ???
  • Looking for vocabulary?
    • subdirectory vocabulary
    • Guidelines appendix

Reminders

General

  • Local vocabulary overrules TAN vocabulary

Name making

  • @xml:id: case sensitive, no spaces
  • <name>: case insensitive, spaces = hyphen = underscore

Name calling

  • @which: only one value, from a <name>
  • other attributes: many values, space delimited (use underscore to substitute for spaces in names)

Recap

What we've learned

  1. How TAN handles vocabulary and semantic-based identification
  2. How to make a TAN-voc file
  3. TAN metadata structures and principles
  4. How to use and interpret validation

General discussion

Finish, evaluate exercises