TAN-A-tok: alignments may claim bitext relations and reuse type
Formal Definition
@xml:id?, @bitext-relation?, @reuse-type?
Used by: ~alignment
TAN-A-tok: alignments must contain one or more toks
Formal Definition
(<tok> | ~tok-sequence)+
Used by: ~alignment
Formal Definition
(@cert | (@cert, @cert2))
Used by: ~claim
, ~tok-with-src-and-cont
, ~cert-opt
, ~certainty-stamp
Formal Definition
(@cert | (@cert, @cert2))?, ~ed-stamp?
Used by: ~alignment
, ~test-pattern
, ~TAN-LM-item
, ~lexeme
, ~morph
Formal Definition
<comment>
Used by: ~split
, ~realignment
, ~alignment
, ~feature-pattern
, ~feature-pattern-no-code
, ~category
, ~decl-div
, ~decl-filt
, ~decl-filt-norm
, ~func-replace
, ~decl-pattern-default
, ~decl-pattern-no-id
, ~decl-pattern-language
, ~decl-group-type
, ~TAN-head
, ~TAN-body
, ~nonsource-rights
, ~inclusion-item
, ~key-item
, ~source-item
, ~source-rights
, ~see-also-item
, ~decl-opt
, ~agent-list
, ~role-list
, ~TAN-LM-item
Formal Definition
( {[TAN-class-2 (~source-refs
):]@src
} OR {[TAN-core (~source-refs
):] {empty}} |@work
), (<div-ref>
|<tok>
)+
Used by: ~textual-reference
Formal Definition
{empty}@cont
TAN-class-1: Class 1 files must declare a single work, perhaps one version, one or more divisions, one or more recommended tokenizations, perhaps one filter, and perhaps one
Formal Definition
(<work> & <version>? & <div-type>+ & <token-definition>* & <filter>?)
Used by: ~declaration-items
TAN-class-1: Replacements that have been made to an XML source file.
Formal Definition
<replace>
Used by: ~decl-filter-content
TAN-A-div: declarations by default have ids
TAN-A-tok: all declarations must have ids
TAN-c: declarations by default have ids
TAN-key: all declarations must have ids
TAN-mor: no declarations may have ids
TAN-core: Option to allow an @xml:id in children of <declarations>
Formal Definition
@xml:id
@xml:id
@xml:id
@xml:id
{empty}{empty}
Used by: ~decl-filt-norm
, ~decl-pattern-default
, ~decl-pattern-language
TAN-class-1: Reserved for declarations specific to individual types of class 1 files
Formal Definition
{empty}
Used by: ~declaration-items
Formal Definition
~ed-stamp
?, (~inclusion
| ( {[TAN-A-div (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-A-tok (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-c (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-key (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-mor (~decl-id-ref-opt
):] {empty}} OR {[TAN-core (~decl-id-ref-opt
):] {empty}}, (<comment>
* & ((<IRI>
+,~metadata-human
) |@which
))))
Used by: ~decl-brel
, ~decl-reus
, ~decl-place
, ~decl-topic
, ~decl-verb
, ~decl-unit
, ~decl-pers
, ~decl-mode
, ~decl-scri
, ~decl-work
, ~decl-vers
Formal Definition
~ed-stamp
?, (~inclusion
| ( {[TAN-A-div (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-A-tok (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-c (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-key (~decl-id-ref-opt
):]@xml:id
} OR {[TAN-mor (~decl-id-ref-opt
):] {empty}} OR {[TAN-core (~decl-id-ref-opt
):] {empty}}, (<comment>
* & (<for-lang>
*, ((<IRI>
+,~metadata-human
) |@which
)))))
Used by: ~decl-filt-tlit
Formal Definition
~ed-stamp?, (~inclusion | (<comment>* & ((<IRI>+, ~metadata-human) | @which)))
TAN-core: declaration-items to be filled out by each TAN-class-X.rnc file
TAN-LM-core: TAN-LM files must each have at least one lexicon and one TAN-mor declaration
Formal Definition
(<token-definition>
* &<suppress-div-types>
* &<rename-div-ns>
* & )(<token-definition>
* &<suppress-div-types>
* &<rename-div-ns>
* &<bitext-relation>
+ &<reuse-type>
++ &<group-type>
*)<group-type>
*( (<work>
&<version>
? &<div-type>
+ &<token-definition>
* &<filter>
?) & {empty}){empty}(<token-definition>
* &<suppress-div-types>
* &<rename-div-ns>
* &<lexicon>
+ &<morphology>
+ &<group-type>
*)
Used by: ~decl-opt
TAN-core: String that specifies a range of divs using the div-ref pattern joined by a hyphen or a comma.
TAN-core: String that specifies a single <div>
Formal Definition
string (pattern (\w+([^\w\-]\w+)*)|.*\?\?\?.*)
Used by: ~pointer-to-div-item
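The div-ref pattern above can be tried out with a short script. This is only a sketch: the helper name `is_div_ref` is hypothetical, and Python's `\w` is an approximation of the XML Schema `\w` class, so edge cases involving unusual characters may differ from a real schema validator.

```python
import re

# Pattern copied from the formal definition above. XML Schema patterns are
# implicitly anchored, so fullmatch() is used to mimic that behaviour.
DIV_REF = re.compile(r"(\w+([^\w\-]\w+)*)|.*\?\?\?.*")


def is_div_ref(value: str) -> bool:
    """Return True if the whole string fits the div-ref pattern (hypothetical helper)."""
    return DIV_REF.fullmatch(value) is not None


if __name__ == "__main__":
    for sample in ["1.2.3", "Gen 1 1", "1-2", "1..2", "???"]:
        print(repr(sample), is_div_ref(sample))
```

Word characters may be joined by any single character that is neither a word character nor a hyphen, so `1-2` is rejected here (the hyphen is reserved for ranges), while any string containing `???` is accepted.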
TAN-core: For more see the section called “Flattened References, and the Leaf Div Uniqueness Rule”
Formal Definition
string (pattern (\w+([^\w\-]\w+)*)(((\s*-\s*)|(\s*,\s+))(\w+([^\w\-]\w+)*))*|.*\?\?\?.*)
Used by: ~pointer-to-div-range
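As a rough illustration of the range pattern above, the following sketch checks a few strings against it. The function name `is_div_range_ref` is hypothetical, and Python's `\w` and `\s` only approximate their XML Schema counterparts.

```python
import re

# Pattern copied from the formal definition above; fullmatch() stands in for
# the implicit anchoring of XML Schema patterns.
DIV_RANGE_REF = re.compile(
    r"(\w+([^\w\-]\w+)*)(((\s*-\s*)|(\s*,\s+))(\w+([^\w\-]\w+)*))*|.*\?\?\?.*"
)


def is_div_range_ref(value: str) -> bool:
    """Return True if the whole string fits the div-range-ref pattern (hypothetical helper)."""
    return DIV_RANGE_REF.fullmatch(value) is not None


if __name__ == "__main__":
    for sample in ["1.1-1.5", "1.1 - 1.5", "1, 3", "1 - 2 - 3", "1.1-"]:
        print(repr(sample), is_div_range_ref(sample))
```

A trailing hyphen, as in `1.1-`, is rejected because every hyphen or comma must be followed by another div reference.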
Formal Definition
@div-type-ref
Used by: ~div-type-ref-cluster
, ~decl-supp-div-type
, ~decl-rename-div-n
TAN-core: Editorial stamp: who created or edited the enclosed data and when.
Formal Definition
@ed-who, @ed-when
Used by: ~work-equiv
, ~div-type-equiv
, ~div-type-ref-cluster
, ~split
, ~anchor-div-ref-item
, ~reanchor-div-ref-item
, ~realignment
, ~claim-div-ref-item
, ~TAN-key-item
, ~feature-pattern
, ~feature-pattern-no-code
, ~category
, ~text-div
, ~claim
, ~complex-subject
, ~object-element
, ~locus
, ~complex-rationale
, ~decl-div
, ~decl-filt
, ~decl-filt-norm
, ~func-replace
, ~decl-supp-div-type
, ~decl-rename-div-n
, ~tok-attr-core
, ~certainty-stamp
, ~decl-pattern-default
, ~decl-pattern-no-id
, ~decl-pattern-language
, ~decl-group-type
, ~decl-tok-def
, ~IRI-gen-ref
, ~loc-self
, ~loc-src
, ~metadata-desc
, ~TAN-root
, ~TAN-head
, ~TAN-body
, ~body-group
, ~nonsource-rights
, ~inclusion-item
, ~key-item
, ~source-item
, ~source-rights
, ~see-also-item
, ~relationship
, ~decl-opt
, ~agent-list
, ~role-list
, ~agent-role-list
, ~change-list
, ~decl-alias
, ~decl-morph
, ~decl-lexi
, ~TAN-LM-item
TAN-core: Reference to an external digital entity that is not a TAN file
Formal Definition
((<IRI>+, ~metadata-human, <checksum>*, <location>+) | @which)
Used by: ~TAN-key-item
, ~source-item
, ~see-also-item
, ~decl-lexi
TAN-core: Reference to an external digital entity that is a TAN file; unlike other types, the keyword-ref option must be turned on and off for specific elements (e.g., not advisable for <inclusion>)
Formal Definition
(@href | (<IRI>, ~metadata-human, <checksum>*, <location>+))
Used by: ~TAN-key-item
, ~metadata-human
, ~inclusion-item
, ~key-item
, ~source-item
, ~see-also-item
, ~decl-morph
TAN-core: Reference to self as digital entity (i.e., TAN file)
Formal Definition
~metadata-human, <master-location>*
Used by: ~TAN-head
TAN-core: Reference to an external non-digital entity (e.g., agents, roles, works, topics)
Formal Definition
((<IRI>+, ~metadata-human) | @which)
Used by: ~TAN-key-item
, ~feature-pattern
, ~feature-pattern-no-code
, ~decl-div
, ~decl-filt-norm
, ~checksum
, ~decl-pattern-default
, ~decl-pattern-no-id
, ~decl-pattern-language
, ~decl-group-type
, ~nonsource-rights
, ~source-item
, ~source-rights
, ~see-also-item
, ~relationship
, ~agent-list
, ~role-list
, ~decl-lexi
TAN-mor: TAN-R-mor files declare the morphological features that are allowed for a given language
Formal Definition
<feature>
Used by: ~feature-list
, ~category
Formal Definition
~ed-stamp?, (~inclusion | (@code, (<comment>* & ((<IRI>+, ~metadata-human) | @which))))
Used by: ~feature
Formal Definition
~ed-stamp?, (~inclusion | (<comment>* & ((<IRI>+, ~metadata-human) | @which)))
Used by: ~category-feature
TAN-class-2: Option to include an internal id. Not needed in TAN-LM files.
Formal Definition
@xml:id
{empty}
Formal Definition
@include, @help?
Used by: ~work-equiv
, ~div-type-equiv
, ~split
, ~realignment
, ~alignment-inclusion-opt
, ~TAN-key-item
, ~feature-pattern
, ~feature-pattern-no-code
, ~category
, ~test-pattern
, ~text-div
, ~claim
, ~decl-div
, ~decl-filt-norm
, ~func-replace
, ~decl-supp-div-type
, ~decl-rename-div-n
, ~decl-pattern-default
, ~decl-pattern-no-id
, ~decl-pattern-language
, ~decl-group-type
, ~decl-tok-def
, ~body-group
, ~nonsource-rights
, ~key-item
, ~source-item
, ~source-rights
, ~see-also-item
, ~relationship
, ~agent-list
, ~role-list
, ~agent-role-list
, ~decl-alias
, ~decl-morph
, ~decl-lexi
, ~TAN-LM-item
Formal Definition
@xml:id
Used by: ~decl-id-ref-opt
, ~source-id-opt
, ~alignment-attributes-non-class-2
, ~decl-div
, ~id-option
, ~decl-group-type
, ~inclusion-item
, ~agent-list
, ~role-list
, ~decl-alias
, ~decl-morph
, ~decl-lexi
, ~TAN-LM-item
TAN-core: Any generic IRI identifier.
Formal Definition
anyURI (pattern [a-zA-Z][\-.+a-zA-Z0-9]+:\S+)
Used by: ~IRI-gen-ref
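The generic IRI pattern above is loose: it asks only for a scheme of at least two characters, a colon, and a run of non-space characters. A minimal sketch of that check follows; the name `looks_like_iri` is hypothetical, and the check mirrors only the pattern, not the broader constraints of the anyURI datatype.

```python
import re

# Pattern copied from the formal definition above.
IRI_PATTERN = re.compile(r"[a-zA-Z][\-.+a-zA-Z0-9]+:\S+")


def looks_like_iri(value: str) -> bool:
    """Return True if the whole string fits the IRI pattern (hypothetical helper)."""
    return IRI_PATTERN.fullmatch(value) is not None


if __name__ == "__main__":
    for sample in ["http://example.org/x", "urn:isbn:123", "a:b", "http://a b"]:
        print(repr(sample), looks_like_iri(sample))
```

Because the scheme must be a letter followed by at least one more scheme character, a single-letter scheme such as `a:b` is rejected, as is any value containing whitespace.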
Formal Definition
<IRI>
Used by: ~entity-digital-tan-other-ref
, ~entity-digital-generic-ref
, ~entity-nondigital-ref
TAN-core: String that specifies a single item from a sequence: digits or "last(-digit)?" Similar to seq-picker.
Formal Definition
string (pattern ((last|max)|((last|max)-\d+)|(\d+))|.*\?\?\?.*)
Used by: ~item-pos-ref
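The following sketch shows one way an item-picker value might be validated and resolved to a position. It assumes, beyond what the pattern states, that positions are 1-based, that "last" and "max" both mean the final item, and that "last-N" counts N items back from the end. The function name `resolve_item` is hypothetical.

```python
import re

# Pattern copied from the formal definition above.
ITEM_PICKER = re.compile(r"((last|max)|((last|max)-\d+)|(\d+))|.*\?\?\?.*")


def resolve_item(value: str, length: int) -> int:
    """Resolve an item-picker string to a 1-based position (assumed semantics).

    No bounds checking is done; the result may fall outside 1..length.
    """
    if "???" in value:
        # The pattern admits '???' anywhere; it cannot be resolved to a position.
        raise ValueError("help request, not a position: %r" % value)
    if ITEM_PICKER.fullmatch(value) is None:
        raise ValueError("not an item-picker value: %r" % value)
    if value.isdigit():
        return int(value)
    # Remaining forms are 'last', 'max', 'last-N' or 'max-N'.
    _keyword, _, offset = value.partition("-")
    return length - (int(offset) if offset else 0)


if __name__ == "__main__":
    for sample in ["3", "last", "max-2"]:
        print(sample, "->", resolve_item(sample, 10))
```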
Formal Definition
@which
Used by: ~decl-tok-def
, ~entity-digital-generic-ref
, ~entity-nondigital-ref
, ~metadata-human
, ~decl-morph
Formal Definition
@xml:lang
Used by: ~other-body-attributes
, ~text-div
, ~nontextual-reference
, ~metadata-desc
Formal Definition
<for-lang>
Used by: ~source-list
, ~TAN-R-mor-body
, ~decl-pattern-language
, ~decl-morph
, ~decl-lexi
Formal Definition
<name>+, <desc>*
Used by: ~entity-digital-tan-other-ref
, ~entity-digital-generic-ref
, ~entity-digital-tan-self-ref
, ~entity-nondigital-ref
, ~entity-tok-def
TAN-core: Acceptable values of @n, used by class 1 and class 2 files; this is related to div-range-ref, in that it allows ranges of items, but ignores "last", "max", or "all" keyterms.
Formal Definition
string (pattern (\w+|\d+-\d+)(\s+(\w+|\d+-\d+))*)
Used by: ~name-change
, ~n
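A quick way to see which @n values the pattern above admits is to run a few candidates through it. This is a sketch only: `is_n_value` is a hypothetical name, and Python's `\w`, `\d`, and `\s` approximate the XML Schema classes.

```python
import re

# Pattern copied from the formal definition above.
N_VALUE = re.compile(r"(\w+|\d+-\d+)(\s+(\w+|\d+-\d+))*")


def is_n_value(value: str) -> bool:
    """Return True if the whole string fits the @n pattern (hypothetical helper)."""
    return N_VALUE.fullmatch(value) is not None


if __name__ == "__main__":
    for sample in ["1", "1-3", "a b 4-6", "last", "1 - 3", "a-b"]:
        print(repr(sample), is_n_value(sample))
```

Ranges are allowed only between digits and without surrounding spaces, so `1 - 3` and `a-b` are rejected. A value such as `last` matches, but only as an ordinary name, since the keyterms carry no special meaning here.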
Formal Definition
@bitext-relation
,@reuse-type
?, ?@affects-element
?@xml:lang
{empty}@lexicon
,@morphology
Used by: ~TAN-body-core
TAN-class-2: Pointer (link) to a div in a TAN-T(EI) file
Formal Definition
{empty}@ref
Used by: ~anchor-div-ref-item
, ~reanchor-div-ref-item
, ~simple-textual-reference
, ~claim-div-ref-item
, ~tok-regular
TAN-core: Specifies whether the creation and editing of the data are still in progress. Default value is true.
Formal Definition
@in-progress
Used by: ~TAN-body
Formal Definition
@seg
Used by: ~reanchor-div-ref-item
, ~simple-textual-reference
, ~claim-div-ref-item
TAN-core: String that specifies a range of items in a sequence: digits or "last(-digit)?" joined by hyphens (ranges) or commas. Similar to item-picker.
Formal Definition
string (pattern ((last|max|all|\*)|((last|max)-\d+)|(\d+))(\s*-\s*(((last|max))|((last|max)-\d+)|(\d+)))?(\s*[, ]\s*(((last|max))|((last|max)-\d+)|(\d+))(\s+-\s+(((last|max))|((last|max)-\d+)|(\d+)))?)*|.*\?\?\?.*)
Used by: ~seg-ref
, ~feature-qty-test
, ~seq-pos-ref
, ~char-ref
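The seq-picker pattern is long enough that it helps to test it directly. The sketch below copies the pattern as transcribed above and checks a few values; `is_seq_picker` is a hypothetical name, and Python's regex classes only approximate those of XML Schema.

```python
import re

# Pattern copied from the formal definition above, split for readability.
_ITEM = r"(((last|max))|((last|max)-\d+)|(\d+))"
SEQ_PICKER = re.compile(
    r"((last|max|all|\*)|((last|max)-\d+)|(\d+))"
    r"(\s*-\s*" + _ITEM + r")?"
    r"(\s*[, ]\s*" + _ITEM + r"(\s+-\s+" + _ITEM + r")?)*"
    r"|.*\?\?\?.*"
)


def is_seq_picker(value: str) -> bool:
    """Return True if the whole string fits the seq-picker pattern (hypothetical helper)."""
    return SEQ_PICKER.fullmatch(value) is not None


if __name__ == "__main__":
    samples = ["1", "1-3", "1, 3, 5", "1 3", "last-2 - last", "all",
               "2-4, 6 - 8", "2-4, 6-8", "1,,2"]
    for sample in samples:
        print(repr(sample), is_seq_picker(sample))
```

One detail that follows from the pattern as written: the first range may use a bare hyphen (`2-4`), but later ranges need whitespace around the hyphen, so `2-4, 6 - 8` matches while `2-4, 6-8` does not.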
Formal Definition
( {[TAN-class-2 (~source-refs
):]@src
} OR {[TAN-core (~source-refs
):] {empty}} |@work
),@ref
,@seg
?
Used by: ~textual-reference
TAN-A-div: TAN-A-div sources must be named
TAN-A-tok: TAN-A-tok sources must be named
TAN-core: Parameter to indicate whether <source> should be allowed to take xml:id (forbidden for TAN files with only one source; mandated otherwise)
Formal Definition
@xml:id
@xml:id
@xml:id
?{empty}
Used by: ~source-item
TAN-A-div: TAN-A-div files must have one or more sources
TAN-A-tok: TAN-A-tok files take exactly two sources
TAN-c: claims made of textual objects should be done in the context of TAN-A-div files
TAN-LM-lang: TAN-LM files point only to one source
TAN-LM: TAN-LM files point only to one source
TAN-class-3: Class 3 files allow zero or many sources
Formal Definition
<source>
+<source>
,<source>
{empty}{empty}<for-lang>
<source>
<source>
*<source>
Used by: ~TAN-head
TAN-LM-core: Because TAN-LM files depend on only one source, no id references to sources are needed
Formal Definition
@src
{empty}{empty}
Used by: ~div-type-ref-cluster
, ~split
, ~reanchor-div-ref-item
, ~simple-textual-reference
, ~complex-textual-reference-set
, ~decl-supp-div-type
, ~decl-rename-div-n
, ~tok-source-ref-opt
, ~decl-tok-def
TAN-class-2: All sources are TAN files, so no source rights should be declared--they're already stated
Formal Definition
{empty}<rights-source-only>
Used by: ~source-item
TAN-A-div: Redefining TAN-body-core not only defines what is in the body of a TAN-A-div but also excludes groups from it.
Formal Definition
<equate-works>
*,<equate-div-types>
*,<split-leaf-div-at>
*,<realign>
*,~set-of-claims
?~TAN-R-mor-body
{[TAN-A-tok (~other-body-attributes
):]~other-body-attributes
} OR {[TAN-c (~other-body-attributes
):]~other-body-attributes
} OR {[TAN-key (~other-body-attributes
):]@affects-element
?} OR {[TAN-T (~other-body-attributes
):]@xml:lang
} OR {[TAN-core (~other-body-attributes
):] {empty}} OR {[TAN-LM-core (~other-body-attributes
):]~other-body-attributes
}, ( {[TAN-A-tok (~body-group-opt
):]<group>
} OR {[TAN-key (~body-group-opt
):]<group>
} OR {[TAN-class-2 (~body-group-opt
):]<group>
*} OR {[TAN-core (~body-group-opt
):] {empty}} OR {[TAN-LM-core (~body-group-opt
):]<group>
} | {[TAN-A-tok (~item
):]<align>
} OR {[TAN-c (~item
):] +} OR {[TAN-key (~item
):]<item>
} OR {[TAN-T (~item
):]<div>
} OR {[TAN-core (~item
):] {empty}} OR {[TAN-LM-core (~item
):]<ana>
})+
Used by: ~TAN-body
Formal Definition
(<work>* & <place>* & <unit>* & <person>* & <modal>* & <version>* & <scriptum>* & <topic>* & <verb>*)
Used by: ~declaration-items
, ~TAN-c-decl
TAN-mor: TAN-R-mor body consists of zero or more assert, report, category, or options
Formal Definition
<for-lang>+, ((<feature>* | <category>*) & <assert>* & <report>*)
Used by: ~TAN-body-core
Formal Definition
@id, @TAN-version, ~ed-stamp?, <head>, <body>, <tail>?
Used by: <TAN-A-div>
, <TAN-A-tok>
, <TAN-c>
, <TAN-key>
, <TAN-mor>
, <TAN-T>
, <TAN-LM>
TAN-mor: Test pattern attributes for determining whether to return the message provided by text.
Formal Definition
~certainty-stamp?, (~inclusion | (@context?, (@matches-m | @matches-tok | @feature-test | @feature-qty-test), text))
Formal Definition
(~simple-textual-reference
|~complex-textual-reference-set
){empty}
Used by: ~complex-subject
, ~object-element
, ~locus
, ~complex-rationale
Formal Definition
<tok>
Used by: ~split
, ~complex-text-ref
, ~alignment-content-non-class-2
, ~TAN-LM-item
Formal Definition
~tok-attr-core
, {[TAN-LM-lang (~pointer-to-div-item
):] {empty}} OR {[TAN-class-2 (~pointer-to-div-item
):]@ref
}, (@val
|@pos
| (@val
,@pos
))
Used by: ~tok-with-src-and-cont
, ~tok-with-cont-but-no-src
, ~tok-without-cont-or-src
Formal Definition
{empty} {[TAN-class-2 (~source-refs
):]@src
} OR {[TAN-core (~source-refs
):] {empty}} OR {[TAN-LM-core (~source-refs
):] {empty}}
Used by: ~tok-regular
TAN-core: Tag URN, mandatory pattern for the IRI name of every TAN file
Formal Definition
anyURI (pattern tag:([\-a-zA-Z0-9._%+]+@)?[\-a-zA-Z0-9.]+\.[A-Za-z]{2,4},\d{4}(-(0\d|1[0-2]))?(-([0-2]\d|3[01]))?:\S+)
Used by: ~TAN-root
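To illustrate the tag URN pattern above, here is a minimal sketch that tests a few candidate identifiers. The function name `is_tag_urn` and all sample values are hypothetical, and the check reflects only the pattern shown, not any further validation a TAN processor might perform.

```python
import re

# Pattern copied from the formal definition above, split for readability.
TAG_URN = re.compile(
    r"tag:([\-a-zA-Z0-9._%+]+@)?[\-a-zA-Z0-9.]+\.[A-Za-z]{2,4},"
    r"\d{4}(-(0\d|1[0-2]))?(-([0-2]\d|3[01]))?:\S+"
)


def is_tag_urn(value: str) -> bool:
    """Return True if the whole string fits the tag URN pattern (hypothetical helper)."""
    return TAG_URN.fullmatch(value) is not None


if __name__ == "__main__":
    samples = [
        "tag:example.com,2015:my-file",
        "tag:jane@example.org,2014-05-12:tan-a-div",
        "tag:example.com,15:x",
        "tag:example.com,2015-40:x",
        "http://example.com",
    ]
    for sample in samples:
        print(repr(sample), is_tag_urn(sample))
```

The pattern requires a domain or email address, a comma, a four-digit year with optional month and day, a colon, and then a name without whitespace.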