TEI Lex-0

— A baseline encoding for lexicographic data

12.1.89. <pc>

<pc> (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark. [18.1.2. Below the Word Level; 18.4.2. Lightweight Linguistic Annotation]
Module: analysis (Specification)
Attributes
force: indicates the extent to which this punctuation mark conventionally separates words or phrases.
Status: Optional
Datatype: teidata.enumerated
Legal values are:
strong
the punctuation mark is a word separator
weak
the punctuation mark is not a word separator
inter
the punctuation mark may or may not be a word separator
unit: provides a name for the kind of unit delimited by this punctuation mark.
Status: Optional
Datatype: teidata.enumerated
pre: indicates whether this punctuation mark precedes or follows the unit it delimits.
Status: Optional
Datatype: teidata.truthValue
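The three attributes can be combined on a single element. The following is a minimal sketch, not an example taken from the specification. The unit values "sentence" and "phrase" are assumed names, since no fixed value list is given for unit here, and marking the comma as force="strong" is one plausible reading rather than a rule.

```xml
<s>
 <!-- opening mark: precedes the sentence it delimits -->
 <pc unit="sentence" pre="true">¿</pc>
 <w>Entiendes</w>
 <!-- comma treated as a word separator between two words -->
 <pc force="strong" unit="phrase">,</pc>
 <w>Ana</w>
 <!-- closing mark: follows the sentence it delimits -->
 <pc unit="sentence" pre="false">?</pc>
</s>
```

Here pre distinguishes the two marks that delimit the same unit, while force is given only where the separating role of the mark is being asserted.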
Member of
Contained by
May contain
analysis: c
core: abbr expan
gaiji: g
character data
Example
<phr>
 <w>do</w>
 <w>you</w>
 <w>understand</w>
 <pc type="interrogative">?</pc>
</phr>
Example
Example encoding of the German sentence "Wir fahren in den Urlaub.", encoded with attributes from att.linguistic, discussed in section 18.4.2. Lightweight Linguistic Annotation.
<s>
 <w pos="PPER" msd="1.Pl.*.Nom">Wir</w>
 <w pos="VVFIN" msd="1.Pl.Pres.Ind">fahren</w>
 <w pos="APPR" msd="--">in</w>
 <w pos="ART" msd="Def.Masc.Akk.Sg.">den</w>
 <w pos="NN" msd="Masc.Akk.Sg.">Urlaub</w>
 <pc pos="$." msd="--" join="left">.</pc>
</s>
Content model
<content>
 <alternate minOccurs="0"
  maxOccurs="unbounded">
  <textNode/>
  <classRef key="model.gLike"/>
  <elementRef key="c"/>
  <classRef key="model.pPart.edit"/>
 </alternate>
</content>
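As an illustration of this content model, the sketch below shows <pc> holding plain character data, a sequence of <c> elements, and a <g> reference. It is a hedged example, not part of the specification: the identifier #punctus-elevatus is hypothetical and would need a matching glyph or character declaration elsewhere in the document.

```xml
<s>
 <w>wait</w>
 <!-- three characters regarded as one punctuation mark (an ellipsis) -->
 <pc><c>.</c><c>.</c><c>.</c></pc>
</s>
<s>
 <w>amen</w>
 <!-- a mark with no standard character, referenced through <g>;
      the target id is hypothetical -->
 <pc><g ref="#punctus-elevatus"/></pc>
 <w>dico</w>
 <!-- plain character data, the most common case -->
 <pc>.</pc>
</s>
```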
    
Schema Declaration

element pc
{
   att.global.attributes,
   att.cmc.attributes,
   att.linguistic.attributes,
   att.segLike.attributes,
   att.typed.attributes,
   attribute force { "strong" | "weak" | "inter" }?,
   attribute unit { text }?,
   attribute pre { text }?,
   ( text | model.gLike | c | model.pPart.edit )*
}