
The Benson Gloss XML File

The file benson-glosses.xml defines Professor Larry Benson's glosses for the Chaucer corpus. The standard NU file has the following form, with elided entries shown as "...":

<?xml version="1.0" encoding="utf-8"?>
<WordHoardBensonGlosses>
    <bensonPartsOfSpeech>
        <pos id="0" tag="?">?</pos>
        <pos id="1" tag="abbrev">abbreviation</pos>
        ...
        <pos id="169" tag="for">for in 'for to'</pos>
        <pos id="171" tag="adv#of">adverbial use of 'of'</pos>
    </bensonPartsOfSpeech>
    <bensonLemmas>
        <lemma id="1" homonym="0">
            <word>+</word>
            <wordClass>plussign</wordClass>
            <definition></definition>
            <comment></comment>
            <oedLemma></oedLemma>
        </lemma>
        <lemma id="2" homonym="0">
            <word>--</word>
            <wordClass>dash</wordClass>
            <definition></definition>
            <comment></comment>
            <oedLemma></oedLemma>
        </lemma>
        ...
        <lemma id="10702" homonym="0">
            <word>zodiak</word>
            <wordClass>n</wordClass>
            <definition>zodiac,</definition>
            <comment>s.v. zodiac sb. (a.) OED.</comment>
            <oedLemma>zodiac</oedLemma>
        </lemma>
        <lemma id="11001" homonym="1">
            <word>Dite</word>
            <wordClass>n</wordClass>
            <definition>Dictys Cretensis, historian of the Trojan War,</definition>
            <comment>proper n.; not in MED.</comment>
            <oedLemma></oedLemma>
        </lemma>
    </bensonLemmas>
    <bensonLemPos>
        <lemPos id="1" lemma="1" pos="97"/>
        <lemPos id="2" lemma="4" pos="88"/>
        ...
        <lemPos id="17607" lemma="10702" pos="68"/>
        <lemPos id="17608" lemma="11001" pos="74"/>
    </bensonLemPos>
</WordHoardBensonGlosses>
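
As a rough illustration of how a file of this shape could be read, the following Python sketch parses a trimmed copy of the sample with the standard library's xml.etree.ElementTree and collects the lemmas into a dictionary. The function name load_lemmas and the inline SAMPLE string are assumptions made for this example; this is not how WordHoard itself loads the file.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the sample above. The elided ("...") entries are left out,
# and the XML declaration is dropped so the text can be parsed from a str.
SAMPLE = """
<WordHoardBensonGlosses>
    <bensonPartsOfSpeech>
        <pos id="1" tag="abbrev">abbreviation</pos>
    </bensonPartsOfSpeech>
    <bensonLemmas>
        <lemma id="2" homonym="0">
            <word>--</word>
            <wordClass>dash</wordClass>
            <definition></definition>
            <comment></comment>
            <oedLemma></oedLemma>
        </lemma>
        <lemma id="10702" homonym="0">
            <word>zodiak</word>
            <wordClass>n</wordClass>
            <definition>zodiac,</definition>
            <comment>s.v. zodiac sb. (a.) OED.</comment>
            <oedLemma>zodiac</oedLemma>
        </lemma>
    </bensonLemmas>
    <bensonLemPos>
        <lemPos id="17607" lemma="10702" pos="68"/>
    </bensonLemPos>
</WordHoardBensonGlosses>
"""


def load_lemmas(root):
    """Return {lemma id: dict of fields}; empty elements become ''."""
    lemmas = {}
    for lemma in root.iter("lemma"):
        # Each child element (word, wordClass, ...) becomes one dict entry.
        fields = {child.tag: (child.text or "") for child in lemma}
        fields["homonym"] = lemma.get("homonym")  # None if the attribute is absent
        lemmas[int(lemma.get("id"))] = fields
    return lemmas


root = ET.fromstring(SAMPLE)
lemmas = load_lemmas(root)
print(lemmas[10702]["word"], "->", lemmas[10702]["definition"])
```

For the real file, ET.parse("benson-glosses.xml").getroot() would take the place of ET.fromstring(SAMPLE), assuming the file is in the current directory.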

The Benson gloss definition file has the following elements:

  • WordHoardBensonGlosses. The root element.

    Children:

    • bensonPartsOfSpeech (1). Benson's parts of speech.
    • bensonLemmas (1). Benson's lemmas.
    • bensonLemPos (1). Benson's lemma and part of speech combinations.
  • bensonPartsOfSpeech. Benson's parts of speech.

    Children:

    • pos (0..n). Parts of speech.
  • pos. A part of speech.

    Attributes:

    • id. Required unique numeric id.
    • tag. Required tag.

    Children:

    • TEXT (0..1). Optional description.
  • bensonLemmas. Benson's lemmas.

    Children:

    • lemma (0..n). Lemmas.
  • lemma. A lemma.

    Attributes:

    • id. Required unique numeric id.
    • homonym. Optional homonym number.

    Children:

    • word (1). The lemma word.
    • wordClass (1). The word class.
    • definition (0..1). The definition.
    • comment (0..1). A comment.
    • oedLemma (0..1). The OED lemma.
  • word. A lemma word.

    Children:

    • TEXT (1). The lemma word.
  • wordClass. A word class.

    Children:

    • TEXT (1). The word class.
  • definition. A definition.

    Children:

    • TEXT (0..1). The definition.
  • comment. A comment.

    Children:

    • TEXT (0..1). The comment.
  • oedLemma. An OED lemma.

    Children:

    • TEXT (0..1). The OED lemma.
  • bensonLemPos. Benson's lemma and part of speech combinations.

    Children:

    • lemPos (0..n). Lemma and part of speech combinations.
  • lemPos. A lemma and part of speech combination.

    Attributes:

    • id. Required unique numeric id.
    • lemma. Required lemma id.
    • pos. Required part of speech id.
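
The constraints listed above can be checked mechanically. The Python sketch below is one possible checker, not part of WordHoard. It assumes that the lemma and pos attributes of a lemPos element refer to the id attributes of lemma and pos elements, as the sample suggests. The function name check_glosses is hypothetical, and the lemPos pairings in SAMPLE are invented for the illustration; the second one deliberately points at a lemma that does not exist.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature document. Element and attribute names follow the
# description above; the lemPos pairings are invented for this example.
SAMPLE = """
<WordHoardBensonGlosses>
    <bensonPartsOfSpeech>
        <pos id="0" tag="?">?</pos>
    </bensonPartsOfSpeech>
    <bensonLemmas>
        <lemma id="1" homonym="0">
            <word>+</word>
            <wordClass>plussign</wordClass>
        </lemma>
    </bensonLemmas>
    <bensonLemPos>
        <lemPos id="1" lemma="1" pos="0"/>
        <lemPos id="2" lemma="9" pos="0"/>
    </bensonLemPos>
</WordHoardBensonGlosses>
"""

# (child element, minimum count, maximum count) for each lemma.
LEMMA_CHILDREN = (
    ("word", 1, 1),
    ("wordClass", 1, 1),
    ("definition", 0, 1),
    ("comment", 0, 1),
    ("oedLemma", 0, 1),
)


def check_glosses(root):
    """Return a list of problems; an empty list means none were found."""
    problems = []

    def collect_ids(tag, required):
        # Check required attributes and that ids are numeric and unique.
        seen = set()
        for elem in root.iter(tag):
            for name in required:
                if elem.get(name) is None:
                    problems.append(f"{tag}: missing required attribute {name!r}")
            ident = elem.get("id")
            if ident is not None:
                if not ident.isdigit():
                    problems.append(f"{tag} id {ident!r} is not numeric")
                if ident in seen:
                    problems.append(f"{tag} id {ident!r} is not unique")
                seen.add(ident)
        return seen

    pos_ids = collect_ids("pos", ("id", "tag"))
    lemma_ids = collect_ids("lemma", ("id",))
    collect_ids("lemPos", ("id", "lemma", "pos"))

    # Check the child counts of each lemma.
    for lemma in root.iter("lemma"):
        for child, lo, hi in LEMMA_CHILDREN:
            count = len(lemma.findall(child))
            if not lo <= count <= hi:
                problems.append(f"lemma {lemma.get('id')}: {count} {child} children")

    # Check that each lemPos refers to a known lemma and part of speech.
    for lem_pos in root.iter("lemPos"):
        ident = lem_pos.get("id")
        if lem_pos.get("lemma") not in lemma_ids:
            problems.append(f"lemPos {ident}: unknown lemma {lem_pos.get('lemma')!r}")
        if lem_pos.get("pos") not in pos_ids:
            problems.append(f"lemPos {ident}: unknown pos {lem_pos.get('pos')!r}")
    return problems


problems = check_glosses(ET.fromstring(SAMPLE))
for problem in problems:
    print(problem)
```

Returning a list of messages rather than raising on the first error lets a single run report every inconsistency in a large file.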
