< 

Work XML Files

< 

Table of Contents

< 

The Benson Gloss XML File


Standard Spelling XML Files

IMPORTANT NOTE: We removed the standard spelling feature in version 1.1.6 of WordHoard. There were a number of problems with our notion of "standard spelling", enough so that we judged this feature to be less than useful. The documentation that follows describes how the feature used to work, but should be ignored, at least for now. In the current version, the file spellings/nu-spellings.XML described below is present, but it is empty.

Standard spelling XML files define standard spellings for lemma and part of speech combinations. They are optional. Standard spellings are displayed in the information window for words but are currently otherwise unused by WordHoard.

The directory spellings contains the standard spelling definition files. The file spellings/nu-spellings.XML contains the standard spelling definition file for the three NU English language corpora.

You can add additional standard spelling files to the spellings directory if you wish.

The NU standard spellings file is:

<?xml version="1.0" encoding="utf-8"?>
<WordHoardStandardSpellings charset="roman" posType="english">
    <standardSpelling lemma="a (chr)" pos="chr">a</standardSpelling>
    <standardSpelling lemma="a (at)" pos="at">a</standardSpelling>
    ...
    <standardSpelling lemma="abject (aj)" pos="aj">abject</standardSpelling>
    <standardSpelling lemma="abject (aj)" pos="aj-ly-av">abjectly</standardSpelling>
    <standardSpelling lemma="abject (aj)" pos="aj-pl-n">abjects</standardSpelling>
    ...
    <standardSpelling lemma="thou (pnp)" pos="pnp-2sg.ge">thou</standardSpelling>
    <standardSpelling lemma="thou (pnp)" pos="pnp-2sg.ge-dt">thou</standardSpelling>
</WordHoardStandardSpellings>

A standard spellings definition file has the following elements:

  • WordHoardStandardSpellings. The root element.

    Attributes:

    • charset = roman or greek. The character set used by this set of standard spellings. Required.
    • posType = english or greek. The part of speech taxonomy used by this set of standard spellings. Required.

    Children:

    • standardSpelling (0..n). Standard spelling definitions.
  • standardSpelling. A standard spelling definition.

    Attributes:

    • lemma = The lemma id. Required.
    • pos = The part of speech id. Required.

    Children:

    • TEXT (1). The standard spelling of this lemma and part of speech combination.

< 

Work XML Files

< 

Table of Contents

< 

The Benson Gloss XML File