< 

Tracking Word Form Use Over Time

< 

Table of Contents

< 

Comparing Collocates


Finding Collocates and Multiword Units

The Latin maxim noscitur a soclis (one knows them by their associates) applies to words as well as people. Often we want to know not only how often a word form appears in a text, but also how frequently two or more specific word forms appear in proximity in a text. Words which appear near each other in a text more frequently than we would expect by chance are called collocates.

WordHoard allows you to analyze bigram (two word) collocates by selecting "Find Collocates" from the "Analysis" menu.

WordHoard also defines a multiword unit as a special type of collocate in which the component words comprise a meaningful phrase. For example, "King of England" is a meaningful multiword unit or phrase. WordHoard uses the notion of a pseudo-bigram to generalize the computation of bigram (two word) statistical measures to phrases longer than two words, and to allow comparisons of these measures for phrases with different word counts. WordHoard applies the LocalMaxs algorithm to the pseudo-bigram statistics to identify potentially interesting phrases that "stand out" in a text. Select "Find multiword units" from the "Analysis" menu to initiate a multiword unit analysis.

More details and examples for these two types of collocational analysis will be available shortly.


< 

Tracking Word Form Use Over Time

< 

Table of Contents

< 

Comparing Collocates