Linguistics 482 - Computational Linguistics Fall 2001 Laura Proctor lproctor@uvic.ca
Department of Linguistics
University of Victoria
(250) 721-8282
Last updated October 22, 2001

Lexical Semantics, Wordnet & Dictionaries (continued)

Practical applications of lexical semantics include:

Reference

  • Sage, Juan C. (1990) A Practical Course in Terminology Processing. Amsterdam:John Benjamins. Terminologists formalize definitions of terms as they are used in special/technical domains (in contrast to lexicographers who are concerned with words as they are used in common/general language). Understanding terms is essential to understanding specialized texts.

    Knowledge (concepts) in specialized domains are expressed through terminology, therefore it is essential to understand the relationship between concepts as well as to understand the relationship between concept and term. One common framework for representing concepts (and their associated terms) is to consider that concepts are related on a number of different dimensions or features. The variation of feature values is potentially infinite though most are practically discrete and restricted by interdependence with other features.

    Description of concepts in specialized fields encompasses the following general types of concept:

    Mapping concept to words can lead to dividing concepts into:

    This is not as useful in languages which do not make formal distinctions between parts of speech. For example: a concept like "white" can be expressed in English as: Noun: whiteness, Adjective: white, Verb: whiten.

    Relationships between concepts include:

    Methods of definition - in lexicography and terminology: (Sager pp42-43)

    Many definitions make use of more than one of these methods.

    In order for a word to be considered a specialized term, there must be general agreement (in the specialized field) that the term as defined represents the concept.

    Reference

  • Sinclair, J.M. Ed. (1987) Looking Up: An accoun of the COBUILD Project in lexical computing. London: Collins ELT.

    Lexicography is concerned with definition of words in general (use of) language. Lexical relationships based on concept type but must also be affected by syntactic restrictions. That is, the words "white", "whiteness" and "whiten" must be related to the same underlying concept of white, but the sentence frames (syntactic restriction) in which they may occur are different.

    WordNet

    (excerpts from the 5 papers) "Definitions of common nouns typically give a superordinate term plus distinguishing features; that information provides the basis for organizing noun files in WordNet. The superordinate relation (hyponymy) generates a hierarchical semantic organization that is duplicated in the noun files by the use of labeled pointers between sets of synonyms (synsets). The hierarchy is limited in depth, seldom exceeding more than a dozen levels. Distinguishing features are entered in such a way as to create a lexical inheritance system, a system in which each word inherits the distinguishing features of all its superordinates. Three types of distinguishing features are discussed: attributes (modification), parts (meronymy), and functions (predication), but only meronymy is presently implemented in the noun files. Antonymy is also found between nouns, but it is not a fundamental organizing principle for nouns."

    The semantic relation that is represented above by @> has been called the ISA relation, or the hypernymic or superordinate relation (since it points to a hypernym or superordinate term); it goes from specific to generic and so is a generalization. Whenever it is the case that a noun Wh @-> a noun Ws , there is always an inverse relation, Ws ~-> Wh . That is to say, if Ws is the superordinate of Wh , then Wh is the subordinate or hyponym of Ws . The inverse semantic relation ~- goes from generic to specific (from superordinate to hyponym) and so is a specialization.

    "In other words, database experts and lexicographers both resort to hierarchical structures for the same reason: to save space. Computer scientists call such hierarchies inheritance systems, because they think of specific items inheriting information from their generic superordinates. That is to say, all of the properties of the superordinate are assumed to be properties of the subordinate as well; instead of listing those properties redundantly with both items, they are listed only with the superordinate and a pointer from the subordinate to the superordinate is understood to mean 'for additional properties, look here.' {tree} is not the only hyponym of {plant, flora}, general terms can replace specific term (anaphora)

    Hypernym is the generic term used to designate a whole class of specific instances. Y is a hypernym of X if X is a (kind of) Y. Hyponym is the generic term used to designate a member of a class. X is a hyponym of Y if X is a (kind of) Y. Coordinate words are words that have the same hypernym.

    Exercise:

    1. Pick a specific term, find its hypernym (more general term), i.e. Y where your term is a kind of Y.
    2. Find co-ordinate terms - ie. those other things which are also Y's.

    In defense of inheritance systems as a psychological reality: "They observed, for example, that it takes less time to respond True to "A canary can sing" than to "A canary can fly," and still more time is required to respond True to "A canary has skin." In this example, it is assumed that can sing is stored as a feature of canary, can fly as a feature of bird, and has skin as a feature of animal. If all three features had been stored directly as features of canary, they could all have been retrieved with equal speed."

    List of 25 unique beginners for WordNet nouns:

    {act, action, activity}
    {natural object}
    {animal, fauna}
    {natural phenomenon}
    {artifact}
    {person, human being}
    {attribute, property}
    {plant, flora}
    {body, corpus}
    {possession}
    {cognition, knowledge}
    {process}
    {communication}
    {quantity, amount}
    {event, happening}
    {relation}
    {feeling, emotion}
    {shape}
    {food}
    {state, condition}
    {group, collection}
    {substance}
    {location, place}
    {time}
    {motive}
    

    "Three different kinds of distinguishing features (Miller, in press):
    (1) Attributes: small, yellow
    (2) Parts: beak, wings
    (3) Functions: sing, fly

    Each type of distinguishing feature must be treated differently.

    Meronymy is obviously a complex semantic relation - or set of relations. Only three of these types of meronymy are coded in WordNet:
    - Wm is a component part of Wh ;
    - Wm is a member of Wh ; and
    - Wm is the stuff that Wh is made from.

    WordNet contains descriptive adjectives (such as big, interesting, possible) and relational adjectives (such as presidential and nuclear). A relatively small number of adjectives including former and alleged constitute the closed class of reference-modifying adjectives. Each of these classes is distinguished by the particular semantic -and syntactic properties of its adjectives.

    Antonymy: The basic semantic relation among descriptive adjectives is antonymy. The importance of antonymy first became obvious from results obtained with word association tests: When the probe is a familiar adjective, the response commonly given by adult speakers is its antonym.

    Semantics: Relational adjectives differ from descriptive adjectives in that they do not relate to an attribute: there is no scale of criminality or musicality on which the adjectives in criminal law and musical training express a value. The adjective and the related noun refer to the same concept, but they differ formally (morphologically).

    Note that attributes are given by adjectives, parts by nouns, and functions by verbs.

    Word Net is available at: http://www.cogsci.princeton.edu/~wn/

    A list of dictionaries online can be found on the Library Gateway.

    Further reading

  • Saint-Dizier, Patrick & Evelyne Viegas, eds. (1995) Computational Lexical Semantics. Cambridge: Cambridge University Press.
  • Pustejovsky, James. (1995) The Generative Lexicon Cambridge, Mass.: MIT Press.

    Required Readings

  • http://www.cogsci.princeton.edu/~wn/obtain/5papers.pdf - pages 1 - 9 (the first paper).

  • Linguistics 482 Home Page Class Note Index Top of Page