Xspan logo University of Edinburgh logo Heriot-Watt University logo
Home
Overview
Technical
Formal
Ontology
Expert
Mapping
Gene
Expression
Visualisation
Terminology
and Language
Issues
XSPAN spacer gif
COBrA ../spacer gif
SAEL spacer gif
Cell Ontology spacer gif
Publications
People
Collaborators
Contact Us
 
 
 
 
 

Terminology and Language Issues

Terminology Problems


In different models of the same species, parts with similar names may only denote roughly the same tissues.

In GALEN: Lobe of left lung
   
Maps in FMA to: Upper Lobe of left lung
  Lower lobe of left lung


More of a problem is that in anatomical models of different species, parts with the same (or similar) names do not always denote homologous tissues:

In Mouse “tail
(posterior extension of the spine)
   
In C.Elegans “tail
(male sex organ)


The model species we are comparing have many anatomical parts. For
example, Mouse has 3559 anatomical parts, Drosophila has 506
anatomical parts, and C. Elegans, 242 anatomical parts.

Does the language used in terminologies and anatomical ontologies suggest what parts may be similar? Yes. But not directly - context is crucial. The entire series of names from root to leaf node is needed to ground terms that are themselves underspecified. Grounding refers to connecting natural language expressions such as mouse tail, with a model of the world. In this case, the model is a mouse ontology.

Context is the key to clarifying terms such as "tail". If you look at the paths for tail in Mouse and C.Elegans, its clear that they are different:

Path name in Mouse:
- embryo
- tail
- nervous system
- peripheral nervous system
- segmental spinal nerve
 
Path name in C.Elegans:
- organ_system
- sex-associated system
- male-associated system
- male tail
- hypodermis
- fan hypodermis

Language processing techniques are used to help compare anatomical part names across species. They include:

a. Normalizing terms to limit the effect of different descriptive styles.
b. Comparing content words by removing stop words.
c. Ensuring comparable forms of words by stemming and lemmatizing.
d. Results are then treated as an unordered set.

The results of the language processing experiments show that the lexical suggestions were indicative of structural support in 75% of the comparisons.

Made on a Mac logo
 
All logos appear with kind permission of their owners. Material on external sites is beyond the control and responsibility of XSPAN.
 
Last Updated: 09 December 2004