REWERSE-RP-2006-038

Felix Weigel, Klaus U. Schulz, Levin Brunner, Eduardo Torres-Schumann:
Integrated Document Browsing and Data Acquisition for Building Large Ontologies.


Complete Text [
.pdf, 445KB]
In: Proceedings of 10th International Conference on Knowledge-Based & Intelligent Information & Engineering Systems (KES2006), Bournemouth, UK (9th - 11th October 2006), Organization: KES International, LNCS 4253, 614-622, October 2006
© Springer

Abstract
Named entities (e.g., "Kofi Annan", "Coca-Cola", "Second World War") are ubiquitous in web pages and other document types and often render a simplified picture of the document's content. We present a database containing currently 20,000 named entities in different languages from various domains such as history, geography, politics, sports, arts etc., which is being developed at the University of Munich. The underlying graph data model is simple and yet extremely versatile in different application scenarios. We demonstate a prototype of a graphical interface to both the database and to documents on the web or in a local document repository, with a tight interaction in both directions. Occurrences of entities from the database are highlighted and hyperlinked in the documents. Unrecognized entities are easily added to the database and related to other concepts in a semiautomatic process. The entity database can also be used for searching the web or the repository for matches semantically close to a full-text query and for indexing different kinds of named entities in the document repository. Similar to a programming IDE, the system illustrates how integrated browsing, search and update functionality contributes to the construction of high-quality ontologies, fundamental to the vision of a truly "semantic" web.

URL:
http://rewerse.net/publications/rewerse-publications.html#REWERSE-RP-2006-038

BibTeX:

@inproceedings{REWERSE-RP-2006-038,
	author = {Felix Weigel and Klaus U. Schulz and Levin Brunner and Eduardo Torres-Schumann},
	title = {Integrated Document Browsing and Data Acquisition for Building Large Ontologies},
	booktitle = {Proceedings of 10th International Conference on Knowledge-Based & Intelligent Information & Engineering Systems, Bournemouth, UK (9th--11th October 2006)},
	year = {2006},
	volume = {4253},
	organization = {KES International},
	series = {LNCS},
	pages = {614--622},
	url = {http://rewerse.net/publications/rewerse-publications.html#REWERSE-RP-2006-038}
}