REWERSE-RP-2006-086

Bernhard Krüpl, Marcus Herzog:
Visually Guided Bottom-Up Table Detection and Segmentation in Web Documents.


Complete Text [.pdf, 180KB]
Poster File 1 [.svg, 921KB]
Poster File 2 [.svg, 496KB]
In: Proceedings of 15th International World Wide Web Conference (WWW2006), Edinburgh, Scotland (23rd - 26th May 2006), pp. 933-934, May 2006
© ACM Press

Abstract
In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. Our algorithm works bottom-up by grouping word bounding boxes into larger groups and uses a set of heuristics. It has already been implemented and a preliminary evaluation on about 6000 Web documents has been carried out.
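The bottom-up idea in the abstract, grouping word bounding boxes into larger groups, can be illustrated with a minimal sketch. This is not the AllRight algorithm or its heuristics, which the abstract does not specify. The `Box` type, the `group_words` function, the same-line test, and the `max_gap` threshold are all hypothetical choices made for illustration only.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box in page pixels (hypothetical representation)."""
    x0: float
    y0: float
    x1: float
    y1: float


def vertical_overlap(a: Box, b: Box) -> float:
    """Length of the vertical extent shared by two boxes (0 if none)."""
    return max(0.0, min(a.y1, b.y1) - max(a.y0, b.y0))


def union(a: Box, b: Box) -> Box:
    """Smallest box that encloses both inputs."""
    return Box(min(a.x0, b.x0), min(a.y0, b.y0), max(a.x1, b.x1), max(a.y1, b.y1))


def group_words(words: List[Box], max_gap: float = 10.0) -> List[Box]:
    """Greedily merge word boxes that sit on the same line and are close together.

    max_gap is an illustrative threshold, not a value taken from the paper.
    """
    groups: List[Box] = []
    # Visit words top-to-bottom, then left-to-right.
    for w in sorted(words, key=lambda b: (b.y0, b.x0)):
        for i, g in enumerate(groups):
            # Assumed heuristic: boxes share a line if they overlap
            # vertically by more than half the height of the shorter one.
            shorter = min(g.y1 - g.y0, w.y1 - w.y0)
            same_line = vertical_overlap(g, w) > 0.5 * shorter
            gap = w.x0 - g.x1  # horizontal distance from group to word
            if same_line and gap <= max_gap:
                groups[i] = union(g, w)
                break
        else:
            # No existing group accepted the word, so it starts a new one.
            groups.append(w)
    return groups
```

In this sketch, two words separated by a small horizontal gap on the same line merge into one group, while a word far to the right stays separate and could be read as a candidate for a different table column. A real system would need further heuristics to align such groups into rows and columns.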

URL:
http://rewerse.net/publications/rewerse-publications.html#REWERSE-RP-2006-086

BibTeX:

@inproceedings{REWERSE-RP-2006-086,
	author = {Bernhard Kr\"upl and Marcus Herzog},
	title = {Visually Guided Bottom-Up Table Detection and Segmentation in Web Documents},
	booktitle = {Proceedings of 15th International World Wide Web Conference, Edinburgh, Scotland (23rd--26th May 2006)},
	year = {2006},
	pages = {933--934},
	url = {http://rewerse.net/publications/rewerse-publications.html#REWERSE-RP-2006-086}
}