
Finding Context Paths for Web Pages

by Yoshiaki Mizuuchi, Keishi Tajima

Abstract

The contents of Web pages are often not self-contained. A page author often assumes that all readers of the page arrive through the same path, and sometimes omits information already described in the pages on that path, because the readers are presumed to know it. Therefore, indexes that search engines build from the contents of each page alone are also incomplete. In this paper, we propose a method of discovering those paths assumed by page authors, and of complementing the incomplete indexes with keywords extracted from the pages on those paths.
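As a rough illustration of the second half of that idea (complementing a page's index with keywords from the pages on its path), here is a minimal Python sketch. It is not the paper's algorithm: it assumes the context path has already been discovered and is supplied as a list, and the function name `complement_index`, the URLs, and the keyword sets are all hypothetical.

```python
from typing import Dict, List, Set


def complement_index(
    page_keywords: Dict[str, Set[str]],
    context_path: List[str],
) -> Set[str]:
    """Union the keywords of every page on the context path.

    `context_path` is assumed to run from the entry page to the target
    page (the last element). Path discovery itself is out of scope here.
    """
    combined: Set[str] = set()
    for url in context_path:
        # Pages with no extracted keywords contribute nothing.
        combined |= page_keywords.get(url, set())
    return combined


# Hypothetical per-page keyword sets, as a content-only index would see them.
page_keywords = {
    "/lab/": {"database", "laboratory"},
    "/lab/members/": {"members"},
    "/lab/members/tajima": {"publications"},
}

# Hypothetical context path leading to the last page.
path = ["/lab/", "/lab/members/", "/lab/members/tajima"]
augmented = complement_index(page_keywords, path)
```

With these assumed inputs, a query for "database publications" would match the last page through its augmented keyword set, although the page itself mentions only "publications".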

Full Text: free download from ACM

Slides: pdf

BibTeX entry

Keywords

query, structure discovery, information discovery, Web, WWW, hypertext, context, path, hierarchy

Published in Proc. of ACM Hypertext, pp. 13-22, Feb. 1999, Darmstadt, Germany.

Copyright © 1999 by the Association for Computing Machinery, Inc. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page.
tajima@i.kyoto-u.ac.jp / Fax: +81(Japan) 75-753-5978 / Office: Research Bldg. #7, room 404