摘要
Linear algebra-based techniques have long been used to correlate similar documents. They map the documents to a multi-dimensional vector space, in which each document is represented by a vector. Searching related documents then translates into searching nearest neighbors in the vector space. In this paper, we propose an indexing structure, called cosine R-tree, which indexes multidimensional vector space and provides efficient nearest neighbor search. Our preliminary results show that it gives better performance than a brute-force linear scan strategy.
| 原文 | English |
|---|---|
| 文章編號 | 884720 |
| 頁(從 - 到) | 210-211 |
| 頁數 | 2 |
| 期刊 | Proceedings - IEEE Computer Society's International Computer Software and Applications Conference |
| DOIs | |
| 出版狀態 | Published - 25 10月 2000 |
指紋
深入研究「Tree indexing for efficient search of similar documents」主題。共同形成了獨特的指紋。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver