摘要
Scientific analysis of the linguistic styles of different authors has generated considerable interest. We present a generic approach to measuring the similarity of two symbolic sequences that requires minimal background knowledge about a given human language. Our analysis is based on word rank order-frequency statistics and phylogenetic tree construction. We demonstrate the applicability of this method to historic authorship questions related to the classic Chinese novel "The Dream of the Red Chamber," to the plays of William Shakespeare, and to the Federalist papers. This method may also provide a simple approach to other large databases based on their information content.
原文 | English |
---|---|
頁(從 - 到) | 473-483 |
頁數 | 11 |
期刊 | Physica A: Statistical Mechanics and its Applications |
卷 | 329 |
發行號 | 3-4 |
DOIs | |
出版狀態 | Published - 15 11月 2003 |