Automatic patent document summarization for collaborative knowledge systems and services

Amy J.C. Trappey, Charles V. Trappey, Chun Yi Wu

研究成果: Article同行評審

58 引文 斯高帕斯(Scopus)

摘要

Engineering and research teams often develop new products and technologies by referring to inventions described in patent databases. Efficient patent analysis builds R&D knowledge, reduces new product development time, increases market success, and reduces potential patent infringement. Thus, it is beneficial to automatically and systematically extract information from patent documents in order to improve knowledge sharing and collaboration among R&D team members. In this research, patents are summarized using a combined ontology based and TF-IDF concept clustering approach. The ontology captures the general knowledge and core meaning of patents in a given domain. Then, the proposed methodology extracts, clusters, and integrates the content of a patent to derive a summary and a cluster tree diagram of key terms. Patents from the International Patent Classification (IPC) codes B25C, B25D, B25F (categories for power hand tools) and B24B, C09G and H011 (categories for chemical mechanical polishing) are used as case studies to evaluate the compression ratio, retention ratio, and classification accuracy of the summarization results. The evaluation uses statistics to represent the summary generation and its compression ratio, the ontology based keyword extraction retention ratio, and the summary classification accuracy. The results show that the ontology based approach yields about the same compression ratio as previous non-ontology based research but yields on average an 11% improvement for the retention ratio and a 14% improvement for classification accuracy.

原文English
頁(從 - 到)71-94
頁數24
期刊Journal of Systems Science and Systems Engineering
18
發行號1
DOIs
出版狀態Published - 3月 2009

指紋

深入研究「Automatic patent document summarization for collaborative knowledge systems and services」主題。共同形成了獨特的指紋。

引用此