文件名称:072282
-
所属分类:
- 标签属性:
- 上传时间:2012-11-16
-
文件大小:98.39kb
-
已下载:0次
-
提 供 者:
-
相关连接:无下载说明:别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容来自于网络,使用问题请自行百度
提出了一种自动构造特定领域本体的方法,该方法应用术语抽取和多重聚类技术。在术语抽取阶段,通过术语在专业语料与背景语料中出现概率的对比,采用LLR公式对术语进行评分,取得了更好的抽取效果。在层级关系发现过程中,采用上下文共现信息结合HowNet中词语的语义相似度,进行术语间相似度度量,力求获得术语间最合理的相关状况。同时改进了k-medoids聚类算法,更准确地发现术语的层级关系,进而构造出特定领域的本体。-This paper presents an approach to mining domain-dependent ontologies using term extraction and relationship discovery technology.There are two main innovations in the approach. One is extracting terms using log-likelihood ratio, which is based on the contrastive probabilityofterm occurrence in domain corpus and background corpus. The other is fusing together information from multiple knowledge sources as evidencesfor discovering particular semantic relationships among terms. In the experiment, traditional k-mediods algorithm is improved for multi-levelclustering. The approach to produce an ontology for the domain of computer science is applied and promising results are obtained.
(系统自动生成,下载前可以参看下载内容)
下载文件列表
072282.pdf
本网站为编程资源及源代码搜集、介绍的搜索网站,版权归原作者所有! 粤ICP备11031372号
1999-2046 搜珍网 All Rights Reserved.