1 基于WEKA平台的文本聚类研究与实现文本聚类是文本挖掘领域的一个重要研究分支 - 下载

热门搜索： 源码 Android 整站插件识别 p2p OpenCV 网络编程游戏源码算法更多...

登陆 | 会员注册

当前位置：

首页

资源下载

源码下载

Windows编程

文件名称:1

所属分类：

Windows Develop
标签属性：

[PDF]
上传时间：

2012-11-16
文件大小：

999.14kb
已下载：

0次
提供者：

yue***
相关连接：

无
下载说明：

别用迅雷下载，失败请重下，重下不扣分！

电信下载联通下载

报告错误！

修正介绍说明

介绍说明－－下载内容来自于网络，使用问题请自行百度

基于WEKA平台的文本聚类研究与实现

文本聚类是文本挖掘领域的一个重要研究分支，是聚类方法在文本处理领域的应用。本文对基于空间向量模型的文本聚类过程做了较深入的讨论和总结，利用文本语料库，基于数据挖掘工具研究并实现了文本聚类的过程。本文首先给出了文本聚类的思想和过程，回顾了文本聚类领域的已有成果，列举了文本聚类领域在特征表示、特征提取等方面的基础研究工作。另外，本文回顾了现有的文本聚类算法，以及常用的文本聚类效果评价指标。在研究了已有成果的基础上，本文利用20 Newsgroup文本语料库，针对向量空间表示模型，在开源的数据挖掘平台WEKA上实现了文本预处理和k-means聚类算法，并根据实际聚类效果，就文本表示、特征选择、特征降维、等方面提出优化方案。-Text clustering is an important field of text mining research branch, is the clustering in the field of text processing applications. In this paper, based on vector space model for text clustering process to do a more in-depth discussion and summary, the use of the text corpus, based on data mining tools to study and realize the document clustering process. This paper shows the ideas and text clustering process, reviewed the existing text clustering results of the field, citing the field of document clustering in the feature representation, feature extraction and other aspects of basic research. In addition, the paper reviews the existing text clustering algorithm, as well as common text clustering validity. In the study has been based on the results, we use 20 Newsgroup corpus, for the vector space representation model, in the WEKA open source data mining platform to achieve a text preprocessing and k-means clustering algorithm, and according to the actual clustering effect to the tex

下载文件列表

基于WEKA平台的文本聚类研究与实现.pdf

*快速评论：	推荐一般有密码和说明不符不是源码或资料文件不全不能解压纯粹是垃圾
*内　　容：
*验证码：

文件名称:1

介绍说明－－下载内容来自于网络，使用问题请自行百度

下载文件列表

相关说明

相关评论

发表评论

下载资源主分类

源码下载

Web源码

开发工具

文档下载

其它资源

资源分类

界面编程

系统编程

网络编程

驱动编程

数据库编程

GDI/图象编程

C#编程

.net编程

多媒体编程

通讯编程

Shell编程

ActiveX/DCOM

输入法编程

ISAPI/IE编程

钩子与API截获

屏幕保护

DirextX编程

进程与线程

控制台(字符窗口)编程

文件操作

打印编程

多显示器编程

DNA

其他小程序

在结果中搜索

浏览历史记录