搜索资源列表
WebExtract20070417
- 从htm/html格式的网页文件中提取内容。将要提取内容的网页文件用鼠标拖入窗口,按回车即可完成转换。转换后的文件是与原文件同名的文本文件。 支持文件夹批量转换!-from htm / html format of the document from the website content. Will be from the website content with the mouse into the document window, press the Enter conversion
extract_document
- 这是一个提取 Reuter-21578 的程序, 用做自然语言处理, 文本分类聚类,和信息检索的测试集!-This is an extract of the Reuter-21578 procedure, used for natural language processing, text classification clustering, and information retrieval test collection!
IntServer
- 复杂网络聚类算法进行文本分析,能够进行关键字的提取和分类功能。-Complex network clustering algorithm for text analysis, to carry out keyword extraction and classification capabilities.
WordNet
- 用于数据挖掘方面的。潜在语义索引最初是一种知识的自动提取和表示的方法,近年来广泛地应用到文本检索中-For data mining. Latent Semantic Indexing is a knowledge of the first automatic extraction and representation methods in recent years, widely used in text retrieval
Text-classification
- 文本分类之词频统计 分词、词干提取、去停用词、计算词频,有界面-Text classification of word frequency statistics word stemmer, to stop words, calculate word frequency, interface
TopicModel.tar
- 文本主题模型程序,由于互联网数据文本主题提取功能,深入挖掘用户行为-Text topic model program, due to Internet data text subject extraction, dig user behavior
finallyliuyuClassifier
- 用于文本分类,文本挖掘,文本特征提取,文本聚类,文本关联等(It is used for text classification, text mining, text feature extraction, text clustering, text association, etc.)
CNN_sentence_tensorflow-master
- 基于卷机神经网络的文本信息提取应用的设计与实现,cn(Design and Implementation of Text Information Extraction Application Based on Reel Neural Network)