File name: Text-Classification_libSVM
Description (the download comes from the Internet; for usage questions, please search Baidu yourself)
Word segmentation with seg
Parameter 1: the path of the folder that holds the input text corpus. For example, if the corpus text files are all in the train//text folder, the parameter is train//text//* . Note: each article must be in its own txt file.
Parameter 2: the path of the folder where the segmentation result files are stored, for example result//text. Note: do not append * to this one.
This tool uses ICTCLAS, the Chinese word segmentation tool from the Chinese Academy of Sciences; please download it yourself from the official ICTCLAS website. Then place the Data folder, Configure.xml, ICTCLAS30.h, ICTCLAS30.lib and ICTCLAS30.dll in the same folder as seg.exe.
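The setup described above can be sketched in Python as a pre-flight check plus a command builder. This is only an illustration under stated assumptions: the description does not show the exact command line, so passing the two parameters positionally to seg.exe is an assumption, and the helper names (missing_ictclas_files, build_seg_command, run_seg) are hypothetical and not part of the package.

```python
import subprocess
from pathlib import Path

# Entries the description says must sit in the same folder as seg.exe.
# "Data" is a folder; the rest are files.
REQUIRED = ["Data", "Configure.xml", "ICTCLAS30.h", "ICTCLAS30.lib", "ICTCLAS30.dll"]


def missing_ictclas_files(tool_dir):
    """Return the required ICTCLAS entries that are absent from tool_dir."""
    tool_dir = Path(tool_dir)
    return [name for name in REQUIRED if not (tool_dir / name).exists()]


def build_seg_command(tool_dir, corpus_dir, output_dir):
    """Build the argument list for seg.exe.

    Assumption: seg.exe takes parameter 1 and parameter 2 positionally,
    in the order given in the description.
    """
    exe = str(Path(tool_dir) / "seg.exe")
    # Parameter 1: corpus folder followed by //* (one article per txt file).
    input_pattern = corpus_dir.rstrip("/\\") + "//*"
    # Parameter 2: result folder, without a trailing *.
    output_path = output_dir.rstrip("/\\")
    return [exe, input_pattern, output_path]


def run_seg(tool_dir, corpus_dir, output_dir):
    """Check the ICTCLAS files first, then invoke seg.exe."""
    missing = missing_ictclas_files(tool_dir)
    if missing:
        raise FileNotFoundError("Missing next to seg.exe: " + ", ".join(missing))
    subprocess.run(build_seg_command(tool_dir, corpus_dir, output_dir), check=True)
```

With the example paths from the description, the call would be run_seg(".", "train//text", "result//text"). Checking for the ICTCLAS files before launching gives a readable error message when one of them has not been copied next to seg.exe.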
2.getFea
(Generated automatically by the system; you can check the download contents before downloading)
Downloaded file list
Configure.xml
Data/
Data/BiWord.big
Data/charset.type
Data/CoreDict.pdat
Data/CoreDict.pos
Data/CoreDict.unig
Data/FieldDict.pdat
Data/FieldDict.pos
Data/GranDict.pdat
Data/GranDict.pos
Data/ICTCLAS30.ctx
Data/ICTCLAS_First.map
Data/ICTPOS.map
Data/nr.ctx
Data/nr.fsa
Data/nr.role
Data/PKU.map
Data/PKU_First.map
dict.txt
feature/
featureselection.exe
feature/3.txt
feature/4.txt
feature/5.txt
feature/6.txt
getFeature.exe
getRandFile.exe
getSVMfeture(df).exe
getSVMTtrain.exe
ICTCLAS30.dll
ICTCLAS30.log
mergeFile.bat
readme.txt
readme文本分类的主要流程.txt
seg/
seg.exe
seg/3.txt
seg/4.txt
seg/5.txt
seg/6.txt
seg/7.txt
seg/8.txt
seg/9.txt
seg/test1.txt
seg/test2.txt
svmfeature/
svmfeature/3.txt
svmfeature/4.txt
svmfeature/5.txt
svmfeature/6.txt
svmfeature/7.txt
svmfeature/8.txt
svmtrain/
svmtrain/svm.scale
svmtrain/train.scale
train/
train/3.txt
train/4.txt
train/5.txt
train/6.txt
train/7.txt
train/8.txt
train/9.txt
train/test1.txt
train/test2.txt