文件名称:zhengdike
-
所属分类:
- 标签属性:
- 上传时间:2012-11-16
-
文件大小:1.42mb
-
已下载:0次
-
提 供 者:
-
相关连接:无下载说明:别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容来自于网络,使用问题请自行百度
(个人原创)《中文网页自动分类》
牵扯的技术有:分词,统计词频,踢出网页中一些特殊字符(用正则表达式),还有需要提取培训集等等!!
此软件禁止商业活动,版权所属“qyTT论坛--www.qyclass.org/bbs”
本文来自: qyTT论坛 http://www.qyclass.org/bbs 我们的使命:让世界认识qyTT,让qyTT认识世界!
结果分析的思想:就是把得到的词频与建立的词库里每一类进行比较,如果存在一个最大匹配程度,就去这个类作为结果,如果存在多个最大值,那么就去词库里特征词最少的一个!!
-(Individual original) " Chinese Web Page Automatic Classification" involves technology are: Word, word frequency statistics, kicked out the page number of special characters (using regular expressions), and need to extract the training set and more! ! Results of the idea: is to get the word frequency and the establishment of the vocabulary in each category of comparison, if there is a maximum matching degree, and went to the class as a result, if there is more than the maximum, then go inside thesaurus features at least one word! !
牵扯的技术有:分词,统计词频,踢出网页中一些特殊字符(用正则表达式),还有需要提取培训集等等!!
此软件禁止商业活动,版权所属“qyTT论坛--www.qyclass.org/bbs”
本文来自: qyTT论坛 http://www.qyclass.org/bbs 我们的使命:让世界认识qyTT,让qyTT认识世界!
结果分析的思想:就是把得到的词频与建立的词库里每一类进行比较,如果存在一个最大匹配程度,就去这个类作为结果,如果存在多个最大值,那么就去词库里特征词最少的一个!!
-(Individual original) " Chinese Web Page Automatic Classification" involves technology are: Word, word frequency statistics, kicked out the page number of special characters (using regular expressions), and need to extract the training set and more! ! Results of the idea: is to get the word frequency and the establishment of the vocabulary in each category of comparison, if there is a maximum matching degree, and went to the class as a result, if there is more than the maximum, then go inside thesaurus features at least one word! !
(系统自动生成,下载前可以参看下载内容)
下载文件列表
zhengdike/build.xml
zhengdike/ceshi/1/www.txt
zhengdike/ceshi/2/www.txt
zhengdike/ceshi/3/www.txt
zhengdike/ceshi/4/WWW.txt
zhengdike/ceshi/5/www.txt
zhengdike/ceshi/6/www.txt
zhengdike/ceshi/7/WWW.txt
zhengdike/ceshi/8/www.txt
zhengdike/jar/filterbuilder.jar
zhengdike/jar/htmllexer.jar
zhengdike/jar/htmlparser.jar
zhengdike/jar/smallseg4j_0.6.jar
zhengdike/jar/thumbelina.jar
zhengdike/manifest.mf
zhengdike/nbproject/build-impl.xml
zhengdike/nbproject/genfiles.properties
zhengdike/nbproject/private/private.properties
zhengdike/nbproject/private/private.xml
zhengdike/nbproject/project.properties
zhengdike/nbproject/project.xml
zhengdike/src/zhengdike/classbao/getcharset.java
zhengdike/src/zhengdike/classbao/getfilelist.java
zhengdike/src/zhengdike/classbao/getunicode.java
zhengdike/src/zhengdike/classbao/htmlparser.java
zhengdike/src/zhengdike/classbao/segtext.java
zhengdike/src/zhengdike/classbao/state.java
zhengdike/src/zhengdike/classbao/stateline.java
zhengdike/src/zhengdike/hztounicode.form
zhengdike/src/zhengdike/hztounicode.java
zhengdike/src/zhengdike/mainframe.form
zhengdike/src/zhengdike/mainframe.java
zhengdike/xlciku/1.txt
zhengdike/xlciku/2.txt
zhengdike/xlciku/3.txt
zhengdike/xlciku/4.txt
zhengdike/xlciku/5.txt
zhengdike/xlciku/6.txt
zhengdike/xlciku/7.txt
zhengdike/xlciku/8.txt
zhengdike/src/zhengdike/classbao
zhengdike/ceshi/1
zhengdike/ceshi/2
zhengdike/ceshi/3
zhengdike/ceshi/4
zhengdike/ceshi/5
zhengdike/ceshi/6
zhengdike/ceshi/7
zhengdike/ceshi/8
zhengdike/nbproject/private
zhengdike/src/zhengdike
zhengdike/bin
zhengdike/build
zhengdike/ceshi
zhengdike/jar
zhengdike/nbproject
zhengdike/src
zhengdike/test
zhengdike/xlciku
zhengdike
zhengdike/ceshi/1/www.txt
zhengdike/ceshi/2/www.txt
zhengdike/ceshi/3/www.txt
zhengdike/ceshi/4/WWW.txt
zhengdike/ceshi/5/www.txt
zhengdike/ceshi/6/www.txt
zhengdike/ceshi/7/WWW.txt
zhengdike/ceshi/8/www.txt
zhengdike/jar/filterbuilder.jar
zhengdike/jar/htmllexer.jar
zhengdike/jar/htmlparser.jar
zhengdike/jar/smallseg4j_0.6.jar
zhengdike/jar/thumbelina.jar
zhengdike/manifest.mf
zhengdike/nbproject/build-impl.xml
zhengdike/nbproject/genfiles.properties
zhengdike/nbproject/private/private.properties
zhengdike/nbproject/private/private.xml
zhengdike/nbproject/project.properties
zhengdike/nbproject/project.xml
zhengdike/src/zhengdike/classbao/getcharset.java
zhengdike/src/zhengdike/classbao/getfilelist.java
zhengdike/src/zhengdike/classbao/getunicode.java
zhengdike/src/zhengdike/classbao/htmlparser.java
zhengdike/src/zhengdike/classbao/segtext.java
zhengdike/src/zhengdike/classbao/state.java
zhengdike/src/zhengdike/classbao/stateline.java
zhengdike/src/zhengdike/hztounicode.form
zhengdike/src/zhengdike/hztounicode.java
zhengdike/src/zhengdike/mainframe.form
zhengdike/src/zhengdike/mainframe.java
zhengdike/xlciku/1.txt
zhengdike/xlciku/2.txt
zhengdike/xlciku/3.txt
zhengdike/xlciku/4.txt
zhengdike/xlciku/5.txt
zhengdike/xlciku/6.txt
zhengdike/xlciku/7.txt
zhengdike/xlciku/8.txt
zhengdike/src/zhengdike/classbao
zhengdike/ceshi/1
zhengdike/ceshi/2
zhengdike/ceshi/3
zhengdike/ceshi/4
zhengdike/ceshi/5
zhengdike/ceshi/6
zhengdike/ceshi/7
zhengdike/ceshi/8
zhengdike/nbproject/private
zhengdike/src/zhengdike
zhengdike/bin
zhengdike/build
zhengdike/ceshi
zhengdike/jar
zhengdike/nbproject
zhengdike/src
zhengdike/test
zhengdike/xlciku
zhengdike
本网站为编程资源及源代码搜集、介绍的搜索网站,版权归原作者所有! 粤ICP备11031372号
1999-2046 搜珍网 All Rights Reserved.