搜索资源列表
JAVA实现文本聚类,用到TF/IDF权重
- JAVA实现文本聚类,用到TF/IDF权重,用余弦夹角计算文本相似度,用k-means进行数据聚类等数学和统计 知识。,JAVA realization of text clustering, using TF/IDF weight, calculated using cosine angle between the text of similarity, using k-means clustering for data such as mathematical and statistical
similarity.文本相似度计算余弦相似度代码
- 文本相似度计算余弦相似度代码,计算文本相似度,用于搜索引擎,Cosine similarity of text similarity computation code, the text of the similarity calculation for the search engine
java-version--VSM
- VSM模型的实现,通过java、语言编写的,能够打开多个文件,计算出VSM的值即查询相似度能够得出来。-VSM model realization, through java, language, and can open multiple files, calculate the value of the VSM similarity queries can be out.
DM4
- 执行流程: 1. 用户输入参数:K的选择,训练数据,测试数据的路径; 2. 读取训练数据集和测试数据集文件,用ArffFileReader类读取并组织起InstanceSet数据结构; 3. 利用上面的相似度量标准,对每一个测试集中的Instance,计算与其最相似的K个训练集中的Instance,通过投票进行分类,将分类结果存储经Instance的成员变量targetGuess中; 4. 对分类结果进行度量,包括分类正确率,各种类别实例的Precision,Recall;Con
dtw
- 用java写的DTW程序组合,可用于进行动态弯曲路径的计算和时间序列相似性比较-Using java to write the DTW procedure combinations that can be used for dynamic bending path calculation and comparison of time series similarity
java
- 采用标准的 Levenshtein Distance 算法计算两个文件的相似度。 2。 程序使用简单。选入多个文件,然后按分析便个分析出两个文-Standard Levenshtein Distance algorithm for calculating the similarity of two documents. 2. Using a simple procedure. Selected multiple files, and then analyzed by analysis of tw
similarity
- 通过将待进行计算的向量放入二维数组中,然后编程实现皮尔森和余弦相似度公式,计算向量之间的相似度。-caculate the similarity of vector
RightRate
- 本项目采用Java编写,功能是比较2个文件的相同率的程序-This project is coded by java which function is comparing two file amount of similarity
JavaWordNet
- 基于wordnet的词汇比较程序,词汇比较程序。 能够比较两个词汇的相似度程序,基于wordnet-Java WordNet Similarity.beta.11.01.zip
wordnet
- 使用woednet计算语义相似,输入两个字符,给出相似度-Semantic calculated using woednet, enter two characters, given the similarity
cluster
- 我做的用Kmeans方法,分别采用欧式距离。夹角余弦,和度量函数的方法来表示两点的相似度-I do use Kmeans methods were used Euclidean distance. Angle cosine, and methods of measurement functions to represent the similarity of two
New_Dominant_Color
- Calculates dominante color and measure the similarity between the input image and directory contains other images
1
- 基于面向对象编程的文档向量相似度匹配算法-Object-oriented programming document vector similarity matching algorithm
Mini-HowNet
- mini-hownet,用于查询计算词语的相似度-mini-hownet, terms of similarity calculated for the query
IDFCal
- tf-idf程序,朋友写的,很好。对中文句子进行相似度计算,有计算句子权值、排序、两两句子之间的相似度计算。有语料,可以直接运行-tf-idf program, friends wrote, very good. Similarity calculation for Chinese sentences, the sentence weights are calculated, sort, twenty-two similarity between sub-calculation. A corpu
(java)wenbenjulei
- 文本距离,文本相似度计算的java源代码,内含测试文档-Text from the text similarity calculation java source code, containing the test document
information-selection-VSM--Model
- VSM 实现计算文件的相似度,查询相似度计算出值来-VSM implementation document similarity calculation, the query similarity values calculated
java---topsis
- TOPSIS (Technique of Order Preference by Similarity to Ideal Solution) method is a method for multi-criteria decision making.
similarity
- 网络爬虫相关,计算文档相似性,JAVA编写-Web crawler related document similarity calculation, JAVA write
java-string-similarity-master
- similarity cosine ontology