资源列表
distributed_word_embedding-master
- The Distributed Word Embedding tool is a parallelization of the Word2Vec algorithm on top of our DMTK parameter server. It provides an efficient scaling to industry size solution for word embedding. -The Distributed Word Embedding tool is a paralle
distributed_skipgram_mixture-master
- The Distributed Multisense Word Embedding(DMWE) tool is a parallelization of the Skip-Gram Mixture [1] algorithm on top of the DMTK parameter server. It provides an efficient scaling to industry size solution for multi sense word embedding. -The Di
fnlp-master
- FNLP涓昏
naivebayes
- 朴素贝叶斯算法 求导致某一结果或现象发生的最可能的条件-Naive Bayes algorithm for the most likely cause of the condition or a result of the phenomenon
maxminjulei
- 最大最小聚类算法改进,和书上的步骤完全一样,可以运行,简单-Maximum and minimum clustering algorithm improvements, and steps on the book exactly the same, it can run, simple
f24
- 使用24点进行快速傅里叶变换fft,进行时域和频域之间的转换-use 24 points to do fft
K-Nearest-Neighbor
- 数据挖掘中经典的KNN(K-最近邻)算法,导入即可运行-Data Mining the classical KNN (K- nearest neighbor) algorithm, you can import operation
keyword_find
- 实现了将pdf转换为txt,并且进行分模块的关键词抽取算法-Realized convert pdf to txt, and dividing module keyword extraction algorithm
pyspark_process
- 使用pyspark进行文本分类算法实现,其中使用了tf-idf表示-Use pyspark text classification algorithm, which uses the tf-idf representation
FP-tree
- 通过pyspark实现了fptree的使用,得到频繁项集-By pyspark realized fptree used to obtain frequent itemsets
libsvm-3.21.tar
- svm仿真,基于c语言的编写,可以用于matlab,python以及java等多种语言-simulation for svm
rgyq
- 人工鱼群算法的实现代码,有效处理优化寻优问题,比遗传算法、PSO具有更好的优化效果-Artificial fish swarm algorithm code to deal effectively optimize the optimization problem, with better optimization results than the genetic algorithm