资源列表
LDA-topic-model
- 首先声明,这是别人写的LDA主题模型代码,本人测试过,可以运行,但是输出跟输出有点不尽人意,输入的是词的序号和该词在文档中出现的次数,要是可以直接读取文档就完美了。输出是主题以及词在该主题出现的概率,其中得到的主题我就看不懂了,不知道是算法问题,还是因为我的水平有限。在研究LDA主题模型的朋友,可以下载试一下-First statement, which is written by someone else LDA topic model code, I tested, you can run,
NaiveBayesClassifier.m
- I use Matlab 2008a which does not support Naive Bayes Classifier. scr ipt supports normal and kernel distributions. Statistics toolbox for 2008a version is used in the scr ipt. Also includes function for confusionmat
BayesClassify
- Requirements : 1) function fileOpen (user written) to open files : also uploaded 2)function strsplit1 also uploaded 3)training data, also uploaded
NB_All_Variables
- multi class naive bayes algorithm i coded for predicting football results
mixBern
- Just like EM of Gaussian Mixture Model, this is the EM algorithm for fitting Bernoulli Mixture Model. GMM is useful for clustering real value data. However, for binary data (such as bag of word feature) Bernoulli Mixture is more suitable.
datato1ofm
- Take categorical data matrix and transform whole matrix to binary sparse 1ofM matrix, keeping track of what came where. Ideal for any form of count-based probabilistic analysis.-Take categorical data matrix and transform whole matrix to binary sparse
text-mining
- text mining using R programming
google-analytics
- google analytics in R programming
wordcloud_new
- implementation of wordcloud
data--preprocessing-using-kdd-data-set
- Data Mining process model selected is KDD which starts selection of data.Initially the researcher has taken the Kddcup.data-10-perecnt which contains total of 311,027 records which includes both labeled and unlabeled records-Data Mining process model
Ashor
- Initialization Algorithm for k-means
SVM
- State vector machine with single class output. The code works on 32 bit numbers in IEEE 754 floating point format for single precision numbers.