搜索资源列表
POETR
- 此为POET的R语言源代码程序,目的是为实现最大方差估计-Large Covariance Estimation by Thresholding Principal Orthogonal Complements
FCM
- 核聚类算法:聚类是将一组给定的未知类标号的样本分成内在的多个类别,使得同一类中 的样本具有较高的相似度,而不同类中的样本差别大。侧重于软聚类(模糊C-均值——FCM),但其描述手段同样适合于硬聚 类(HCM)等同类问题。-Clustering algorithm: cluster is a group of unknown samples given class label into internal multiple categories, so that the same class
rvm-wind-power-forecast
- 相关向量机用于风电场功率预测和各种大数据分类问题,单变量输出-Relevance vector machine for a wind farm power prediction and a variety of large data classification problem, univariate output
SAP_HANA_pdf
- HANA是一个软硬件结合体,提供高性能的数据查询功能,用户可以直接对大量实时业务数据进行查询和分析,而不需要对业务数据进行建模、聚合。-HANA is a combination of hardware and software to provide high-performance data query capabilities, users can perform a large number of real-time business data query and analysis dir
change-detection
- 对数据做变化检测,分析哪些部分变化比较大,并绘图-The data do change detection, which analyzes some of these changes is relatively large, and drawing
big-data
- 介绍了大数据时代的发展以及大数据与大数据在经济学的应用,还有大数据的安全与隐私保护,网络大数据的现状与展望-It introduced the development of big data and big data era of big data applications and economics, as well as large data security and privacy protection, the current situation and prospect of larg
large-data
- 对大数据的应用,很有启发负荷预测。我希望你能有所帮助。-Load forecasting on large data applications, very enlightening. I hope you can help.
FPtree
- 数据挖掘中关联规则算法的FPtree算法的Python实现。FPtree算法比apriori算法更擅于处理大规模的数据-Data Mining Association Rules algorithm FPtree algorithm implemented in Python. FPtree algorithm apriori algorithm is more than adept at handling large data
lightlda-master
- LightLDA is a distributed system for large scale topic modeling. It implements a distributed sampler that enables very large data sizes and models. LightLDA improves sampling throughput and convergence speed via a fast O(1) metropolis-Hastings algori
gplvm
- 这是一个用于高斯过程隐变量模型的工具箱,其中包含了MATLAB/C/PYTHON三种语言版本-As of July 2005 a C++ implementation of the GPLVM exists which has most of the flexibility of this software but runs much faster. However as of this time it cannot handle very large data sets as the spar
Data-partitioning
- 在海量数据处理中可以用到,对大量数据的预处理,可以知道数据分布情况,并进行归类。 -In the mass data processing can be used for a large number of pre-processing of data, we can know the distribution of data, and collation.
suanfasheji
- 常见算法设计与分析源代码,包括背包问题,n皇后问题,大整数乘法,阶乘问题,汉诺塔等-Common algorithm design and analysis of source code, including the knapsack problem, n queens problem, large integer multiplication, factorial problem, Tower of Hanoi, etc.
DataTest
- 统计一亿个IP中每个出现的次数,找不到大数据之类的分类,只能选择数据挖掘-Statistics IP in one hundred million times each appears, can not find such a large data classification, data mining can only choose
AprioriMain
- Apriory算法是数据挖掘中常用的挖掘初始数据的算法,传统的apriory算法在大数据的情况下实现效率很低,我通过java中的hash结构进行了改进,将效率提高。-Apriory data mining algorithms commonly used in the initial data mining algorithms, the traditional apriory inefficient algorithm in the case of large data, I have bee
ex4-003(Week5)_finished
- week5 百度大数据实验室NG的机器学习教程,包括文档以及代码,有些基本的分类聚类的机器学习算法,很有帮助-week5 Baidu large data laboratory NG machine learning tutorials, including documentation and code, some basic classification machine learning clustering algorithm helpful
ex5-003(Week6)_finished
- week6 百度大数据实验室NG的机器学习教程,包括文档以及代码,有些基本的分类聚类的机器学习算法,对于初学者很有帮助-week6 Baidu large data laboratory NG machine learning tutorials, including documentation and code, some basic clustering classification machine learning algorithms, very helpful for beginne
DS3_v1.1
- 基于相异性的稀疏子集选择。作者是大名鼎鼎的Ehsan Elhamifar。-Dissimilarity-based Sparse Subset Selection (DS3) is an algorithm based on simultaneous sparse recovery for finding data/model representatives a large collection of data/models.
code_BPMF
- 如何使它工作: 1。创建一个单独的目录,并将所有这些文件下载到相同的目录中 2。下载7个文件: *demo:主文件demo:PMF和贝叶斯PMF * PMF.m:训练的PMF模型 * bayespmf.m贝叶斯PMF模型实现吉布斯采样器。 * moviedata.mat样本数据包含三元组(user_id,movie_id,评分) * makematrix.m:辅助功能转换成大型矩阵的三元组。 * PRED.m:辅助功能使得预测验证集。 三.在Matlab只需运
powerest
- 用于大量数据挖掘,预测,在很多领域有实际,有不错的实际应用功能。-For a large number of data mining, prediction, in many areas have practical, have a good practical application function.
OpenTSTOOL
- 时间序列处理工具箱,大量丰富的处理函数,可供选择。(Time series processing toolbox, a large number of processing functions, available for selection.)