Resource list
python-fp-growth-master
- Source code for the FP-growth algorithm, written in Python.
frequency_filters
- A collection of filtering algorithms the author has used in real projects; they have proven usable and effective in practice.
Drools_CleanData
- An example of cleaning dirty data with custom rules on the Drools rules engine.
lof
- LOF (Local Outlier Factor) is an algorithm for identifying density-based local outliers. It uses a local outlier mining method to compute the Local Outlier Factor of each candidate outlier object.
icwsm14-T4-code
- Network analysis with the open-source SNAP network processing toolkit. The code was used in a published paper, and its main purpose is to promote the SNAP toolkit.
DIGG-all
- The graph generator DEGG, which can automatically generate graphs with various properties. Note that these are graphs, not images.
Kalman
- Kalman algorithm example: a Java implementation that filters GPS points with a Kalman filter to smooth the trajectory.
beautifulsoup4-4.0.0b3.tar
- Beautiful Soup provides simple, Pythonic functions for navigating, searching, and modifying the parse tree.
K-means
- A simple k-means implementation that also implements k-nearest neighbours and displays the result as an image; simple and practical.
cnbeta
- Crawls the latest content from cnbeta with Python, using the Scrapy module.
pachongtest2
- Crawls Zhihu Daily with Python: follows every sub-link on the Zhihu Daily page, crawls it, and modifies the content, using the re, urllib2, and BeautifulSoup modules.
beautifulsoup4test1
- Crawls Qiushibaike and processes the crawled content with the BeautifulSoup module.
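
As a rough illustration of what the lof entry above describes, here is a minimal brute-force sketch of the Local Outlier Factor computation in plain Python. It is not the code from that package: the function name `lof_scores` and the parameter `k` are made up for this example, ties at the k-th neighbour are broken arbitrarily (the textbook definition keeps all tied neighbours), and the input is assumed to contain no duplicate points.

```python
import math


def lof_scores(points, k=2):
    """Return one LOF score per point (brute force, O(n^2) distances).

    Hypothetical helper for illustration; assumes len(points) > k and
    no duplicate points.
    """
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]

    # k nearest neighbours of each point (excluding itself) and its k-distance.
    neighbours, k_dist = [], []
    for i in range(n):
        order = sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])
        neighbours.append(order[:k])
        k_dist.append(dist[i][order[k - 1]])

    # Local reachability density: inverse of the mean reachability distance,
    # where reach_dist(i, j) = max(k_dist(j), dist(i, j)).
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], dist[i][j]) for j in neighbours[i]]
        lrd.append(len(reach) / sum(reach))

    # LOF: mean ratio of the neighbours' density to the point's own density.
    return [
        sum(lrd[j] for j in neighbours[i]) / (len(neighbours[i]) * lrd[i])
        for i in range(n)
    ]


if __name__ == "__main__":
    sample = [(0, 0), (0, 1), (1, 0), (1, 1), (5, 5)]
    for point, score in zip(sample, lof_scores(sample, k=2)):
        print(point, round(score, 3))
```

Scores close to 1 indicate a point whose local density matches that of its neighbours; scores well above 1 indicate a point in a sparser region than its neighbours, which is a candidate outlier.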