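A multithreaded, distributed crawler built with Java, HttpClient, and HtmlCleaner that scrapes product information from e-commerce sites. Data is stored in HBase, products are indexed with Solr, a Redis queue holds the shared URL repository, and ZooKeeper monitors the lifecycle of the crawler nodes.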
Java · 236 stars · 150 forks
Text feature extraction: implementations of the chi-square test and information gain algorithms for extracting text features.
Java · 18 stars · 9 forks
Forked from binglind/alchemy
A web system built for Flink. It supports defining UDFs in the web page and submitting SQL and JAR jobs, managing sources, sinks, and jobs, and managing Flink clusters on OpenShift.
Java
Forked from hemaGitHub/mppwhater
Forked from harbby/sylph
Stream computing platform for bigdata