這些年,我們一起追的HadoopAvro:Language-Neutral Data Serialization System (2010-05 成為 Top-Level Project) Mahout:Scalable Library for Machine Learning HBase:Distributed Data Storage (2010-05 成為 Top-Level Project) Pig:High Level Language for Data Analysis Hadoop: Impala Presto Drill/Dremel/BigQuery ... Data Collector: Flume Chukwa Scribe ... Machine Learning: Mahout ... 跟 Hadoop 一起解決 Big Data 問題吧! 47 / 74 Tez Hortonworks 主導 A framework for near real-time 其實也一樣是抽換 Query Planner,從 Hive on MapReduce 變成 Hive on Tez on YARN 58 / 74 架在 Hadoop 上的 Machine Learning 平台 目前提供 Recommendation Mining、 Clustering、Classification 等 Use Case 2014-04-25 發表了 Goodbye MapReduce0 码力 | 74 页 | 45.76 MB | 1 年前3
大数据时代的Intel之Hadoop– Sort – WordCount – TeraSort – Enhanced DFSIO – Nutch Indexing – Page Rank Machine Learning – Bayesian Classification – K-Means Clustering Analytical Query HiBench 1.0 paper (“The HiBench0 码力 | 36 页 | 2.50 MB | 1 年前3
共 2 条
- 1













