MapReduce的并行处理模式给自己增添了各种问题。虽然Java常用于MapReduce程序,你不见得非要成为Java专家才能享受到Hadoop上MapReduce的好处。 三种 MapReduce开发的替代方法包括Pig, streaming MapReduce和域特定语言,比如Scalding。 Pig是一种Hadoop下不借助Java而处理大数据的平台。
Google and its MapReduce framework may rule the roost when it comes to massive-scale data processing, but there’s still plenty of that goodness to go around. This article gets you started with Hadoop, ...
I gave an introductory talk on Hadoop yesterday at the Visual Studio Live! conference in Las Vegas. During the talk, I discussed how Hadoop Streaming, a utility which allows arbitrary executables to ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Hadoop is the most significant concrete technology behind the so called “Big Data” revolution. Hadoop combines an economical model for storing massive quantities of data – the Hadoop Distributed File ...