» tagged pages
» logout

sorted by: recent | see : popular
Content Tagged with parallel + mapreduce

Welcome to Pig!

Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenab

opensource: del.icio.us tag/opensource

The Two Flavors of Google

On Hadoop, and how it might eclipse Map/Reduce in the future, as even Google is backing Hadoop for a variety of applications (Map/Reduce is only good for its limited set of tasks)

opensource: del.icio.us tag/opensource

paper.pdf (application/pdf Object)

Google's MapReduce programming model serves for processing and generating large data sets in a massively parallel manner. We deliver the first rigorous description of the model including its advancement in Google's domain-specific language Sawzall.

Haskell: del.icio.us tag/haskell

Google's MapReduce Programming Model -- Revisited

We revisit the MapReduce programming model in an attempt to provide a rigorous description of the model. We focus on the key abstraction for MapReduce computations; this abstraction is parameterized by the problem-specific ingredients for data extraction

Haskell: del.icio.us tag/haskell