30 December 2006
Welcome to Hadoop!
by psylle & 1 otherHadoop is a framework for running applications on large clusters of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named map/reduce, where the appli
Welcome to Nutch!
by psylle & 4 othersNutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.
14 December 2006
1
(3 marks)