
Even if the data isn't big, there can be a benefit from the Hadoop infrastructure. Say you have just 86,400 rows of data, but each row takes 1 second to process. That adds up to 24 hours of elapsed time, and waiting for that run can be painful, especially if you are trying to experiment and iterate. With HDFS/MapReduce you can distribute that work across N machines and divide the elapsed time by N, speeding up the pace of iteration. I worked on a project with exactly this challenge before Hadoop was available, so we had to invent our own crappy ways of distributing the data to the N machines, monitoring them, and collecting the results. Hadoop HDFS and MapReduce, with the JobTracker etc., would have been much better than what we came up with.
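Hadoop aside, the underlying pattern — an embarrassingly parallel per-row job split across workers — can be sketched on a single machine with Python's multiprocessing; `process_row` here is a hypothetical stand-in for the 1-second-per-row computation:

```python
from multiprocessing import Pool

def process_row(row):
    # Stand-in for the hypothetical 1-second-per-row computation.
    return row * 2

if __name__ == "__main__":
    rows = range(86_400)
    # With N worker processes, elapsed time drops roughly by a factor of N
    # (for CPU-bound work, up to the number of physical cores).
    with Pool(processes=8) as pool:
        results = pool.map(process_row, rows)
    print(len(results))  # 86400 results, computed in parallel
```

The same idea scales out to N machines; the hard parts Hadoop solved are shipping the data, scheduling, and recovering from failed workers.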


Unless your problem is I/O bound (you can't get the data off the disks fast enough) or network bound (transferring data to worker nodes takes too long), Hadoop is the wrong choice. CPU-bound problems are better solved with grid solutions that do a better job of scaling up (within a single node) as well as scaling out to multiple machines. Taking a step back, you should always ask yourself whether this can be done on a single machine, taking advantage of Moore's Law.


What kind of processing takes 1s per row? That's several billion instructions. And you can easily fit 86400 rows in memory, so disk seeks aren't an issue.
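The "several billion instructions" and "fits in memory" claims are just back-of-the-envelope arithmetic; a quick sketch (the 3 GHz clock and 1 KB-per-row figures are assumptions for illustration):

```python
clock_hz = 3_000_000_000      # assume a ~3 GHz core, roughly one instruction per cycle
seconds_per_row = 1
instructions_per_row = clock_hz * seconds_per_row   # ~3 billion instructions per row

rows = 86_400
bytes_per_row = 1_000          # assume ~1 KB per row
total_bytes = rows * bytes_per_row                  # ~86 MB: fits comfortably in RAM
print(instructions_per_row, total_bytes)
```

At ~86 MB the whole dataset sits in memory, so the cost is pure computation, not disk seeks.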


Decent RDBMS servers will parallelise where possible and use the server's 8 cores (or whatever) to optimise such a problem.



