Software Engineering discussion
date
newest »

message 1:
by
Tom
(new)
Jan 13, 2013 06:41PM

reply
|
flag

I think Riak is academically interesting because it shows the problems and solutions with non-centralized data management, and the opportunity for pure, autonomous, distribution and scale, as well as the resulting fault tolerance benefits.
Practically speaking, a minimal amount of centralized management (like the name node in Hadoop) makes the Riak consistency issues disappear. I am not fond of schemes that leave it up to the user to reconcile versions. Also, the data management language, at least with the HTTP/MIME/JSON approach, seems very complex to me. Finally, although Riak supports MapReduce, it doesn't appear to honor the same contract as Hadoop, which delivers (from the map to the reduce) tuples grouped by common keys, with all the keys on a given reducer in sorted order. But, I didn't dig very far enough into the documentation on this.
Practically speaking, a minimal amount of centralized management (like the name node in Hadoop) makes the Riak consistency issues disappear. I am not fond of schemes that leave it up to the user to reconcile versions. Also, the data management language, at least with the HTTP/MIME/JSON approach, seems very complex to me. Finally, although Riak supports MapReduce, it doesn't appear to honor the same contract as Hadoop, which delivers (from the map to the reduce) tuples grouped by common keys, with all the keys on a given reducer in sorted order. But, I didn't dig very far enough into the documentation on this.