Jump to ratings and reviews
Rate this book

Hadoop in Action

Rate this book
Hadoop in Action teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. Hadoop in Action will lead the reader from obtaining a copy of Hadoop to setting it up in a cluster and writing data analytic programs.

The book begins by making the basic idea of Hadoop and MapReduce easier to grasp by applying the default Hadoop installation to a few easy-to-follow tasks, such as analyzing changes in word frequency across a body of documents. The book continues through the basic concepts of MapReduce applications developed using Hadoop, including a close look at framework components, use of Hadoop for a variety of data analysis tasks, and numerous examples of Hadoop in action.

Hadoop in Action will explain how to use Hadoop and present design patterns and practices of programming MapReduce. MapReduce is a complex idea both conceptually and in its implementation, and Hadoop users are challenged to learn all the knobs and levers for running Hadoop. This book takes you beyond the mechanics of running Hadoop, teaching you to write meaningful programs in a MapReduce framework.

This book assumes the reader will have a basic familiarity with Java, as most code examples will be written in Java. Familiarity with basic statistical concepts (e.g. histogram, correlation) will help the reader appreciate the more advanced data processing examples.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.

325 pages, Paperback

First published January 1, 2010

12 people are currently reading
269 people want to read

About the author

Chuck Lam

1 book1 follower

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
24 (20%)
4 stars
45 (39%)
3 stars
32 (27%)
2 stars
11 (9%)
1 star
3 (2%)
Displaying 1 - 9 of 9 reviews
Profile Image for Todd N.
357 reviews256 followers
February 19, 2012
Good overview of the Hadoop core. I particularly liked the coverage of Hadoop Streaming, which is handy if (like me) you are approaching Hadoop without a lot of Java experience.

I prefer the O'Reilly book on Hadoop by Tom White to this book because the O'Reilly book goes into more depth and has better coverage of the related projects. However, Hadoop In Action is excellent as an extended tutorial on Hadoop with clear examples.

I also liked the way that you can register the book and get multiple electronic formats for the book.
Profile Image for Yilmaz Aksoy.
62 reviews1 follower
March 7, 2018
Eski bir sürüm için ve çabucak fikir sahibi olmak için okunabilir. Benim de hedefim buydu. Yer yer kitaptaki şeyleri deneyemedim, çünkü sistemi apache'nin planladığı şekilde kurmayı beceremedim. Şimdi bir de kurulum üzerine bir kitap okumayı deneyeyim. Bu sistem üzerine ilginç ve kullanabileceğim teknikler var gerçekten de.
Profile Image for Paul Childs.
183 reviews3 followers
August 5, 2011
Read this book to find out what Hadoop is since we are going to start using it at work. I found the chapters the introductory chapters pretty useful in getting an idea of that Hadoop can do, and the chapter on how to manage a Hadoop cluster was helpful too.

Much of the book has to do with using MapReduce programming to work with the data that Hadoop will store. Since this was not my main interest I only skimmed some of that since I wouldn't be using it anyway. The book was very well written though.

When actually following through the processes he describes though to set up a hadoop cluster nothing seems to actually work the way that it is described in the book. It has been a frustrating experience setting this up and the book hasn't made the actual process any easier.
16 reviews
January 30, 2012
Lam explained the theory behind Hadoop very well. The environment settings and running examples are actually scattered on Chapter 1 and Chapter 2. This would not be the best way for the beginners. When you crossed Chapter 5, you would definitely very handful in Hadoop. This is one good point about this book. The US Patent example make me but uncomfortable.
Profile Image for Brijesh Bolar.
10 reviews6 followers
February 28, 2013
The best book to get up and running with Hadoop. As a beginner I found Hadoop the Definitive Guide very intimidating so this book gave me a head start. Reading Definitive Guide after reading Hadoop in Action and digging in some Map Reduce code on CDH4 was easier.
Profile Image for Vadim.
116 reviews2 followers
July 17, 2014
Quite interesting to read even if you do not intend to use Hadoop on a daily basis. It describes the "map-reduce" way of thinking and some real-world usage examples. Hadoop extensions described (Pig Latin and Hive) are also particularly interesting.
4 reviews1 follower
October 10, 2012
By now it is pretty outdated because of the new API, but you can still learn about mapreduce and HDFS reading this book.
Displaying 1 - 9 of 9 reviews

Can't find what you're looking for?

Get help and learn more about the design.