Machine Learning - Made Easy To UnderstandIf you are looking for a book to help you understand how the machine learning algorithms "Random Forest" and "Decision Trees" work behind the scenes, then this is a good book for you. Those two algorithms are commonly used in a variety of applications including big data analysis for industry and data analysis competitions like you would find on Kaggle.
This book explains how Decision Trees work and how they can be combined into a Random Forest to reduce many of the common problems with decision trees, such as overfitting the training data.
Several Dozen Visual ExamplesEquations are great for really understanding every last detail of an algorithm. But to get a basic idea of how something works, in a way that will stick with you 6 months later, nothing beats pictures. This book contains several dozen images which detail things such as how a decision tree picks what splits it will make, how a decision tree can over fit its data, and how multiple decision trees can be combined to form a random forest.
This Is Not A TextbookMost books, and other information on machine learning, that I have seen fall into one of two categories, they are either textbooks that explain an algorithm in a way similar to "And then the algorithm optimizes this loss function" or they focus entirely on how to set up code to use the algorithm and how to tune the parameters.
This book takes a different approach that is based on providing simple examples of how Decision Trees and Random Forests work, and building on those examples step by step to encompass the more complicated parts of the algorithms. The actual equations behind decision trees and random forests get explained by breaking them down and showing what each part of the equation does, and how it affects the examples in question.
Python Files & Excel File For Many Of The Examples Shown In The BookSome topics in machine learning don't lend themselves to equations in an Excel table. Things like error checking or complicated conditionals are hard to replicate outside of code. However some topics work quite well in a spreadsheet. Topics such as entropy and information gain, which is how a decision tree picks its splits, can be easily calculated in a spreadsheet. The spreadsheet used to generate many of the examples in this book is available for free download, as are all of the Python scripts that ran the Random Forests & Decision Trees in this book and generated many of the plots and images.
If you are someone who learns by playing with the code, and editing the data or equations to see what changes, then use those resources along with the book for a deeper understanding.
Topics CoveredThe topics covered in this book are
An overview of decision trees and random forestsA manual example of how a human would classify a dataset, compared to how a decision tree would workHow a decision tree works, and why it is prone to overfittingHow decision trees get combined to form a random forestHow to use that random forest to classify data and make predictionsHow to determine how many trees to use in a random forestJu
One of the greatest books I read about tree-based models. The book includes all important info, such as information theory, gini & entropy concepts, out of bag, in depth on how the model works, and many more.
Also this book provides example by using small dataset which helps readers to understand the underlying concepts quite well.
I would recommend for those who are working with tree-based model to read this book.
This book is perfect for people who wants to know how Tree Based model works. It used easy and understandable way to explain the algorithms and how it is adopted. Definitely recommended it for anyone who wants to be able to say now I know how Tree Based model works!
A lucid read to learn the intuition and concepts behind Decision Trees and Random Forests. Machine learning concepts were familiar to me for sometime now, yet I never felt this confident about fiddling with the approaches, having spent most of my time in feature engineering. This was an extremely quick refresher which I could use immediately.
Easy to read, and focuses on the core concepts. I’d say that if you have no background in statistics, this is approachable though you may have questions as to the mathematical principles (e.g., logarithms in Entropy, and why one uses that to calculate information gain as opposed to Gini).
One major improvement would be to add clearer definitions earlier on in the book for certain terms used repeatedly (node, leaf, branch, tree, feature, category). Some of these are supposed to be intuitive, though the author could be more disciplined in how he applies each of these terms.
I like how it was written - clear and concise with examples
I was reading content related to Random Forest that I found on the web but I couldn't really grasp what's happening underneath, until I found this book. It shows you the concept and mechanism on how to use RF, why it works and when not to used it. After reading it, I got a feeling that I can write the algorithm myself too! Good book to read!
This is the first book I have read on this topic, and I found it very easy to understand. I have a background in more traditional data analysis such as multiple linear regression, control charts and cluster analysis. This book has given me a good start in extending my options for analyzing data.
Very basic concept for those who want to learn about Machine Learning. Fortunately, the explanations are easy to understand, but if you already knew some of them, you must thought that the author could explore a bit more to add the joy. I gave 3 stars because it made me want to read his other more advanced books.
This is an outstanding primer for the layperson or beginning student. In just under an hour, this book will provide the reader a solid foundation for understanding how Random Forests and Decision Trees work. Scott provides you clear examples and access to resources (data and code) to practice the examples on your own.
Being my first read on machine learning, I absolutely liked the concise and a very hands-on approach used by the author to keep the text engaging. Highly recommended if you are looking to learn about some machine learning algorithms and resources to get started.
It's a great book and provide lot of details on random forest algorithm, how it works, how gini works, how feature importance works and all. I recommend this book for anyone starting to use ML or one who is already using Random forest without knowing how it works. Thanks
Would recommend this book to anyone interested in learning about Random Forests. It is a clear intro, relating to human thought processes, and giving simple mathematical examples each time. I feel it has increased my knowledge in the field greatly, and my next mission to to read the Gradient Boosting book by the same author!
I read this book to quickly refresh on random forests for a data science course and found the examples to be simple and intuitive. Links and references are included throughout if you need more details.
I was curious about machine learning, and decided to purchase. I found that the book was pitched at the right level. Some links to supporting material were also provided (always useful). Superb as a beginner's/introductory book.
The book has a nice flow. All the concepts have been properly explained. I feel its a very level zero kind of book. But i appreviate Scott for witong this bokk, because the fundamentals are way more important than the implementation.
Coming from a recommendation from another author; I’m glad I picked this up as it gives clear explanations on the generation of decision trees and random forests. Will look into the author’s other offerings.
Basic but energetic review of Implementing decision trees in a machine learning environment. Not too much statistics, not too deep, but relatively thorough.
It's no easy task to explain random forests with any detail in a few pages, but this was a solid attempt a little hard to absorb in some places, but very helpful.
Very intuitive guide, the author builds the knowledge about random forests without difficult words. For me, the main topics he covers are: - How it works; - How to measure the different built forests; - How to understand better the generated models and data.
Written in a way that makes it easy to understand. Examples are also rather straightforward. Book has a pace without unnecessary content that would make it too long. Just contains what is necessary to grasp the topic and nothing more.
This quick intro gives you all the elements to know how to apply Random Forest to your dataset and understand the results and use them in your decision making processes
I finished this book in one gym biking session. It was a great introduction to Random Forests/Decision Trees. I'll be picking up more of Scott's guides.
Pretty good book for beginners to get an easy understanding of Random Forests and Decision Trees. Well illustrated examples in semi-tech and semi-layman terms.
This was a great book on Random Forests and would probably make a great first lecture for a class, it is simple to understand so big kudos to Scott on this.