Jump to ratings and reviews
Rate this book

Text Analytics with Python: A Practitioner's Guide to Natural Language Processing

Rate this book

Leverage Natural Language Processing (NLP) in Python and learn how to set up your own robust environment for performing text analytics. This second edition has gone through a major revamp and introduces several significant changes and new topics based on the recent trends in NLP. 

You’ll see how to use the latest state-of-the-art frameworks in NLP, coupled with machine learning and deep learning models for supervised sentiment analysis powered by Python to solve actual case studies. Start by reviewing Python for NLP fundamentals on strings and text data and move on to engineering representation methods for text data, including both traditional statistical models and newer deep learning-based embedding models. Improved techniques and new methods around parsing and processing text are discussed as well.   

Text summarization and topic models have been overhauled so the book showcases how to build, tune, and interpret topic models in the context of an interest dataset on NIPS conference papers. Additionally, the book covers text similarity techniques with a real-world example of movie recommenders, along with sentiment analysis using supervised and unsupervised techniques.

There is also a chapter dedicated to semantic analysis where you’ll see how to build your own named entity recognition (NER) system from scratch. While the overall structure of the book remains the same, the entire code base, modules, and chapters has been updated to the latest Python 3.x release.


What You'll Learn

• Understand NLP and text syntax, semantics and structure• Discover text cleaning and feature engineering• Review text classification and text clustering • Assess text summarization and topic models• Study deep learning for NLP
Who This Book Is For
IT professionals, data analysts, developers, linguistic experts, data scientists and engineers and basically anyone with a keen interest in linguistics, analytics and generating insights from textual data.

700 pages, Kindle Edition

Published May 21, 2019

60 people are currently reading
73 people want to read

About the author

Dipanjan Sarkar

14 books4 followers

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
15 (30%)
4 stars
27 (55%)
3 stars
6 (12%)
2 stars
0 (0%)
1 star
1 (2%)
Displaying 1 - 4 of 4 reviews
Profile Image for Eugene.
158 reviews15 followers
April 13, 2021
easy to understand explanations of NLP techniques and processes. Good read, yet simple to understand
Profile Image for Uyen.
188 reviews8 followers
November 25, 2022
If you are new to text analytics, natural language processing and all that jazz, this will be a good introductory book to guide you through the concepts, the high level ideas and what will/should be included in the pipeline. Plus it comes with Python implementation (source code available on Github) which is greatttt. However, bear in mind that the stuffs written here will be quite simple, conventional and not particularly in-depth of any techniques which are not unusual for this kind of "introductory" book. Although I wish the author dived a bit deeper into discussing the results from the implementation rather than just showing the mere results and visualisations. Still a good read though.
Profile Image for Mervat.
61 reviews8 followers
April 19, 2020
It's a very useful book though it's a little bit confusing for beginners.
in"lemmatization "section for example the code was too complex, otherwise it can be written in three lines with (pattern)library :
import pattern
from pattern.en import lemma

def lemmatization(text):

lemmatization =" ".join([lemma(wd) for wd in text.split()])
return lemmatization

Displaying 1 - 4 of 4 reviews

Can't find what you're looking for?

Get help and learn more about the design.