Time series data is of growing importance, especially with the rapid expansion of the Internet of Things. This concise guide shows you effective ways to collect, persist, and access large-scale time series data for analysis. You’ll explore the theory behind time series databases and learn practical methods for implementing them. Authors Ted Dunning and Ellen Friedman provide a detailed examination of open source tools such as OpenTSDB and new modifications that greatly speed up data ingestion.
You’ll learn: - A variety of time series use cases - The advantages of NoSQL databases for large-scale time series data - NoSQL table design for high-performance time series databases - The benefits and limitations of OpenTSDB - How to access data in OpenTSDB using R, Go, and Ruby - How time series databases contribute to practical machine learning projects - How to handle the added complexity of geo-temporal data
More like 1.5 stars. The books starts on a good premise of Time Series Databases but soon devolves into a review of OpenTSDB with extensions for fast ingestion developed by MapR. Major thumbs down for repetitiveness of the earlier sentence since the book is sponsored by MapR and written by the developer of those extensions. There is some unnecessary hard selling of these which is off-putting.
However the book redeems itself somewhat with some good use cases of time series databases and some challenges and design condiderations. It does not do a good job of telling alternatives TSDBs such as InfluxDB (mentioned in passing) or RRDs or columnar DBs such as CitusDB or MonetDB which can be used for a similar purpose. The book could have been much shorter for the material covered but also much longer when covering more concepts and internal and alternatives. Don't expect it to be comprehensive though. A decent quick one time fast scan read for those not very familiar with TSDBs.
Not Bad. A quick introduction to time series database concepts suitable for a complete beginner. As other reviewers noted, it was also a bit of an advert for MapR (of which one of the authors is an employee) and the Open TSDB project. The good thing about that is that there are some great specific details of the project which are explored, the bad is that it's not an especially representative sample of the space, which the title suggests it will be.
Excelente paper para entender las nuevas formas de procesamiento de datos extensos y a velocidades inferiores a 1 segundo, rompiendo el paradigma de bases de datos con tablas relacionales.