Build A Career in Data Science

Build A Career in Data Science

Build a Career in Data Science is the top guide to help readers get their first data science job, then quickly becoming a senior employee. Industry experts Jacqueline Nolis and Emily Robinson lay out the soft skills readers need alongside their technical know-how in order to succeed in the…

Machine Learning with R, tidyverse, and mlr

Machine Learning with R, tidyverse, and mlr

Machine Learning with R, tidyverse, and mlr teaches readers how to gain valuable insights from their data using the powerful R programming language. In his engaging and informal style, author and R expert Hefin Ioan Rhys lays a firm foundation of ML basics and introduces readers to the tidyverse,…

Data Mashups in R

Data Mashups in R

How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse Html and Xml, and…

Kafka in Action

Kafka in Action

Kafka in Action is a practical, hands-on guide to building Kafka-based data pipelines. Filled with real-world use cases and scenarios, this book probes Kafka’s most common use cases, ranging from simple logging through managing streaming data systems for message routing, analytics, and more. In systems that handle big data,…

Mastering Large Datasets with Python

Mastering Large Datasets with Python

Modern data science solutions need to be clean, easy to read, and scalable. In Mastering Large Datasets with Python, author J.T. Wolohan teaches you how to take a small project and scale it up using a functionally influenced approach to Python coding. You’ll explore methods and built-in Python tools…

R Data Science Quick Reference

R Data Science Quick Reference

In this handy, practical book you will cover each concept concisely, with many illustrative examples. You’ll be introduced to several R data science packages, with examples of how to use each of them. In this book, you’ll learn about the following APIs and packages that deal specifically with…

Data Science with Python and Dask

Data Science with Python and Dask

Dask is a native parallel analytics tool designed to integrate seamlessly with the libraries you’re already using, including Pandas, NumPy, and Scikit-Learn. With Dask you can crunch and work with huge datasets, using the tools you already have. And Data Science with Python and Dask is your guide to using…

Big Data Analytics

Big Data Analytics

This book has a collection of articles written by Big Data experts to describe some of the cutting-edge methods and applications from their respective areas of interest, and provides the reader with a detailed overview of the field of Big Data Analytics as it is practiced today. The chapters cover…