This book is a guide to using Pandas and NumPy in the Python programming language for data analysis. The author begins by introducing the basics of data science, NumPy, and Pandas. You are shown how to work with NumPy arrays and how to manipulate them, and the operations you can perform on your data with the Pandas library are discussed. You will also learn how to create the various Pandas data structures used to store data. Real-world data is usually dirty, so the book covers the process of cleaning it, including handling outliers and missing values. The author also shows you how to work with data stored in a range of formats, such as MS Excel, CSV, and JSON files. You are guided through calculating various measures for your data, and the process of visualizing data is explored in detail.