Book For Python Data analysis

PYTHON IN DATA ANALYSIS


Python for Data Analysis

Data Wrangling with Pandas, NumPy, and IPython

Python for Data Analysis is concerned with the nuts and bolts of manipulating, processing, cleaning, and crunching data in Python. It is also a practical, modern introduction to scientific computing in Python, tailored for data-intensive applications. This is a book about the parts of the Python language and libraries you’ll need to effectively solve a broad set of data analysis problems. This book is not an exposition on analytical methods using Python as the implementation language.


Written by Wes McKinney, the main author of the pandas library, this hands-on book is packed with practical cases studies. It’s ideal for analysts new to Python and for Python programmers new to scientific computing.
  • Use the IPython interactive shell as your primary development environment
  • Learn basic and advanced NumPy (Numerical Python) features
  • Get started with data analysis tools in the pandas library
  • Use high-performance tools to load, clean, transform, merge, and reshape data
  • Create scatter plots and static or interactive visualizations with matplotlib
  • Apply the pandas groupby facility to slice, dice, and summarize datasets
  • Measure data by points in time, whether it’s specific instances, fixed periods, or intervals
  • Learn how to solve problems in web analytics, social sciences, finance, and economics, through detailed examples
  • Table of Contents

    1. Chapter 1 Preliminaries

      1. What Is This Book About?

      2. Why Python for Data Analysis?

      3. Essential Python Libraries

      4. Installation and Setup

      5. Community and Conferences

      6. Navigating This Book

      7. Acknowledgements

    2. Chapter 2 Introductory Examples

      1. 1.usa.gov data from bit.ly

      2. MovieLens 1M Data Set

      3. US Baby Names 1880-2010

      4. Conclusions and The Path Ahead

    3. Chapter 3 IPython: An Interactive Computing and Development Environment

      1. IPython Basics

      2. Using the Command History

      3. Interacting with the Operating System

      4. Software Development Tools

      5. IPython HTML Notebook

      6. Tips for Productive Code Development Using IPython

      7. Advanced IPython Features

      8. Credits

    4. Chapter 4 NumPy Basics: Arrays and Vectorized Computation

      1. The NumPy ndarray: A Multidimensional Array Object

      2. Universal Functions: Fast Element-wise Array Functions

      3. Data Processing Using Arrays

      4. File Input and Output with Arrays

      5. Linear Algebra

      6. Random Number Generation

      7. Example: Random Walks

    5. Chapter 5 Getting Started with pandas

      1. Introduction to pandas Data Structures

      2. Essential Functionality

      3. Summarizing and Computing Descriptive Statistics

      4. Handling Missing Data

      5. Hierarchical Indexing

      6. Other pandas Topics

    6. Chapter 6 Data Loading, Storage, and File Formats

      1. Reading and Writing Data in Text Format

      2. Binary Data Formats

      3. Interacting with HTML and Web APIs

      4. Interacting with Databases

    7. Chapter 7 Data Wrangling: Clean, Transform, Merge, Reshape

      1. Combining and Merging Data Sets

      2. Reshaping and Pivoting

      3. Data Transformation

      4. String Manipulation

      5. Example: USDA Food Database

    8. Chapter 8 Plotting and Visualization

      1. A Brief matplotlib API Primer

      2. Plotting Functions in pandas

      3. Plotting Maps: Visualizing Haiti Earthquake Crisis Data

      4. Python Visualization Tool Ecosystem

    9. Chapter 9 Data Aggregation and Group Operations

      1. GroupBy Mechanics

      2. Data Aggregation

      3. Group-wise Operations and Transformations

      4. Pivot Tables and Cross-Tabulation

      5. Example: 2012 Federal Election Commission Database

    10. Chapter 10 Time Series

      1. Date and Time Data Types and Tools

      2. Time Series Basics

      3. Date Ranges, Frequencies, and Shifting

      4. Time Zone Handling

      5. Periods and Period Arithmetic

      6. Resampling and Frequency Conversion

      7. Time Series Plotting

      8. Moving Window Functions

      9. Performance and Memory Usage Notes

    11. Chapter 11 Financial and Economic Data Applications

      1. Data Munging Topics

      2. Group Transforms and Analysis

      3. More Example Applications

    12. Chapter 12 Advanced NumPy

      1. ndarray Object Internals

      2. Advanced Array Manipulation

      3. Broadcasting

      4. Advanced ufunc Usage

      5. Structured and Record Arrays

      6. More About Sorting

      7. NumPy Matrix Class

      8. Advanced Array Input and Output

      9. Performance Tips

    1. Appendix Python Language Essentials

      1. The Python Interpreter

      2. The Basics

      3. Data Structures and Sequences

      4. Functions

      5. Files and the operating system

    2. Colophon



GET THIS BOOK AT   AMAZAN








No comments:

Post a Comment

Pages