Python Data Science Handbook
The “Python Data Science Handbook” is a comprehensive guide that delves into the practical aspects of data science using Python. It covers essential topics such as data manipulation, visualization, machine learning, and more. The handbook provides detailed explanations and examples to help readers understand and apply Python tools and libraries effectively in the field of data science. With a focus on practicality and real-world applications, this resource serves as a valuable reference for both beginners and experienced data scientists looking to enhance their Python skills and excel in the realm of data science.
Geographic Data with Basemap
One common type of visualization in data science is that of geographic data. Matplotlib's main tool for this type of visualization is the Basemap toolkit, which is one of several Matplotlib toolkits w...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Working with Time Series
Pandas was developed in the context of financial modeling, so as you might expect, it contains a fairly extensive set of tools for working with dates, times, and time-indexed data. Date and time data ...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Data Manipulation with Pandas
In the previous chapter, we dove into detail on NumPy and its ndarray object, which provides efficient storage and manipulation of dense typed arrays in Python. Here we'll build on this knowledge by l...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Python Data Science Handbook
This website contains the full text of the Python Data Science Handbook by Jake VanderPlas; the content is available on GitHub in the form of Jupyter notebooks. The text is released under the CC-BY-NC...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Preface
What Is Data Science? This is a book about doing data science with Python, which immediately begs the question: what is data science ? It's a surprisingly hard definition to nail down, especially give...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
IPython: Beyond Normal Python
There are many options for development environments for Python, and I'm often asked which one I use in my own work. My answer sometimes surprises people: my preferred environment is IPython plus a tex...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Help and Documentation in IPython
If you read no other section in this chapter, read this one: I find the tools discussed here to be the most transformative contributions of IPython to my daily workflow. When a technologically-minded ...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Structured Data: NumPy's Structured Arrays
While often our data can be well represented by a homogeneous array of values, sometimes this is not the case. This section demonstrates the use of NumPy's structured arrays and record arrays , which ...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Hierarchical Indexing
Up to this point we've been focused primarily on one-dimensional and two-dimensional data, stored in Pandas Series and DataFrame objects, respectively. Often it is useful to go beyond this and store h...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Handling Missing Data
The difference between data found in many tutorials and data in the real world is that real-world data is rarely clean and homogeneous. In particular, many interesting datasets will have some amount o...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Introducing Pandas Objects
At the very basic level, Pandas objects can be thought of as enhanced versions of NumPy structured arrays in which the rows and columns are identified with labels rather than simple integer indices. A...
📚 Read more at Python Data Science Handbook🔎 Find similar documents
Data Indexing and Selection
In Chapter 2 , we looked in detail at methods and tools to access, set, and modify values in NumPy arrays. These included indexing (e.g., arr[2, 1] ), slicing (e.g., arr[:, 1:5] ), masking (e.g., arr[...
📚 Read more at Python Data Science Handbook🔎 Find similar documents