A file containing Python definitions and statements.
A collection of related modules.
A collection of modules and packages.
Written instructions detailing the functions, methods, and variables available and how to use them.
A data structure that stores and aligns data in a table using rows and columns.
Lists the data types used in each column in the DataFrame
Prints the number of rows and columns in the format (rows, columns)
Lists the data types used in each column, non-null values, and memory usage info
Lists the descriptive statistics for each column.
Lists the first num rows
Lists the last num rows
Lists the rows from index a to index b, exclusively (not including b).
A sequence of steps for processing and using data.
Numerical data that can be counted or measured.
Data that can be divided into different categories.
A one-dimensional, labeled array (or list) that is formatted like a single column of a data table.
Used to describe how data is spread. It also describes the variability of the dataset.
A measure of how spread out a group of numbers are, calculated by taking the square root of the variance.
Used to describe how far each number in the dataset is from the mean and calculated by determining the average of the squared differences from the mean.
The difference between the largest number and the smallest number. The larger the range, the larger the spread or dispersion.
The difference between the first and third quartiles. Indicates how spread out the middle 50% of the set of data is. Helps to avoid outliers affecting the data.
Median of the first half of the data or the 25% point.
Median of the second half of the data or the 75% point.
The process of learning about the world using data and computation.
A question that could have a variety of different answers.