SC Python 2 Glossary

module Python

A file containing Python definitions and statements.

package Python

A collection of related modules.

library Python

A collection of modules and packages.
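
As a rough illustration of the difference between a module and a package, the sketch below imports two parts of Python's standard library. It assumes that math is a single module and json is a package, which is how they ship with standard CPython; the __path__ check is just one informal way to tell them apart.

```python
import math               # math is a single module
import json               # json is a package: a collection of related modules
from json import decoder  # decoder is one of the modules inside the json package

# Use a function defined in the math module
root = math.sqrt(16)
print(root)

# Packages carry a __path__ attribute; plain modules do not
json_is_package = hasattr(json, "__path__")
math_is_package = hasattr(math, "__path__")
print(json_is_package, math_is_package)
```

The standard library as a whole is an example of a library: it bundles many modules and packages together.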

documentation General

Written instructions detailing the functions, methods, and variables available and how to use them.

data frame General

A data structure that stores and aligns data in a table using rows and columns.

table.dtypes Python

Lists the data type of each column in the DataFrame.

table.shape Python

Returns the number of rows and columns as a tuple in the format (rows, columns). It is an attribute, so it is written without parentheses.
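
A minimal sketch of table.dtypes and table.shape, assuming the pandas library is installed and that table is a pandas DataFrame. The student data here is made up purely for illustration.

```python
import pandas as pd

# Hypothetical sample data, invented for this example
table = pd.DataFrame({
    "name": ["Ana", "Ben", "Cy"],
    "age": [15, 16, 17],
    "gpa": [3.5, 3.9, 3.2],
})

# dtypes and shape are attributes, so there are no parentheses
print(table.dtypes)

shape = table.shape
print(shape)
```

Because shape is a tuple, the row count alone is available as table.shape[0] and the column count as table.shape[1].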

table.info() Python

Prints a summary of the DataFrame: the data type of each column, the number of non-null values in each column, and memory usage.

table.describe() Python

Lists descriptive statistics (count, mean, standard deviation, minimum, quartiles, and maximum) for each numeric column.
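
The following sketch shows table.info() and table.describe() side by side, assuming pandas is installed. The city and temperature values are hypothetical, and one temperature is deliberately missing so the non-null count differs from the row count.

```python
import pandas as pd

# Hypothetical sample data; one temperature is missing on purpose
table = pd.DataFrame({
    "city": ["Austin", "Boise", "Camden", "Dover"],
    "temp_f": [98.0, 85.5, None, 79.0],
})

# info() prints its summary directly and returns None
table.info()

# describe() returns a new DataFrame of statistics
stats = table.describe()
print(stats)
```

With its default settings, describe() summarizes only the numeric column here, and its count row counts non-null values, so the missing temperature is not included.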

table.head(num) Python

Lists the first num rows of the table. If num is omitted, the first 5 rows are listed.

table.tail(num) Python

Lists the last num rows of the table. If num is omitted, the last 5 rows are listed.

table[a:b] Python

Lists the rows from position a up to, but not including, position b.
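
A short sketch of head, tail, and row slicing, assuming pandas is installed. The single column n and its values are made up so that the selected rows are easy to check by eye.

```python
import pandas as pd

# Hypothetical table with six rows, at positions 0 through 5
table = pd.DataFrame({"n": [10, 20, 30, 40, 50, 60]})

first_two = table.head(2)   # rows at positions 0 and 1
last_two = table.tail(2)    # rows at positions 4 and 5
middle = table[1:4]         # rows at positions 1, 2, and 3 (4 is not included)

print(first_two)
print(last_two)
print(middle)
```

As with Python lists, the slice stops one row before b, so table[1:4] contains three rows.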

data cycle General

A sequence of steps for processing and using data.

quantitative data General

Numerical data that can be counted or measured.

qualitative data General

Non-numerical data that describes qualities or characteristics and can be divided into categories.

series Python

A one-dimensional, labeled array (or list) that is formatted like a single column of a data table.
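
As an illustration, selecting one column from a DataFrame gives a Series. The sketch below assumes pandas is installed; the names and scores are hypothetical.

```python
import pandas as pd

# Hypothetical sample data
table = pd.DataFrame({
    "name": ["Ana", "Ben", "Cy"],
    "score": [88, 92, 75],
})

# Selecting a single column returns a Series
scores = table["score"]
print(type(scores))

# A Series can also be built directly, with custom labels for each value
standalone = pd.Series([88, 92, 75], index=["Ana", "Ben", "Cy"])
ben_score = standalone["Ben"]
print(ben_score)
```

The labels (the index) are what distinguish a Series from a plain Python list: values can be looked up by label as well as by position.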

measure of spread General

A statistic that describes how spread out the data is, that is, the variability of the dataset. Examples include the range, interquartile range, variance, and standard deviation.

standard deviation General

A measure of how spread out a group of numbers is, calculated by taking the square root of the variance.

variance General

A measure of how far the numbers in a dataset are from the mean, calculated as the average of the squared differences from the mean.
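
The two definitions above can be worked through step by step. This sketch uses only Python's standard library and a small made-up dataset; it divides by the number of values, which is the population form of the variance and matches the "average of the squared differences" wording.

```python
import math
import statistics

# Hypothetical dataset chosen so the arithmetic comes out evenly
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Step 1: find the mean
mean = sum(data) / len(data)

# Step 2: square each value's difference from the mean
squared_diffs = [(x - mean) ** 2 for x in data]

# Step 3: the variance is the average of those squared differences
variance = sum(squared_diffs) / len(data)

# Step 4: the standard deviation is the square root of the variance
std_dev = math.sqrt(variance)

print(mean, variance, std_dev)

# The statistics module provides the same population calculations
print(statistics.pvariance(data), statistics.pstdev(data))
```

Note that pandas' var() and std() methods divide by one less than the number of values by default (the sample form), so their results can differ slightly from this calculation.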

range General

The difference between the largest number and the smallest number. The larger the range, the larger the spread or dispersion.

interquartile range (IQR) General

The difference between the third and first quartiles (Q3 - Q1). Indicates how spread out the middle 50% of the data is. Because it ignores the lowest and highest quarters of the data, it is less affected by outliers than the range.

first quartile General

The median of the lower half of the sorted data, also called the 25% point.

third quartile General

The median of the upper half of the sorted data, also called the 75% point.
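
A sketch of the range, quartiles, and IQR using the "median of each half" approach from the definitions above. It uses only the standard library and made-up data. The helper name quartiles is hypothetical, and leaving the middle value out of both halves when the count is odd is an assumption (other conventions exist, and library functions may use a different method).

```python
import statistics

def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    ordered = sorted(values)
    half = len(ordered) // 2
    lower = ordered[:half]
    # Assumption: with an odd count, the middle value is left out of both halves
    upper = ordered[half:] if len(ordered) % 2 == 0 else ordered[half + 1:]
    return statistics.median(lower), statistics.median(upper)

# Hypothetical dataset
data = [7, 1, 3, 9, 5, 11, 13, 15]

data_range = max(data) - min(data)
q1, q3 = quartiles(data)
iqr = q3 - q1
print(data_range, q1, q3, iqr)

# Replace the largest value with an outlier and compare
with_outlier = [7, 1, 3, 9, 5, 11, 13, 150]
outlier_range = max(with_outlier) - min(with_outlier)
oq1, oq3 = quartiles(with_outlier)
outlier_iqr = oq3 - oq1
print(outlier_range, outlier_iqr)
```

In this example the outlier stretches the range from 14 to 149, while the IQR stays at 8.0, which is why the IQR is often preferred when a dataset contains extreme values.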

data science General

The process of learning about the world using data and computation.

statistical question General

A question that is answered by collecting data and that could have a variety of different answers.