Lesson Tutorial: Learn Pandas Basics for Tabular Data Manipulation
Introduction to Pandas and DataFrames
Pandas is a powerful Python library for data manipulation and analysis, built on top of NumPy. The key data structure in Pandas is the DataFrame, which allows you to work with tabular data (rows and columns).
Creating a Pandas DataFrame
1. Creating a DataFrame from a Dictionary
Copy to Clipboard
2. Setting a Custom Index
Copy to Clipboard
3. Creating a DataFrame from a CSV File
Copy to Clipboard
Indexing and Selecting Data in DataFrames
1. Selecting Columns
Copy to Clipboard
2. Selecting Rows
Copy to Clipboard
3. Using loc and iloc
loc: Label-based indexing.iloc: Integer-based indexing.
Copy to Clipboard
Assignment
Use the following dataset to complete the exercises:
Copy to Clipboard
Questions
- Display the DataFrame
df. - Select and print only the “capital” column.
- Select and print both “country” and “population” columns.
- Print the first three rows of the DataFrame.
- Use
locto print the data for “UK”. - Use
ilocto print the last two rows of the DataFrame.
Answer Key
1. Display the DataFrame
Copy to Clipboard
2. Select and Print the “capital” Column
Copy to Clipboard
3. Select and Print “country” and “population” Columns
Copy to Clipboard
4. Print the First Three Rows
Copy to Clipboard
5. Use loc to Print Data for “UK”
Copy to Clipboard
6. Use iloc to Print the Last Two Rows
Copy to Clipboard
Learning Objectives Recap
By the end of this tutorial, you should:
- Understand what a Pandas DataFrame is and how to create one.
- Be able to index and select data using square brackets,
loc, andiloc. - Understand how to load and manipulate tabular data from CSV files.
