DataCamp
DataCamp Logo

Data Manipulation with Pandas 

  • Offered byDataCamp

Data Manipulation with Pandas
 at 
DataCamp 
Overview

Learn how to import and clean data, calculate statistics, and create visualizations with pandas

Duration

4 hours

Mode of learning

Online

Difficulty level

Beginner

Credential

Certificate

Data Manipulation with Pandas
 at 
DataCamp 
Highlights

  • 15 Videos
  • 56 Exercises
  • Data Analyst with Python Track
  • Data Manipulation with Python Track
  • Data Scientist with Python Track
  • Python Programmer Track
Read more
Details Icon

Data Manipulation with Pandas
 at 
DataCamp 
Course details

What are the course deliverables?
  • Discover Data Manipulation with pandas
  • Work with pandas Data to Explore Core Data Science Concepts
  • Learn to Manipulate DataFrames
More about this course
  • With this course, Learner willll learn why pandas is the world's most popular Python library, used for everything from data manipulation to data analysis
  • Learner will explore how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis
  • Learner will explore all the core data science concepts
  • Learner will learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python
  • Learner will start by mastering the pandas basics, including how to inspect DataFrames and perform some fundamental manipulations
  • Learner will also learn about aggregating DataFrames, before moving on to slicing and indexing
  • Learner will wrap up the course by learning how to visualize the contents of your DataFrames, working with a dataset that contains weekly US avocado sales
  • Learner will understand how to use this Python library for data manipulation
  • Learner will have an understanding of DataFrames and how to use them, as well as be able to visualize data in Python
Read more

Data Manipulation with Pandas
 at 
DataCamp 
Curriculum

Transforming DataFrames

Introducing DataFrames

Inspecting a DataFrame

Parts of a DataFrame

Sorting and subsetting

Sorting rows

Subsetting columns

Subsetting rows

Subsetting rows by categorical variables

New columns

Adding new columns

Combo-attack!

Aggregating DataFrames

Summary statistics

Mean and median

Summarizing dates

Efficient summaries

Cumulative statistics

Counting

Dropping duplicates

Counting categorical variables

Grouped summary statistics

What percent of sales occurred at each store type?

Calculations with .groupby()

Multiple grouped summaries

Pivot tables

Pivoting on one variable

Fill in missing values and sum values with pivot tables

Slicing and Indexing DataFrames

Explicit indexes

Setting and removing indexes

Subsetting with .loc[]

Setting multi-level indexes

Sorting by index values

Slicing and subsetting with .loc and .iloc

Slicing index values

Slicing in both directions

Slicing time series

Subsetting by row/column number

Working with pivot tables

Pivot temperature by city and year

Subsetting pivot tables

Calculating on a pivot table

Creating and Visualizing DataFrames

Visualizing your data

Which avocado size is most popular?

Changes in sales over time

Avocado supply and demand

Price of conventional vs. organic avocados

Missing values

Finding missing values

Removing missing values

Replacing missing values

Creating DataFrames

List of dictionaries

Dictionary of lists

Reading and writing CSVs

CSV to DataFrame

DataFrame to CSV

Wrap-up

Other courses offered by DataCamp

26 hoursCertificate
– / –
– / –
– / –
– / –
4 hoursCertificate
– / –
– / –
– / –
– / –
View Other 46 CoursesRight Arrow Icon
qna

Data Manipulation with Pandas
 at 
DataCamp 

Student Forum

chatAnything you would want to ask experts?
Write here...