DSCI011ProgramminginPythonforDataScience

Module 1: Python & Pandas - An Unexpected Friendship

In this module, you will be introduced to dataframes and the Pandas Python package.

0Module Learning Outcomes

1Introduction to Dataframes

2Describing a Dataframe

3Introduction to Pandas

4Definitions

5Your First Code!

6Your Second Code!

7Slicing Rows Using df.loc[]

8Slicing and Dicing Practice

9 Practicing Rows Slicing

10Practicing Column Slicing

11 Slicing Columns Using .loc[]

12Slicing Columns Only

13Selecting Using df.loc[]

14Rearranging Columns and Rows

15Practicing Selecting Using Index Labels

16Selecting Values from a Dataframe

17Practicing Selecting Values

18Selecting Single Columns

19Practicing Selecting

20Slicing and Selecting Using df.iloc[]

21Practicing Slicing and Selecting Using Index Position

22Practicing Slicing Using Index Position

23Practicing Selecting Using Index Position

24Sorting Dataframes

25Practice Sorting

26Summary Statistics

27Fruit Bowl Statistics

28Using df.describe()

29Frequency Tables and Writing CSVs

30Practicing Frequency Tables

31Quick Viz with Altair

32Practicing Bar Charts

33Practicing Scatterplots

34What Did We Just Learn?

About this course

Basic programming in Python. Overview of iteration and flow control and data types relevant to data exploration and analysis. When and how to exploit pre-existing libraries. Numerical data types with Numpy and tabular data with Pandas.

About the program

The University of British Columbia (UBC) is a comprehensive research-intensive university, consistently ranked among the 40 best universities in the world. The MDS Mid Career Learners program was launched in September 2020 and is offered by the MDS program who are a collaboration between the UBC Department of Computer Science and Department of Statistics.