Module 4: Pandas and Plotting

Class resources

Organisation

  • Hour 1: the basics (concepts 1-4) and questions 1-7 from the exercises
  • Hour 2: slicing, renaming, summarising (concepts 5-7) and questions 8-11 from the exercises
  • Hour 3: operations with datasets, rules for data management (concepts 8-9) and questions 12-14 from the exercises

Concepts

  1. What is modern Pandas?
  2. Anatomy of a DataFrame: column names and index
  3. Data types in Pandas
  4. Loading and saving data
  5. Renaming columns, setting indices
  6. Selection of rows and columns
  7. Inspecting and summarising data
  8. Concatenating and merging
  9. Rules for data management
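
As a rough preview of how concepts 4 to 8 fit together, here is a minimal sketch. It is not taken from the class material: the city and temperature data, the column names, and the variable names are all invented for illustration, and the in-memory buffer merely stands in for a real file on disk.

```python
import io

import pandas as pd

# Hypothetical toy data, invented for this illustration only.
df = pd.DataFrame({"City": ["Oslo", "Lima", "Pune"], "Temp C": [4.5, 19.0, 27.5]})

# Concept 5: rename columns and set an index.
df = df.rename(columns={"City": "city", "Temp C": "temp_c"}).set_index("city")

# Concept 6: select rows with a boolean condition.
warm = df.loc[df["temp_c"] > 10]

# Concept 7: summarise a column.
mean_temp = df["temp_c"].mean()

# Concept 8: concatenate rows, then merge in a second table on a shared key.
extra = pd.DataFrame({"city": ["Cairo"], "temp_c": [22.0]}).set_index("city")
combined = pd.concat([df, extra])
continents = pd.DataFrame(
    {
        "city": ["Oslo", "Lima", "Pune", "Cairo"],
        "continent": ["Europe", "South America", "Asia", "Africa"],
    }
)
merged = combined.reset_index().merge(continents, on="city", how="left")

# Concept 4: save to CSV and load it back (a buffer replaces a file path here).
buffer = io.StringIO()
combined.to_csv(buffer)
buffer.seek(0)
reloaded = pd.read_csv(buffer, index_col="city")
```

The left merge keeps every row of the concatenated table and attaches the matching continent, which is one common way to combine two datasets that share a key column.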

Learning objectives

After this lesson, you should be able to:

  • Methods
    • Create, save and manipulate DataFrames
    • Organise your workflow according to the rules of data management
  • Personal
    • Feel confident managing data in Python
    • Not be discouraged by errors you do not understand immediately
  • Social
    • Fix errors jointly with your peers