Course Overview
Python is a multi-paradigm programming language which has become the language of choice for data analysis, data visualisation and machine learning. This Data Analysis with Python course provides a concrete first step in learning data analysis.
Skills Required
This course is suitable for those who already have basic knowledge of Python programming. Our Python Programming for Beginners Training Courses cover all of the prerequisites.
-
Introduction to Data Analysis
- What is data analysis
- Why data analysis
- Types of data analysis
- Process flow of data analysis
-
Introduction to Pandas
- Defining pandas library
- Why do we need pandas library
- Pandas data structure
- Exploring the data of a DataFrame
- Selecting data from DataFrame
- Data cleaning in pandas DataFrame
- Grouping and aggregation
- Sorting and ranking
- Adding row into DataFrame
- Adding column into DataFrame
- Dropping the row/column from DataFrame
- Concatenating the dataframe
- Merging/joining the dataframe
- The merge() function
- The join() function
- Writing the DataFrame to external files.
-
Introduction to NumPy
- Pandas & NumPy Intro
- Numpy Arrays & Array Properties
- Array Creation
- Random Number Generation
- Indexing & Slicing Arrays
- Array Operations
- Filtering Arrays & Modifying Array Values
- The Where Function
- Array Aggregation
- Array Functions
- Sorting Arrays
- Vectorization
- Broadcasting
-
Data Visualisation with Matplotlib
- What is data visualisation
- What is Matplotlib
- Getting started with Matplotlib
- Line plot using Matplotlib
- Customising the plot
- Some basic types of plots in matplotlib
- Export the plot into a file