Home CoursesMonitoring, Evaluation, Research & Statistics Training CoursesStatistical Data Analysis with Python Training Course

Monitoring, Evaluation, Research & Statistics Training CoursesJohannesburg, Cape Town, Zanzibar +8 more

Statistical Data Analysis with Python Training Course

Five-day hands-on Python data analysis training covering pandas, data wrangling, descriptive and inferential statistics, regression and visualisation.

5 Days

Duration

Certificate

Included

Instructor-Led

Delivery

Foundation → Intermediate

Level

Reserve My Seat

Statistical Data Analysis with Python Training Course

Starting From

$750

per participant

See Upcoming Dates

Flexible Delivery

In-Person, Live Online

Language

English

Dedicated Support

Pre & post training

Share this course

Course Overview

This five day, hands-on course builds the skill to analyse data in Python from the ground up, using pandas and the scientific Python ecosystem. It covers the Python environment, data wrangling with pandas, descriptive and inferential statistics, regression, and data visualisation. Participants leave able to analyse data in Python, write clear code, and produce strong charts, with no prior programming required.

Introduction

Python has become the dominant language of data science and is now widely used for data analysis far beyond it. It is free, general purpose, and supported by a powerful ecosystem of libraries for working with data, of which pandas is the centre. Learning to analyse data in Python opens a door not only to analysis but to automation, machine learning and much more. Its only barrier is that it is code based, and this course is built to get non programmers over it.

It builds Python for data analysis from the ground up. It covers the Python environment and the essentials of the language, data wrangling with pandas, descriptive and inferential statistics, regression for modelling, and data visualisation. No prior programming is assumed; the course teaches the code carefully, building from first lines to a complete analysis. Participants leave able to import and wrangle data with pandas, run and interpret the main analyses, produce clear visualisations, and write reusable code. The course is hands-on, with every participant writing and running Python throughout.

Learning Objectives

By the end of the course, participants will be able to:

Navigate the Python environment confidently for data analysis work
Understand core Python concepts needed for working with data
Import, inspect and prepare datasets using pandas
Clean, transform and manage variables for analysis
Handle missing data, reshape datasets and combine multiple data sources
Produce descriptive statistics and exploratory summaries in Python
Run and interpret common inferential techniques such as t-tests, ANOVA, chi-square tests and correlation
Build and interpret simple linear, multiple linear and introductory logistic regression models
Produce clear and informative visualisations for exploration and reporting
Write reusable notebooks or scripts that support reproducible analysis
Communicate findings accurately in tables, charts and narrative form

Who Should Attend

This course is designed for professionals who want to use Python for practical data analysis, including:

Data analysts and aspiring data scientists
Researchers and research assistants
Monitoring and evaluation officers and programme analysts
Public health, epidemiology and social science professionals
Government, NGO and development sector staff working with data
Statistics officers, information management staff and reporting personnel
Postgraduate students and academic researchers
Consultants and professionals seeking a scalable, code-based analytical tool
Anyone moving from spreadsheet-based analysis to Python-driven workflows

Training Methodology

The course is delivered in a hands-on computer lab format using notebooks. Each technique is demonstrated and then coded immediately by participants on real and realistic datasets, building code from first lines upward. The focus is on doing and understanding, and a daily practical session takes an analysis from data to findings.
•Live demonstration and guided coding in Python
•Hands-on analysis in notebooks with real datasets
•Building code from first lines upward
•Step by step interpretation of output
•A daily practical session and a final analysis project

Organizational Impact

Organisations whose staff complete this course can expect:

Stronger internal capability for modern, code-based data analysis
Better quality evidence for reporting, programme management and decision-making
Reduced dependence on manual spreadsheet processes and external analysts
Improved reproducibility and transparency in analytical work
Stronger use of data already being collected across programmes, operations and studies
Better visual communication of findings through charts and structured outputs
A foundation for more advanced analytics, automation and data science capability
A more technically capable team able to work confidently with scalable analytical tools

Personal Impact

Participants completing this course will gain:

Confidence in using Python as a practical tool for data analysis
The ability to build reusable analytical workflows in code
Stronger data wrangling capability with pandas
Better understanding of how to interpret statistical output in a code-based environment
A valuable and highly marketable skill applicable across data, research and analytics roles
Greater independence in handling datasets without relying on spreadsheet-only workflows
A foundation for future progression into automation, machine learning and broader data science
Stronger ability to present findings clearly through charts, summaries and model outputs

Course Outline

Why Python for data work
The Python environment and workflow
Running code and understanding Python syntax
Variables, data types and core structures
Functions, libraries and reusable code
Good coding habits for analysts
Getting help and troubleshooting errors

Practical session: Set up the Python environment, write and run first analytical code, create basic variables and work comfortably in a notebook or script.

Introducing pandas and DataFrames
Importing data from Excel, CSV and other sources
Inspecting and validating raw data
Selecting, filtering and sorting data
Creating, recoding and transforming variables
Handling missing values and inconsistent data
Grouping, aggregating and combining datasets

Practical session: Import a raw dataset into Python, clean and transform variables with pandas, handle missing data and create an analysis-ready file.

Why descriptive analysis comes first
Summarising numerical variables
Frequency distributions and categorical summaries
Cross-tabulations and subgroup analysis
Exploring distributions and unusual values
Visual exploration of data in Python
Interpreting descriptive output for decision-making

Practical session: Produce descriptive summaries, subgroup comparisons and exploratory charts in Python, then interpret the main patterns in the dataset.

From sample data to statistical inference
Hypothesis testing and statistical significance
t-tests for comparing means
Analysis of variance for comparing multiple groups
Chi-square tests for association between categorical variables
Correlation analysis
Choosing the right test for the question

Practical session: Use Python to run and interpret t-tests, ANOVA, chi-square tests and correlation analysis on a realistic dataset.

Why regression matters in applied data analysis
Simple linear regression
Multiple regression for more realistic analysis
Introduction to logistic regression
Interpreting and checking regression models
Visualisation with matplotlib and seaborn
Course synthesis and final analysis project

Practical session: Build a regression model in Python, create a clear visualisation and produce a short analytical summary from raw data to findings.

Certification

Certificate of Completion awarded on successful programme conclusion

At Strategic Revenue Africa, our certification goes beyond proof of attendance—it represents practical competence and measurable capability. Upon successful completion of our training programs, participants are awarded a Certificate of Completion from Strategic Revenue Africa, recognizing their ability to apply acquired knowledge in real-world settings. As an organization focused on architecting sustainable revenue and strengthening organizational performance, our certifications signal that participants are equipped with skills that drive results, not just theory.

Programme Inclusions

Course materials & workbook
Certificate of completion
Post-training support (6 months)

Prerequisites

Basic computer literacy is required. No prior programming, Python or advanced statistics experience is assumed. The course is suitable for complete beginners as well as for participants who have some exposure to Python but want a more structured and applied approach to using it for data analysis. A basic familiarity with data, spreadsheets, survey datasets or reporting tables will be helpful but is not essential. Participants should have access to Python during the course, typically through Anaconda, Jupyter Notebook, JupyterLab or another suitable environment.

Schedule & Investment

Upcoming Dates & Fees

LocationScheduleInvestmentLanguage

Dar es Salaam

Tanzania

Mon – Fri · 5 Days

Next: 6 Jul 2026

$1,900

per delegate

English

Kigali

Rwanda

Mon – Fri · 5 Days

Next: 6 Jul 2026

$1,800

per delegate

English

Zanzibar

Tanzania

Mon – Fri · 5 Days

Next: 13 Jul 2026

$2,100

per delegate

English

Naivasha

Kenya

Mon – Fri · 5 Days

Next: 13 Jul 2026

$1,500|KES 98,000

per delegate

English

Johannesburg

South Africa

Mon – Fri · 5 Days

Next: 20 Jul 2026

$3,500

per delegate

English

Dubai

UAE

Mon – Fri · 5 Days

Next: 20 Jul 2026

$3,900

per delegate

English

Mombasa

Kenya

Mon – Fri · 5 Days

Next: 20 Jul 2026

$1,500|KES 98,000

per delegate

English

Dar es Salaam

Tanzania

Schedule

Mon – Fri · 5 Days

Investment

$1,900

Language

English

Kigali

Rwanda

Schedule

Mon – Fri · 5 Days

Investment

$1,800

Language

English

Zanzibar

Tanzania

Schedule

Mon – Fri · 5 Days

Investment

$2,100

Language

English

Naivasha

Kenya

Schedule

Mon – Fri · 5 Days

Investment

$1,500KES 98,000

Language

English

Johannesburg

South Africa

Schedule

Mon – Fri · 5 Days

Investment

$3,500

Language

English

Dubai

UAE

Schedule

Mon – Fri · 5 Days

Investment

$3,900

Language

English

Mombasa

Kenya

Schedule

Mon – Fri · 5 Days

Investment

$1,500KES 98,000

Language

English

Accommodation & Transfer

Accommodation and airport transfer are arranged upon request. Contact the Training Officer to reserve.

[email protected]+254 799 289 473

Payment

Transfer payment to the Strategic Revenue Africa account before the course starts. Send proof of payment to:

[email protected]

Course Fee Includes

Course tuition & training materials
Two break refreshments and lunch
Certificate of completion
Post-training support (6 months)

Travel, visa, insurance and personal expenses are the participant's responsibility.

Frequently Asked Questions

About Statistical Data Analysis with Python Training Course

No. The course assumes no programming and builds Python from first lines, so beginners can follow while learning one of the most in demand skills in data.
Yes. Python and its data libraries are free and open source, with no software licence cost.
pandas is the core Python library for working with tabular data. A full day is devoted to wrangling data with it, since it is the heart of data analysis in Python.
Yes. Python is the dominant language of data science, and this course builds the foundation, though machine learning itself is beyond its scope.
Yes. The final day covers linear, multiple and logistic regression using statsmodels, with interpretation and model checking.
Yes. Python is increasingly used in research and evaluation as well as data science, and the course is built with that work in mind.
Yes. You leave with reusable code and the skills to keep building, and because Python is free there is no barrier to continuing.
R and Python are the two leading code based tools. Python is general purpose and the language of data science, while R has deep statistical roots and superb visualisation. Both are valuable.