HomeCoursesMonitoring, Evaluation, Research & Statistics Training CoursesStatistical Data Analysis with R Training Course
Monitoring, Evaluation, Research & Statistics Training CoursesCape Town, Johannesburg, Dar es Salaam +10 more

Statistical Data Analysis with R Training Course

Five-day hands-on R training covering data wrangling, descriptive and inferential statistics, regression, visualisation and reusable scripts.

5 Days

Duration

Certificate

Included

Instructor-Led

Delivery

Foundation → Intermediate

Level

Reserve My Seat
SD

Statistical Data Analysis with R Training Course

Starting From

$750

per participant

See Upcoming Dates

Flexible Delivery

In-Person, Live Online

Language

English

Dedicated Support

Pre & post training

Share this course

Course Overview

This five-day, hands-on course builds the skill to analyse data in R from the ground up, from importing and wrangling data to modelling and visualisation. It covers the R and RStudio environment, data wrangling, descriptive and inferential statistics, regression, and data visualisation. Participants leave able to analyse data in R, write clear scripts, and produce high quality charts, with no prior programming required.

Introduction

R has become one of the most powerful and popular tools for data analysis anywhere, and it is free. It is used across research, official statistics, data science and evaluation, and its visualisation capabilities are second to none. Its one barrier is that it is code based, which can feel daunting to those who have never programmed. This course removes that barrier.

It builds R from the ground up. It covers the R and RStudio environment, data wrangling with the modern tidyverse approach, descriptive and inferential statistics, regression for modelling, and the production of high quality visualisations. No prior programming is assumed; the course teaches the code patiently, building from first commands to a complete analysis. Participants leave able to import and wrangle data in R, run and interpret the main analyses, produce excellent charts, and write clear, reusable scripts. The course is hands-on, with every participant writing and running R code throughout.

Learning Objectives

By the end of the course, participants will be able to:

  • Navigate the R and RStudio environment confidently and run code efficiently
  • Understand core R objects, data structures and packages relevant to applied data analysis
  • Import data from common file formats and prepare it for analysis in R
  • Clean, transform, recode and manage variables using a structured workflow
  • Use modern data-wrangling techniques to filter, group, summarise and reshape datasets
  • Produce descriptive statistics and exploratory summaries that reveal the structure of the data
  • Conduct and interpret common inferential techniques including t-tests, ANOVA, chi-square tests and correlation
  • Build and interpret simple linear, multiple linear and introductory logistic regression models
  • Produce clear and informative data visualisations using modern R tools
  • Write clean, reusable and well-documented scripts that support reproducible analysis
  • Communicate findings accurately in tables, charts and narrative summaries

Who Should Attend

This course is suitable for professionals who want to analyse and visualise quantitative data in R, including:

  • Researchers, research assistants and academic staff
  • Monitoring and evaluation officers and programme analysts
  • Data analysts and aspiring data scientists
  • Public health, epidemiology and social science professionals
  • Government, NGO and development sector staff working with data
  • Statistics officers and information management personnel
  • Consultants and professionals seeking a free, scalable analytical tool
  • Anyone moving from spreadsheet analysis to code-based statistical work

Training Methodology

The course is delivered in a hands-on computer lab format. Each technique is demonstrated and then coded immediately by participants on real and realistic datasets, building scripts from first commands upward. The focus is on doing and understanding, and a daily practical session takes an analysis from data to findings.

•Live demonstration and guided coding in R and RStudio

•Hands-on analysis of real and realistic datasets

•Building scripts from first commands upward

•Step by step interpretation of output

•A daily practical session and a final analysis project

Organizational Impact

Organisations whose staff complete this course can expect:

  • Stronger internal capability for data analysis and visualisation at no software licence cost
  • More reproducible and transparent analytical workflows
  • Better quality evidence for research, reporting, programme management and decision-making
  • Improved ability to clean, analyse and interpret existing datasets in-house
  • Reduced dependence on external analysts for routine statistical work
  • Stronger data visualisation and communication of findings
  • A scalable analytical capability that can support more advanced data work over time
  • A more technically capable team able to work confidently in a modern code-based environment

Personal Impact

Participants completing this course will gain:

  • Confidence in using R as a practical analytical tool rather than an intimidating programming language
  • The ability to build reusable scripts for data cleaning, analysis and visualisation
  • Stronger data wrangling capability and a more disciplined approach to analysis
  • Greater confidence in interpreting statistical results and explaining them clearly
  • A valuable and transferable skill in one of the world’s most widely used analytical tools
  • Improved ability to produce publication-quality charts and visual outputs
  • A scalable analytical workflow that can grow into more advanced modelling, reporting and research work
  • Greater independence in handling data-intensive tasks without relying on proprietary software

Course Outline

  • Why R for modern data analysis
  • The R and RStudio environment
  • Running code and understanding the logic of scripts
  • Objects, values and vectors
  • Data types and common structures in R
  • Functions, packages and working efficiently
  • Writing clean code and avoiding common beginner errors

Practical session: Write and run a first R script, create simple objects, inspect data structures and become comfortable working in RStudio.

  • Data frames, tibbles and tidy analytical structure
  • Importing data from Excel, CSV and other common sources
  • The logic of data wrangling in R
  • Selecting, filtering and arranging data
  • Creating, recoding and transforming variables
  • Handling missing values and inconsistent data
  • Grouping, summarising and reshaping datasets

Practical session: Import a raw dataset into R, clean and transform variables, handle missing data and create an analysis-ready dataset using a structured script.

  • Why descriptive analysis comes first
  • Summarising variables numerically
  • Frequency tables and categorical summaries
  • Cross-tabulations and subgroup analysis
  • Exploring distributions and unusual values
  • Introduction to data visualisation for exploration
  • Turning descriptive output into analytical interpretation

Practical session: Produce a descriptive statistical and visual summary of a dataset in R, compare subgroups and identify key patterns and data issues.

  • From sample evidence to statistical inference
  • Hypothesis testing and statistical significance
  • t-tests for comparing means
  • Analysis of variance for comparing multiple groups
  • Chi-square tests for categorical association
  • Correlation analysis
  • Choosing the right test for the question

Practical session: Use R to run and interpret t-tests, ANOVA, chi-square tests and correlation analysis on a realistic dataset.

  • The role of regression in applied analysis
  • Simple linear regression
  • Multiple regression for more realistic analysis
  • Introduction to logistic regression
  • Checking and interpreting models in practice
  • Visualisation with ggplot2 and modern R graphics
  • Course synthesis, script organisation and final analysis project

Practical session: Build a regression model in R, create a polished visualisation, and produce a short scripted analysis workflow from data import to reporting.

Certification

Certificate of Completion awarded on successful programme conclusion

At Strategic Revenue Africa, our certification goes beyond proof of attendance—it represents practical competence and measurable capability. Upon successful completion of our training programs, participants are awarded a Certificate of Completion from Strategic Revenue Africa, recognizing their ability to apply acquired knowledge in real-world settings. As an organization focused on architecting sustainable revenue and strengthening organizational performance, our certifications signal that participants are equipped with skills that drive results, not just theory.

Programme Inclusions

  • Course materials & workbook
  • Certificate of completion
  • Post-training support (6 months)

Prerequisites

Basic computer literacy is required. No prior programming, R or advanced statistics experience is assumed. The course is designed for complete beginners as well as for participants who have some exposure to R but want a more structured and applied understanding of how to use it for real analysis. A basic familiarity with data, spreadsheets, survey results or research datasets will be helpful but is not essential. Participants should have access to R and RStudio during the course; both are free to install and use.

Schedule & Investment

Upcoming Dates & Fees

UAE

Dubai

UAE

Schedule

Mon – Fri · 5 Days

Investment

$3,900

Language

English

Register — 8 Dates
Uganda

Kampala

Uganda

Schedule

Mon – Fri · 5 Days

Investment

$1,800

Language

English

Register — 8 Dates
South Africa

Johannesburg

South Africa

Schedule

Mon – Fri · 5 Days

Investment

$3,500

Language

English

Register — 8 Dates
Kenya

Kisumu

Kenya

Schedule

Mon – Fri · 5 Days

Investment

$1,400

Language

English

Register — 8 Dates
Kenya

Nakuru

Kenya

Schedule

Mon – Fri · 5 Days

Investment

$1,400KES 97,000

Language

English

Register — 8 Dates
Tanzania

Dar es Salaam

Tanzania

Schedule

Mon – Fri · 5 Days

Investment

$1,900

Language

English

Register — 8 Dates
Kenya

Naivasha

Kenya

Schedule

Mon – Fri · 5 Days

Investment

$1,500KES 98,000

Language

English

Register — 8 Dates

Accommodation & Transfer

Accommodation and airport transfer are arranged upon request. Contact the Training Officer to reserve.

Payment

Transfer payment to the Strategic Revenue Africa account before the course starts. Send proof of payment to:

[email protected]

Course Fee Includes

  • Course tuition & training materials
  • Two break refreshments and lunch
  • Certificate of completion
  • Post-training support (6 months)

Travel, visa, insurance and personal expenses are the participant's responsibility.

Frequently Asked Questions

About Statistical Data Analysis with R Training Course

  • No. The course assumes no prior programming experience and is designed to take participants from the very beginning of working in R through to meaningful data analysis, visualisation and scripting. It is suitable for complete beginners.

  • Yes. R is free and open source, and RStudio is also available without licence cost. This makes R particularly attractive for organisations that want strong analytical capability without paying for proprietary statistical software.

  • It teaches both. Participants learn the practical coding required to work in R, but the course also covers statistical reasoning, test selection, regression interpretation and the communication of findings. The goal is not simply to write code, but to perform sound and useful analysis.

  • Yes. Data wrangling is a major component of the course. Participants learn how to import data, transform variables, handle missing values, filter and group data, and create an analysis-ready dataset using a structured and reusable script.

  • The course covers descriptive statistics, cross-tabulations, t-tests, ANOVA, chi-square tests, correlation, simple linear regression, multiple regression and introductory logistic regression, alongside practical visualisation and reporting.

  • Yes. The course uses a modern R workflow that includes widely used tidyverse tools for data wrangling and analysis, while also ensuring participants understand the broader logic of working in R.

  • Very much so. R is widely used in research and increasingly across M&E, public health, official statistics and policy analysis. The course is designed around realistic analytical tasks drawn from these environments.

  • R and Python are the two leading code-based tools for modern data work. R has especially deep roots in statistics, quantitative analysis and visualisation, while Python is broader as a general-purpose programming language and is heavily used in data science and automation. Both are powerful; the right choice depends on the team, the work and the longer-term analytical direction.

From

$750

Register