The statistical analysis techniques taught here form the foundation of any analytics or data science practice.
This self-paced class is an excellent hands-on walkthrough of the most foundational techniques used to understand and ask questions of datasets. The statistical techniques taught in this workshop underpin virtually every common method for analyzing data. They also provide a practical, useful introduction to contemporary data science techniques.
This program comprises roughly five hours of expert-led content, with basic assignments and exercises you can participate in between chapters.
- Upcoming Dates and Locations
Guaranteed To Run
- Course Outline
Part 1: Statistics and Statistical Terminology
- What are statistics?
- What is a statistical population?
Part 2: Sampling; Descriptive vs. Inferential Statistics
- What is sampling?
- Population vs. sample
- When do we sample?
Part 3 – Sampling and Statistical Bias
- Random sampling
- What is bias?
- Why we should avoid bias in samples
Part 4: Measures of Central Tendency & Arithmetic Mean
- What are the Measures of Central Tendency
- Arithmetic mean
Part 5: Geometric Mean and Harmonic Mean
- Geometric mean
- Harmonic mean
Part 6: Median
- What is median
Part 7: Mode and Midrange
- What is mode
- What is midrange
- Review and Exercises
Part 8: Measures of Dispersion
- What are the measures of dispersion
- Mean absolute difference
Part 9: Measures of Dispersion
- Standard deviation
Part 10: Some Examples
Part 11: IQR (Interquartile range) and Outliers
- What is IQR
- 1.5 * IQR rule
Part 12: Distributions
- What is distribution
- Probability mass function and cumulative probability
Part 13: Working with Distributions
Part 14: Different Types of Distributions
- Discrete distributions vs. continuous distributions
- Introduction to Monte-Carlo simulations
Part 15: Continuous and Parametric Distributions
- Continuous distributions
- Parametric distributions
- Normal distribution
Part 16: Inferential Statistics
- When we use inferential statistics
- Sample mean and sample variance
- Central Limit Theorem
- Estimation of population mean and confidence interval
Part 17: Hypothesis Testing for the Population Mean
- Estimation of population mean confidence interval
- What is hypothesis testing
- How to run a two-tailed test for a population mean
Part 18: Difference Between Two Means
- Problem formulation
- Hypothesis testing
- Who should attend
- Data Analysts
- Any data practitioner needing a statistics refresher
- Business Analysts and Project Managers who query data
- Data and sytem architects
- Application developers who wish to integrate data processes
- Analysts who want to grow their basic data science skills
You will need a computer to view and participate in the class.