The implications of non-parametric distributions: what does it mean when your data's not normal?

Steven Ford, Gengyuan Miao

Research output: Contribution to conferencePoster

14 Downloads (Pure)

Abstract

Introduction Many scientists use mean (µ) and standard deviation (σ) to describe the centre point and ‘range’ of a measured set of data. However, this is only strictly true when dealing with normally distributed data. How well does µ and σ represent other (non-normal) distributions, and how many datapoints are ‘outside’ the range defined by µ and σ? This is a key question for manufacturing (ie tablet production).Methods Several different types of distribution were modelled in Excel (see left figure). The number of datapoints outside the modelled range (the range is given by µ ± Aσ, where A = 0.1,0.2,…, 3.0) is compared to the expected number of datapoints outside the same range in a normal distribution, producing a ratio (right figure).105 and 106 datapoints are required to reduce error at A values > 1.5.Results These results suggest that considering µ ± 2σ is a better guide to underestimate the number datapoints outside that range than µ ± σ.
Original languageEnglish
Publication statusPublished - 2 Mar 2021
Event2021 #RSCPoster Twitter Conference - Virtual
Duration: 2 Mar 20213 Mar 2021
https://www.rsc.org/our-events/rsc-poster/

Conference

Conference2021 #RSCPoster Twitter Conference
Period2/03/213/03/21
Internet address

Keywords

  • datapoints
  • non-normal distributions
  • manufacturing

Fingerprint

Dive into the research topics of 'The implications of non-parametric distributions: what does it mean when your data's not normal?'. Together they form a unique fingerprint.

Cite this