Abstract
Introduction: Many scientists use the mean (µ) and standard deviation (σ) to describe the centre point and ‘range’ of a measured set of data. However, this description is only strictly valid for normally distributed data. How well do µ and σ represent other (non-normal) distributions, and how many datapoints lie ‘outside’ the range defined by µ and σ? This is a key question for manufacturing (e.g. tablet production).

Methods: Several different types of distribution were modelled in Excel (see left figure). The number of datapoints outside the modelled range (given by µ ± Aσ, where A = 0.1, 0.2, …, 3.0) is compared to the number of datapoints expected outside the same range in a normal distribution, producing a ratio (right figure). 10⁵ and 10⁶ datapoints are required to reduce error at A values > 1.5.

Results: These results suggest that considering µ ± 2σ is a better guide to underestimate the number of datapoints outside that range than µ ± σ.
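The comparison described in the Methods can be sketched in a few lines of Python. This is a minimal illustration and not the authors' Excel model: the uniform and exponential distributions, the sample size, the random seed, and the function names `fraction_outside`, `normal_fraction_outside` and `outside_ratio` are all assumptions chosen for the example, since the abstract does not list the distributions used. The expected normal fraction outside µ ± Aσ is taken from the standard identity erfc(A/√2).

```python
import math
import random


def fraction_outside(data, a):
    """Fraction of datapoints lying outside mean ± a * (population) std dev."""
    n = len(data)
    mu = math.fsum(data) / n
    sigma = math.sqrt(math.fsum((x - mu) ** 2 for x in data) / n)
    lo, hi = mu - a * sigma, mu + a * sigma
    return sum(1 for x in data if x < lo or x > hi) / n


def normal_fraction_outside(a):
    """Expected fraction outside mean ± a * sigma for a normal distribution."""
    return math.erfc(a / math.sqrt(2))


def outside_ratio(data, a):
    """Observed fraction outside the range divided by the normal expectation."""
    return fraction_outside(data, a) / normal_fraction_outside(a)


# Illustrative distributions only (assumed; not taken from the poster).
rng = random.Random(0)
n = 100_000
samples = {
    "normal": [rng.gauss(0.0, 1.0) for _ in range(n)],
    "uniform": [rng.random() for _ in range(n)],
    "exponential": [rng.expovariate(1.0) for _ in range(n)],
}

for name, data in samples.items():
    for a in (1.0, 2.0, 3.0):
        print(f"{name:12s} A={a:.1f} ratio={outside_ratio(data, a):.3f}")
```

A ratio above 1 means the distribution has more datapoints outside µ ± Aσ than a normal distribution would, and a ratio below 1 means fewer. At large A the expected count outside the range becomes small, so the ratio is noisy unless the sample is large, which is consistent with the sample sizes quoted in the Methods.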
Original language | English |
---|---|
Publication status | Published - 2 Mar 2021 |
Event | 2021 #RSCPoster Twitter Conference - Virtual. Duration: 2 Mar 2021 → 3 Mar 2021. https://www.rsc.org/our-events/rsc-poster/ |
Conference
Conference | 2021 #RSCPoster Twitter Conference |
---|---|
Period | 2/03/21 → 3/03/21 |
Internet address |
Keywords
- datapoints
- non-normal distributions
- manufacturing