7.1.6. What are outliers in the data?
The data set of N = 90 ordered observations as shown below is examined for outliers: The computations are as follows: Median = (n+1)/2 largest data point = the average of the 45th and 46th ordered points = (559 + 560)/2 = 559.5 Lower quartile = .25(N+1)th ordered point = 22.75th ordered point = 411 + .75(436-411) = 429.75 Upper quartile = .75(N+1)th ordered point = 68.25th ordered point = 739 +.25(752-739) = 742.25 Interquartile range = 742.25 - 429.75 = 312.5 Lower inner fence = 429.75 - 1.5 (312.5) = -39.0 Upper inner fence = 742.25 + 1.5 (312.5) = 1211.0 Lower outer fence = 429.75 - 3.0 (312.5) = -507.75 Upper outer fence = 742.25 + 3.0 (312.5) = 1679.75 From an examination of the fence points and the data, one point (1441) exceeds the upper inner fence and stands out as a mild outlier; there are no extreme outliers.
Quartiles
Quartiles are the values that divide a list of numbers into quarters. First put the list of numbers in order Then cut the list into four equal parts The Quartiles are at the "cuts" Like this: Example: 5, 8, 4, 4, 6, 3, 8 Put them in order: 3, 4, 4, 5, 6, 8, 8 Cut the list into quarters: And the result is: Quartile 1 (Q1) = 4 Quartile 2 (Q2), which is also the Median, = 5 Quartile 3 (Q3) = 8 Sometimes a "cut" is between two numbers ... the Quartile is the average of the two numbers. Example: 1, 3, 3, 4, 5, 6, 6, 7, 8, 8 The numbers are already in order In this case Quartile 2 is half way between 5 and 6: Quartile 1 (Q1) = 3 Quartile 2 (Q2) = 5.5 Quartile 3 (Q3) = 7 Interquartile Range The "Interquartile Range" is from Q1 to Q3: To calculate it just subtract Quartile 1 from Quartile 3, like this: Example: The Interquartile Range is: Box and Whisker Plot You can show all the important values in a "Box and Whisker Plot", like this: A final example covering everything: Put them in order: Cut it into quarters:
Site inter-établissements -
Cookies et vie privée Utilisation des cookies Qu'est-ce qu'un cookie ?Un « cookie » est un fichier texte susceptible d'être déposé sur votre terminal à l'occasion de votre visite de notre site. Il a pour but de collecter des informations anonymes relatives à votre navigation et de vous adresser des contenus adaptés. Plus d'information sur les cookies sur le site de la CNIL. Conformément aux dispositions de la loi du 6 janvier 1978 relative aux fichiers, à l'informatique et aux libertés, vous disposez d'un droit d'accès, de rectification et d'opposition aux données personnelles vous concernant.
Standard Deviation
The standard deviation of a probability distribution is defined as the square root of the variance where is the mean, is the second raw moment, and denotes the expectation value of . is therefore equal to the second central moment (i.e., moment about the mean), The square root of the sample variance of a set of values is the sample standard deviation The sample standard deviation distribution is a slightly complicated, though well-studied and well-understood, function. However, consistent with widespread inconsistent and ambiguous terminology, the square root of the bias-corrected variance is sometimes also known as the standard deviation, of a list of data is implemented as StandardDeviation[list]. Physical scientists often use the term root-mean-square as a synonym for standard deviation when they refer to the square root of the mean squared deviation of a quantity from a given baseline. , and To find the standard deviation range corresponding to a given confidence interval, solve (5) for , giving
How to Read and Use a Box-and-Whisker Plot
The box-and-whisker plot is an exploratory graphic, created by John W. Tukey, used to show the distribution of a dataset (at a glance). Think of the type of data you might use a histogram with, and the box-and-whisker (or box plot, for short) could probably be useful. The box plot, although very useful, seems to get lost in areas outside of Statistics, but I’m not sure why. It could be that people don’t know about it or maybe are clueless on how to interpret it. Reading a Box-and-Whisker Plot Let’s say we ask 2,852 people (and they miraculously all respond) how many hamburgers they’ve consumed in the past week. Take the top 50% of the group (1,426) who ate more hamburgers; they are represented by everything above the median (the white line). Find Skews in the Data The box-and-whisker of course shows you more than just four split groups. Want to learn more about making data graphics?
Normal Distribution
Data can be "distributed" (spread out) in different ways. But there are many cases where the data tends to be around a central value with no bias left or right, and it gets close to a "Normal Distribution" like this: A Normal Distribution The "Bell Curve" is a Normal Distribution. Many things closely follow a Normal Distribution: heights of people size of things produced by machines errors in measurements blood pressure marks on a test We say the data is "normally distributed". Quincunx Standard Deviations The Standard Deviation is a measure of how spread out numbers are (read that page for details on how to calculate it). When you calculate the standard deviation of your data, you will find that (generally): Example: 95% of students at school are between 1.1m and 1.7m tall. Assuming this data is normally distributed can you calculate the mean and standard deviation? The mean is halfway between 1.1m and 1.7m: Mean = (1.1m + 1.7m) / 2 = 1.4m Standard Scores How far is 1.85 from the mean?
Se former Lettres lycée - Les outils TICE pour les Lettres et les LCA
Les incontournables en Lettres 1. Le pad est le nom anglais donné aux espaces collaboratifs.Ces espaces permettent d'écrire à plusieurs mains et favorisent un vrai travail de groupe, qui peut se poursuivre à distance. Framapad (version française) et pirate pad (version anglaise) sont des éditeurs de textes collaboratifs qui permettent de réaliser un travail à la fois collectif et simultané en salle multimédia. 2. L'application sur internet : TagxedoUn tutoriel de l'académie de Grenoble : tutoriel tagxedo 3. Il permet de créer très simplement des livrets multimédias et interactifs, seul ou avec des élèves. 4. 5. 6. A noter que ces incontournables sont aujourd'hui intégrés aux ENT (etherpad, baladodiffusion, moodle ...) Pour aller plus loin : répertoire des outils numériques pour les Lettres Le répertoire des outils 2.0 en français (portail Skoden) réunit l'ensemble des logiciels et applications qui peuvent intéresser les lettres. Le guide des indipensables du Web 2015 : Français
Excel Box and Whisker Diagrams (Box Plots)
Box and Whisker Charts (Box Plots) are commonly used in the display of statistical analyses. Microsoft Excel does not have a built in Box and Whisker chart type, but you can create your own custom Box and Whisker charts, using stacked bar or column charts and error bars. This tutorial shows how to make box plots, in vertical or horizontal orientations, in all modern versions of Excel. In its simplest form, the box and whisker diagram has a box showing the range from first to third quartiles, and the median divides this large box, the “interquartile range”, into two boxes, for the second and third quartiles. Sample Data and Calculations To play along at home in Excel 2007 or 2010, download the workbook Excel_2007_Box_Plot_Workbook.xlsx. Let’s use the following simple data set for our tutorial. All of these values are positive. First, insert a bunch of blank rows, and set up a range for calculations. First, compute some simple statistics, such as the count, mean, and standard deviation.