Sunday, March 11, 2018

Chapter 37.4 - Median of the Data

In the previous section we completed a discussion on mode. We also saw some solved examples. In this section we will discuss about median.
We have seen some basics about median in part II (Details here). 
It is the middle most observation in a data. We saw the following steps:
• Arrange all data values of the observations in ascending order
• Let the total number of observations be 'n'
• If n is odd, the median is the observation at the position given by: (n+1)2
• If n is even, the median is the average of the following two observations:
    ♦ observation at the (n2)th position
    ♦ observation at the (n+ 1)th position

Let us see a new example:
Example 8:
The following table 37.34 shows the marks (out of 50) obtained by 100 students in a test
Table.37.34
We have to find the median mark
Solution:
• How does this problem differ from the previous 'problems on median' that we saw in part II?
Ans: In this problem, each mark has a frequency. Because, each mark (except 43) occur more than once
    ♦ 20 occurs 6 times
    ♦ 29 occurs 28 times
So on . . .
• Then how can we arrange the marks in ascending or descending order?
Ans: We will develop a new method for such problems. The steps are as follows:
1. Arrange the data in ascending order, without considering the frequencies. This is shown in table 37.35 below:
Table.37.35
2. In this problem, n = 100. It is even. So median will be the average of the following two observations:
    ♦ observation at the (n2)th position
    ♦ observation at the (n+ 1)th position
3. So we want these:
• observation at the (n2)th position
    ♦ That is., the observation at the 50th position 
• observation at the (n+ 1) position. 
    ♦ That is., the observation at the 51st position 
4. Obviously, we cannot obtain them directly from the table 37.34 given in the question or from the table 37.35 above. We will use a special method:
• The data in table 37.35 is arranged in ascending order. 
Based on the table, let us try to answer some questions:
(i) How many students obtained at least 20 marks?
Ans: Considering the second row of the table, we can easily write:
• No. of students who obtained at least 20 marks is 6
(ii) How many students obtained at least 25 marks?
Ans: The number of students who obtained 20 marks will also be included when we consider 'at least' 25.
• So the no. of students who obtained at least 25 marks is (6+20) = 26
(iii) How many students obtained at least 28 marks?
Ans: The number of students who obtained at least 25 marks will also be included when we consider 'at least' 28.
• So the no. of students who obtained at least 28 marks is (26+24) = 50
5. We can go on answering like this till we reach the end of the table. But once we understand the pattern, we can quickly fill up the values
• This new information can be written as cumulative frequency. We will add it to the existing table. The new table 37.36 is given below:
Table.37.36
• Note how easy it is to fill up the 'cumulative frequency column':
To fill up any one row in the 'cumulative frequency column', do these steps:
(i) Take the value coming in the same row in the 'frequency column'
(ii) Take the value coming in the row just above in the 'cumulative frequency column'
(iii) Add (i) and (ii)
6. Now, if we write all the observations in a single line, it will look as in the table.37.37 below:
Table.37.37
• The first 6 observations will be all 20
• The 7th, 8th, 9th, . . . , 26th observations will all be 25
• The 27th, 28th, 29th, . . . , 50th observations will all be 28
so on . . . 
7. So the 50th observation is 28
And the 51st observation is 29
• These are the observations that we want.
• But we do not need to form a table like the table 37.37 above. We will be able to find them easily from the 'cumulative frequency column' in table 37.36
8. Thus from step (3) above, we get:
Median = (28+29)57= 28.5
9. So we can write a conclusion:
• If the 100 marks are arranged in ascending order, 28.5 will come in the middle
• So about 50 students obtained marks less than 28.5
• The other 50 students obtained marks more than 28.5
■ The table 37.36 above is called the Cumulative frequency table


The example above shows the procedure to find median of observations in an 'ungrouped frequency distribution table'
So next we will see the procedure for 'grouped frequency distribution tables':
Example 9:
The following table 37.38 shows the marks (out of 50) obtained by 53 students in a test
Table.37.38
We have to find the median mark
Solution:
We will write the steps:
1. Based on the table 37.38, let us try to answer some questions:
(i) How many students have obtained marks less than 10?
Ans: Considering the class '0 -10', we can say:
• 5 students obtained marks less than 10
• Note that, we use 'less than' in this problem
• In the previous problem it was 'at least'
    ♦ If we use 'at least' in this problem, the mark '10' will be included
    ♦ We know that, in the class '0 - 10', the mark '10' will not be considered. 
    ♦ It will be considered only in the next higher class, which in this problem is '10 - 20'
(ii) How many students have obtained marks less than 20?
Ans: The number of students who obtained marks less than 10 will also be included when we consider 'less than' 20.
• So the no. of students who obtained marks less than 20 is (5+3) = 8
(iii) How many students have obtained marks less than 30?
Ans: The number of students who obtained marks less than 20 will also be included when we consider 'less than' 30.
• So the no. of students who obtained marks less than 30 is (8+4) = 12
• We can go on answering like this till we reach the end of the table. But once we understand the pattern, we can quickly fill up the values
• This new information can be written as cumulative frequency. We will add it to the existing table. The new table 37.39 is given below:
Table.37.39
• Note how easy it is to fill up the 'cumulative frequency column':
To fill up any one row in the 'cumulative frequency column', do these steps:
(i) Take the value coming in the same row in the 'frequency column'
(ii) Take the value coming in the row just above in the 'cumulative frequency column'
(iii) Add (i) and (ii)
■ The distribution given above is called the cumulative frequency distribution of the less than type.
• Here 10, 20, 30, . . . 100, are the upper limits of the respective class intervals.
2. So let us see what 'cumulative frequency distribution of the more than type' is:
(i) How many students have obtained marks more than 0?
Ans: We can say that all the 53 students obtained marks more than 0
(ii) How many students have obtained marks more than 10?
Ans: Considering the class '0 -10', we can say:
• 5 students obtained marks less than 10
So the number of students who obtained marks more than 10 is (53-5) = 48
(iii) How many students have obtained marks more than 20?
Ans: From (i) we have: Number of students who obtained marks more than 10 = 48
Considering the class '10 - 20', we can say:
• 3 students among the above 48, obtained marks less than 20
• So the number of students who obtained marks more than 20 is (48-3) = 45
(iv) How many students have obtained marks more than 30?
Ans: From (ii) we have: Number of students who obtained marks more than 20 = 45
Considering the class '20 - 30', we can say:
• 4 students among the above 45, obtained marks less than 30
• So the number of students who obtained marks more than 30 is (45-4) = 41
• We can go on answering like this till we reach the end of the table. But once we understand the pattern, we can quickly fill up the values
• This new information can be written as cumulative frequency. We will add it to the existing table. The new table 37.40 is given below:
Table.37.40
• Note how easy it is to fill up the 'cumulative frequency column':
To fill up any one row in the 'cumulative frequency column', do these steps:
(i) Take the value coming in the same row in the 'frequency column'
(ii) Take the value coming in the row just above in the 'cumulative frequency column'
(iii) Subtract (i) from (ii)
■ The distribution given above is called the cumulative frequency distribution of the more than type.
• Here 10, 20, 30, . . . 90, are the lower limits of the respective class intervals.
3. So we have obtained two types of cumulative frequency distribution tables. We can use any one of them to find the median. Let us try:
• We will use the less than type. So we will use the table 37.39. The steps are as follows:
(i) In this problem, the total number of observations 'n' = 53. This is an odd number.
So (n+1)2 54= 27
• In the previous example 8, we could easily find the observation corresponding to n2. This is because, the data given to us in that question is an 'ungrouped frequency distribution table' 
• In such a table, we can find the position of each observation (when arranged in ascending order) from the 'cumulative frequency column'
• But in our present problem, we are given a 'grouped frequency distribution table'. So the actual observations are hidden inside various groups. Even then we will try.
• Since the number of observations is 53, which is odd, the middle observation is the observation at the 27th position (when arranged in ascending order)
• So let us try to find the observation at the 27th position. We will try to find it from table 37.39 above. We will travel through the table from top to bottom and examine each class carefully.
• Any 'grouped frequency distribution table' will already have an 'ascending order arrangement' because, the classes are in the increasing order. In our problem, it is: 0-10, 10-20, 20-30 so no...
• Now, the first 5 observations go inside the first class interval 0-10
    ♦ That means, the 5th observation goes inside the first class interval
    ♦ We are not concerned because, we need the 27th observation. Not the 5th.
• The first 8 observations go inside the first and second class intervals 0-10 and 10-20
    ♦ That means, the 8th observation goes inside the first and second classes taken together
    ♦ We are not concerned because, we need the 27th observation. Not the 8th.
• The first 12 observations go inside the first, second and third class intervals 0-10, 10-20 and 20-30
    ♦ That means, the 12th observation goes inside the first, second and third classes taken together
    ♦ We are not concerned because, we need the 27th observation. Not the 12th.
• Continuing in this way, we get:
    ♦ 22nd observation falls within 50-60
    ♦ 29th observation falls within 60-70
• There is no sign of 27. 
• The 27th observation is some where in between the 22nd and 29th.
    ♦ But there are no classes in between 50-60 and 60-70
• So the 27th observation has to be within one among the two: 50-60 or 60-70
But which one?
• 50-60 cannot take in any one higher than 22
• So it has to be within the 60-70
■ So we can conclude:
The 27th observation is inside the 60-70 class

To find the appropriate class, we do not need a detailed analysis as given above. Once we understand the basics, we can straight away take out the class whose cumulative frequency is closest to and just greater than the 'position we are seeking'.
• In our present problem, we are seeking the 27th position
• The cumulative frequency closest to and just greater than 27 is 29
• So take out 60-70

• The class thus taken out is called the median class
4. Our median class contains 7 observations. 
• They may be any value equal to or greater than 60 and less than 70
• The median is one of those 7 observations. Our next aim is to find it
■ The median is calculated using the following formula:
Where:
l = lower limit of the median class
n = number of observations
cf = cumulative frequency of the class preceding the median class
f = frequency of the median class
h = width of the class interval (assuming all classes are of the same width)
• Substituting the values, we get:
Numerator = (n- cf) = (53- 22)  = (26.5-22) = 4.5
Thus median = 60 + (4.57)×10 = 60 + 6.4 = 66.4
We can write the conclusion:
If we arrange all the 53 marks in ascending order, the mark 66.4 will come in the middle. 
• 27 marks will lie on the left of 66.4
    ♦ All those marks will be less than 66.4
• 27 marks will lie on the right of 66.4
    ♦ All those marks will be greater than 66.4

Example 10:
A survey regarding the heights (in cm) of 51 girls of Class X of a school was conducted and the following data was obtained:
Table.37.41
Find the median height
Solution:
1. The given data is in the form of a cumulative frequency table. 
We have to convert it into a 'grouped frequency distribution table'. Let us write the steps:
 No. of heights less than 140 is 4
• No. of heights less than 145 is 11
• This 11 will include the heights less than 140 also
2. From the above three information, we can find the no. of heights satisfying the following two conditions:
    ♦ No. of heights greater than or equal to 140
    ♦ No. of heights less than 145
• Obviously, the no. of heights satisfying the two conditions is: (11-4) = 7
• So 7 heights satisfy the condition: 140 ≤ height < 145
• Those 7 heights will obviously fall in the class: 140-145
• In other words, the frequency of the class 140-145 is 7
3. So we separated out 7 heights from 11
What about the remaining 4?
• They satisfy the condition: 'height < 140'
• Those 4 heights will obviously fall in the class: k-140
    ♦ Where 'k' is any number less than 140. We will give it a value 135
• So the frequency of the class 135-140 is 4
4. So we found two classes: 
    ♦ class 135-140 with frequency 4
    ♦ class 140-145 with frequency 7
Now we consider the next cumulative frequency, which is 29:
 No. of heights less than 145 is 11
• No. of heights less than 150 is 29
• This 29 will include the heights less than 145 also
• From the above three information, we can find the no. of heights satisfying the following two conditions:
    ♦ No. of heights greater than or equal to 145
    ♦ No. of heights less than 150
• Obviously, the no. of heights satisfying the two conditions is: (29-11) = 18
• So 18 heights satisfy the condition: 145 ≤ height < 150
• Those 18 heights will obviously fall in the class: 145-150
• In other words, the frequency of the class 145-150 is 18
5. Thus so far, we found three classes: 
    ♦ class 135-140 with frequency 4
    ♦ class 140-145 with frequency 7
    ♦ class 145-150 with frequency 18
Now we consider the next cumulative frequency, which is 40:
 No. of heights less than 150 is 29
• No. of heights less than 155 is 40
• This 40 will include the heights less than 150 also
• From the above three information, we can find the no. of heights satisfying the following two conditions:
    ♦ No. of heights greater than or equal to 150
    ♦ No. of heights less than 155
• Obviously, the no. of heights satisfying the two conditions is: (40-29) = 11
• So 11 heights satisfy the condition: 150 ≤ height < 155
• Those 11 heights will obviously fall in the class: 150-155
• In other words, the frequency of the class 150-155 is 11
 Thus so far, we found four classes: 
    ♦ class 135-140 with frequency 4
    ♦ class 140-145 with frequency 7
    ♦ class 145-150 with frequency 18
    ♦ class 150-155 with frequency 11
■ We can go on like this until we reach the end of the table. However, once we understand the pattern, the frequencies of each class can be easily calculated. The resulting table is shown below:
Table.37.42
• Note how easy it is to fill up the 'frequency column':
To fill up any one row in the 'frequency column', do these steps:
(i) Take the value coming in the same row in the 'cumulative frequency column'
(ii) Take the value coming in the row just above in the 'cumulative frequency column'
(iii) Subtract (ii) from (i)
6. Now we can begin the calculations for the median:
• In this problem, the total number of observations 'n' = 51. This is an odd number.
So (n+1)2 52= 26
• The cumulative frequency closest to 26 and just greater than 26 is 29
• The class corresponding to the cumulative frequency 29 is 145-150
• So 145-150 is the median class. It's frequency = 18
■ The median is calculated using the following formula:

Where:
l = lower limit of the median class = 145
n = number of observations = 51
cf = cumulative frequency of the class preceding the median class = 11
f = frequency of the median class = 18
h = width of the class interval (assuming all classes are of the same width) = 5
• Substituting the values, we get:
• Numerator = (n- cf) = (51- 11)  = (25.5-11) = 14.5
• Thus median = 145 + (14.518)×5 = 145 + 4.03 = 149.03
7. We can write the conclusion:
If all the 51 girls stand in ascending order of heights, the girl with height 149.03 will come in the middle. 
• 25 girls will stand on the left of this girl
    ♦ Heights of all those girls will be less than 149.03 cm
• 25 girls will stand on the right of this girl
    ♦ Heights of all those girls will be greater than 149.03 cm



In the next section, we will see another example.


PREVIOUS      CONTENTS       NEXT

 

                        Copyright©2018 High school Maths lessons. blogspot.in - All Rights Reserved

No comments:

Post a Comment