Task 1 (10 marks)
A. Organize your sample data in a spreadsheet as per "Instructions" above.
(Students who failed to follow the instructions will not be marked and “0” mark will be
awarded to them)
B. What sampling method is used to select your sample data?
C. Do you think that is the best method of sampling? Why not? Why yes?
D. What is the best statistic used to compare the volatility in WEF, WI, and FS values?
Task 2 (10 marks)
Based on your sample data:
A. Develop the tabular form and graphical bar chart of WI based on the following classification:
1st Class = Very Poor
2nd Class = Poor
3rd Class = Moderate
4th Class = Rich
5th Class = Very Rich.
B. What is the most frequent group in your WI sample data? What does that indicate in
terms of your data distribution?
C. Do you think your WI of sample data is normally distributed? Provide the “statistical
reason” for your answer?
Task 3 (10 marks)
A. What is the top 10% and bottom 10 % of your WEF household values?
B. What is the probability that your WI values will be less than or equal to $200?
C. What is the probability that FS will be equal to 2?
D. Is there any outlier(s) in your sample data of WEF? Show the graph or prove for that!
If yes, what is the best statistic to measure the dispersity of your WEF?
3 | P a g e
Task 4 (15 Marks)
A. What is the probability that the head of household is woman and her HLE is Primary?
B. What is the probability that the head of household is man and has the College degree?
C. What is the proportion of having the Secondary as the highest degree from among males?
D. What is the proportion of having the Intermediate as the highest degree from among males?
E. Do you think that the events "gender of household head is male" and "having the CollegeDegree" are independent?
For task 5 ô€„ô€‘ô€‡ô€€ƒonwardô€–, assume that your sample data is normally distributed.
Task 5 (15 Marks)
A. Provide the most accurate of interval estimate of WI and interpret your result.
B. Provide the least accurate of interval estimate of WEF and interpret your result.
C. Provide the most and least accurate of interval estimates of FS and interpret your result.
D. Explain the main differences between the most and least accurate of interval estimate! Why they called as most and least accurate of interval estimate?
Task 6 (15 Marks)
A. After surveying many countries, Michael Scott, one of La Trobe University researchers believes that in order to be considered as the wealthy city, the average weekly income of thehousehold would be at least $1200. Based on the statement above, can you consider your sample data is from a wealthy city? (ð›¼ = 0.10)
B. Michael Scott also believe one city can be considered as the fertile city if the average of family size of household is greater than 8 (ðœ‡ð¹ð‘†> 8). Based on the statement above, can you consider your sample as the fertile city? (ð›¼ = 0.05)
C. Michael Scott also believes one city can be considered as the obese city if the average spending of Weekly food expenditure of household is greater than or equal to 50. Based on the statement above, can you consider your sample as the obese city? (ð›¼ = 0.01)
D. Based on the calculation above, which prediction is the most accurate and why?
4 | P a g e
Task 7 (15 Marks)
A. What is the relationship between the amount of WEF and FS in your sample?
B. What is the relationship between the amount of WI and gender of the family head in
C. How the HLE and WI do affected the WEF in your selected sample?
NB: Use the linear regression line to estimate, R, R2 and graph in order to explain the
Task 8 (10 Marks)
As one of the largest city in USA, New York is also known as the food city. In this city people
spend so much money in food, and Bill de Blasio, Mayor of New York believes, that the average
amount of weekly income (WI) spent by households is not equal with your sample data. In
order to prove that he collects a random sample of 50 households data of his city. (The data is
attached on excel file New York tab).
Based on the Bill de Blasio’s statement, perform the analysis on hypothesis testing with level
of significance of 5%. Do you think Bill de Blasio’s statement is correct?
You may consider the following assumptions while performing this test:
A. Populations for both of your sample and New York data are normally distributed
and samples are independent.
B. Population variances of Weekly Food on Expenditure (WEF) are unknown and