This assignment covers all the learning outcomes for the module which are given below:
Knowledge & Understanding:
Assessment Brief
Assessment Regulations
You are advised to read the guidance for students regarding assessment policies. They are available online here.
Late submission of work
Where coursework is submitted late without approval, after the published hand-in deadline, the following penalties will apply.
Coursework submitted more than 1 working day (24 hours) after the published hand-in deadline without approval will be regarded as not having been completed. A mark of zero will be awarded for the assessment and the module will be failed, irrespective of the overall module mark. These provisions apply to all assessments, including those assessed on a Pass/Fail basis. The full policy can be found here.
Students must retain an electronic copy of this assignment (including ALL appendices) and it must be made available within 24 hours if you are asked to resubmit it.
The Assessment Regulations for Taught Awards (ARTA) contain the Regulations and procedures applying to cheating, plagiarism and other forms of academic misconduct.
The full policy is available at here
You are reminded that plagiarism, collusion and other forms of academic misconduct as referred to in the Academic Misconduct procedure of the assessment regulations are taken very seriously. Assignments in which evidence of plagiarism or other forms of academic misconduct is found may receive a mark of zero.
There are different ways to deal with missing data values when processing data, describe one of the ways to deal with it and what are the positives and negatives of using doing this way.
Cystic fibrosis is an inherited condition that causes sticky mucus to build up in the lungs and digestive system. This causes lung infections and problems with digesting food. Below is the data recorded for this disease for Bone Morphogenetic Protein (BMP), FEV1 (Forced expiratory volume in 1 second), Right Ventricle (RV), Functional Residual Capacity (FRC), Total Lung Capacity (TLC) and Maximal Expiratory Pressure (PEMAX). The data below is also provided in ‘cystfibr.txt’ for copying and processing.
(a) Read this data into a data frame and attach it to the data frame. (3)
(b) Create summaries of the variables in this dataset and comment on them? (4)
For the data given in question 3 (‘cystfibr.txt’), (15)
(a) Use scatterplots between the variables to find any clear relationships between the variables and discuss them? (5)
(b) Create boxplots for the variables height, weight, bmp, fev1, rv, frc, tlc and pemax, all stratified by sex. Which of these have evidence of outlying observations?
The probability that a patient recovers from a delicate heart operation is 0.9. What is the probability that exactly 5 of the next 7 patients having this operation survive? (6)
Northumbria University’s ‘ask4help’ receives 4 emails per minute on the average. Find the probability of receiving 5 emails in a given minute. (6)
A fuel station sells, on the average, 14500 litres of fuel per day with a standard deviation of 2500 litres. If a manager stocks 20000 litres on a particular day, what is the probability that more than 10000 litres will be sold?
Report Requirements
Your STATISTICAL report should consist of eight or fewer pages (applies only to question 10) and should be word-processed. Credit will be given for the use of an appropriate technical style of presentation.
Your report should address the following topics: