Individual assignment – Due at end of week 7
You are in charge of the World Bank
You have 3 objectives
- Reduce the number of people living on less than $1(US) per day
- Limit CO2 emissions so global warming is less than 1.5 Degrees by 2050
- Protect the world from another pandemic by providing basic health care for as many people as possible.
You have 1 Trillion Dollars (US) per year for each objective – 1 Trillion= 1,000,000,000,000 or 1012
Tasks
1. Identify the data sources you will use (10/50)
a. Why have you chosen these sources ? ( coverage, accuracy, data availability etc.)
b. Describe the original source of the data used from these sources
c. Indicate how confident you are about the reliability and usefulness of this data
2. Download the data and store it in a suitable format. (10/50)
a. Label each data field
b. Identify the datatype for each field
c. Visualise some key trends of the data
3. Clean and analyse the data (10/50)
a. Deal with missing data
b. Identify and deal with outlying data
c. Make some estimates about the accuracy of the data you have
4. Allocate the funding (20/50)
a. Identify how much funding is likely to be needed in each area.
b. Visualise how the funding is allocated
c. Describe in words what this allocation means
d. Explain why the funding has been allocated and how confident you are of success !
Assignment 2 – group work
Due at end of week 12
In groups of 3-5…
1. Identify a problem from a news source which you think could benefit from data analysis (10/50)
2. Identify a hypothesis or series of hypotheses that you want to test and Identify what data will you need for this (10/50)
3. Find suitable data sources and justify their use (10/50)
4. Download the data and attempt to prove or disprove your hypothesis using: (10/50)
a. Visualisation
b. Analysis – potentially including machine learning
5. Justify the approach and comment on the validity of your results (10/50)