Objectives
This assignment aims to assess students’ theoretical understanding and practical knowledge of concepts covered in independent learning and practical sessions.
Background
The dataset provided in this assignment is created by researchers at the Peterson Institute for International Economics (PIIE) which uses data from the 1996 to 2014 Forbes’ World’s Billionaires lists. The data includes the name, country of citizenship and networth (current US dollars) and other variables of the world’s billionaires.
Using data and analytics, we can use the data to analyse outcomes and look at trends in the numbers, geographical locations, and sources of wealth and/or net worth of the world’s richest people. The dataset provides insights on sources of wealth acquired by the world’s wealthiest, change in gender and age distribution and regions they are based in. Hence, we can learn more about changes in extreme wealth, inequalities of wealth distribution and economic problems.
You are challenged to analyse trends and insights of the world’s wealthiest or any other relevant study using the publicly available dataset. Following are required for the assignment:
You are given 3 datasets scrapped from the dataset created by researchers at the Peterson Institute for International Economics (PIIE). To facilitate the assignment, some modifications are made where appropriate.
This dataset describes the demographics, companies info, sources of wealth:
The below data dictionary provides a description of each column for all datasets.
Wealthiest Demographics Dataset
Name |
Description |
age |
The age of the billionaire |
citizenship |
Billionaire country of citizenship |
gender |
Gender identity: female or male |
name |
Name of the individual or family on the billionaires list |
was political |
True, if billionaire is linked to a politician, or questionable license |
wealth.type |
Source of the wealth |
worth in billions
|
Net worth of billionaire, current US dollars in billions |
country code |
3 digit ISO country code |
region |
Location classification of the billionaire |
Company Info Dataset
Name |
Description |
category |
Broad industry categories |
company.name |
Company primarily associated with billionaire’s wealth |
company.type |
Indicates if company was new, acquired or privatized when billionaire or family members were first associated with it |
founded |
Founding date of the company associated with the billionaire’s wealth
|
gdp |
By country GDP, current US dollars |
country code |
3 digit country code |
name |
Name of the individual or family on the billionaires list |
industry |
The industry labels based on Kaplan and Rauh (2013) which the company primarily associated with billionaire’s wealth is in |
relationship |
Describes the billionaire’s relationship to their company |
sector |
The sector which the company primarily associated with billionaire’s wealth is in |
year |
The year the billionaire or family is listed in the wealthiest list |
Countrycode Dataset
Name |
Description |
citizenship |
The status of being a citizen in the country |
country code |
3 digit country code |
region |
Location classification of the billionaire |
Task – Case Study
What can we learn about the about changes in extreme wealth and inequalities of wealth distribution in United States, Europe and other advanced countries using the dataset on the sources of billionaire wealth? Are the wealth mostly self-made or inherited and can we identify the company and industry from which it comes?
Among self-made billionaires are the individuals, founders, executives, politically connected? Can industries, sectors, regions lead to wealth being generated faster based on the data?
What You Need To DoUsing data and analytics, we can research the data, analyse outcomes and look at trends in the numbers, geographical locations, and sources of wealth and/or net worth of the world’s richest people
You are to explore the data and provide the insights in the numbers, geographical locations, and sources of wealth and/or net worth of the world’s top wealth distribution. Before you can visualize and perform analysis on the data, there is a need to understand the business requirements, understand the data, and perform data cleaning and exploration.
Data Profiling and Data PreparationBelow are some of the fundamental questions about inequality in wealth distribution: