For this assignment, you are required to perform the following tasks:
1. Understand the assignment dataset by going through the information given in the annexures and the Excel spreadsheet.
2. Use the information given in the annexures to identify the potential data quality issues in the variables. Suggest a mechanism to deal with these data quality issues.
3. Identify one appropriate dependent variable that you would want to predict. Your dependent variable should be either ordinal or categorical. Provide the reasons for your choice of the dependent variable.
4. From the remaining variables, shortlist the independent variables (a maximum of ten) for the prediction of your dependent variable.
5. Suggest one analytical technique that you will use for predicting the dependent variable. What will be your major considerations while using this technique? Justify your reasons for the choice of the analytical technique.
The document (a Word file) should not exceed 2000 words in length (font Times New Roman, size 12; 1.5 line spacing; justified). Page margins are to be 2.5 cm all around. The file should be submitted via Turnitin on Blackboard. The Blackboard link for submission is "Assignment Submission".
The assignment questions should be answered using the following template:
1. Use the information given in the annexures to identify the potential data quality issues in the variables.
2. Suggest a mechanism to deal with these data quality issues.
3. Identify one appropriate dependent variable that you would want to predict. Your dependent variable should be either ordinal or categorical. Provide the reasons for your choice of the dependent variable.
4. From the remaining variables, shortlist the independent variables (a maximum of ten) for the prediction of your dependent variable.
5. Suggest one analytical technique that you will use for predicting the dependent variable. What will be your major considerations while using this technique?
6. Justify your reasons for the choice of the analytical technique.