COMPSCI4094 Distributed Databases and Data Mining
Project Objectives and Scope
The objectives of the term project is that you will have a good understanding of the given research topic, provide insight into its solution and a well de ned strategy for its solution. You should treat the term project as if you were doing the initial background study for further in-depth research. In other words, the report should demonstrate an understanding of and an insight into the problem such that given enough time, you could carry it to its logical conclusion and complete the research.
The project has two parts: an in-depth literature review and an implementation of a classi cation problem. For groups that with 1 (i.e., individual project) or 2 student(s), only literature review is required, see details in Deliverable section.
Literature review. It describes the problem domain with proper problem de nition, and a survey of existing work. The research topic of this term project is Web Mining and Content Analysis.
The sub-topics include:
a. Crawling and indexing Web content;
b. Web recommender systems and algorithms;
c. Summarization of Web data;
d. Data, entity, event, and relationship extraction;
e. Knowledge acquisition and automatic construction of knowledge bases;
f. Large-scale graph analysis. Please pick one of them.