Homework 6
SI 618 Fall 2008
Overview of Homework 6
Understand how clustering can reveal patterns of similarity and dissimilarity within large sets of data.
Objectives
Create a matrix showing dissimilarity between documents and the terms appearing in those documents. Use the matrix to create a dendogram that reveals meaningful clusters.
Deliverables
Create a report that includes the following:
- an abstract of less than 150 words
- a description of the data used
- a diary of what was done
- the results (lists)
- a statement of what the results mean
- document everything on a web page and put a link to it in the class wiki
