Homework 3
SI 618 Fall 2008
Overview of Homework 3
Building on the first and second homework assignments, this exercise involves the use of a data visualizing language called R. We build on our use of SQLite for extracting specific information about our data set.
Objectives
Generate a back to back histogram showing the number of documents containing phrases that appear in two data sets.
Deliverables
Create a report that includes the following:
- an abstract of less than 150 words
- a description of the data used
- a diary of what was done
- the results (lists)
- a statement of what the results mean
- document everything on a web page and put a link to it in the class wiki
