George Mason University
CDS/CCDS/Statistics Colloquium Series
Seminar Announcement


Multi-modal Data and Text Mining

John Thomas Rigsby

Naval Surface Warfare Center
Advanced Computation Division


Research 1, Room 301, Fairfax Campus
George Mason University, 4400 University Drive, Fairfax, VA 22030

Time: 10:30 a.m. Refreshments, 10:45 a.m. Colloquium Talk
Date: November 9, 2007



ABSTRACT

There are many attributes to text analysis: words, documents, bigrams, trigrams, n-grams, contextual relationships, latent semantics, and many others. This paper covers a spectral graph method for co-clustering multiple attributes at the same time. Co-clustering is very useful not only because it turns a two step process into a one step process, but it also shows you the relationships between different sets of attributes. This paper goes beyond normal two-mode co-clustering (ie words and documents) into the area of co-clustering multiple modes (ie words, documents, bigrams, trigrams, etc.) all at the same time.