INFT 979/CSI 979 Advanced Topics: Data Mining
Syllabus prepared by Dr.
Edward J. Wegman
Fall Semester, 1996
Tues: 7:20 - 10:00 p.m.
Basic Description
"Data Mining" has become a buzz-word within the computer industry for
extraction of knowledge or information from large databases. Often the databases
are financial, but the technology extends to other sorts of information
databases. Similar ideas have existed in the statistics community for about 15
years under the name of exploratory data analysis. The convergence of these
ideas coupled with recent advances in storage technology and database structures
offer an interesting, exciting new technology. The idea of data mining is to
look for information which may reside in opportunistically collected databases.
A classic example involves a large discount chain sales records database in
which it was noted that there was a high correlation between sales of disposable
diapers and beer. The conclusion was that many men on their way home would be
asked by their wives to pick up disposable diapers. The man decided to buy a six
pack of beer while he was stopped anyway. The discount chain moved the beer and
snacks such as peanuts and pretzels next to the disposable diapers and increased
sales on peanuts and pretzels by more that 27%.
This course will focus on bringing together aspects of the computing,
graphical/visualization and statistical technologies into an integrated
treatment. This course will deal with data mining, exploratory analysis and
knowledge discovery, particularly as they relate to graphical tools. Topics will
include foundations, classification and clustering, trend and deviation
analysis, dependency derivation, graphical tools, integrated discovery systems,
and next generation databases.
Lecture Notes
Lecture notes will be made available as the course progresses. Watch this
page for further materials.
URL's For Data Mining
Dr. Wegman's Powerpoint Presentation on Data Mining
Further Information
The course will be project oriented. There will be collateral reading
assignments from the course text
Fayyad, U. M. et al. (1996) Advances in Knowledge Discovery and Data
Mining, AAAI Press/MIT Press.
For further information, contact Dr. Edward J. Wegman at 703-993-1691 or by
email at ewegman@gmu.edu.