Keynote Address
Usama Fayyad (Yahoo, Inc.)
Grand Challenges in Data Mining: The Technical, The Pragmatic, and the Ugly
Thursday 9:00-10:00, Fountain I, II, III
Abstract:
Data Mining has received much attention as companies and organizations
started to ask how they can better utilize the huge data stores they
built up over the past two decades. While some interesting progress has
been achieved over the past few years, especially when it comes to
techniques and scalable algorithms, very few organizations have managed
to benefit from the technology. This paradoxical situation of having too
much data and be unable to utilize it or mine it arose because of both
technical and business challenges. We will cover these challenges, paint
a picture for where the data problems are, and cover some of the
pragmatic issues. While emphasis in the academic community has been
focused primarily on developing new algorithms, few people have paid
attention to the problem that algorithms cannot get to data today in a
usable form. Of particular interest are the challenges of how to make
the technology really work in practice: considering the business setting
and the realities of the possible --including how to evolve the role of
data into a more strategic position. We shall also cover applications in
the EBusiness setting to illustrate the challenges and contributions of
data mining as an example of the type of organization where the role of
data has become so central that it has gained a strategic role that is
critical to the business. I will also discuss some of this evolution
which brings challenges to all of us on how to think about data and how
to present it in a context that transcends the traditionally purely
technical framework and extends it into the core business strategy
realm. Finally, since there are still many unsolved deeper technical and
scientific problems in this field, we conclude by revisiting the
technical challenges facing the field.