PTDM 2012 - Brussels

Practical Theories for Exploratory Data Mining (PTDM), Brussels 2012

The goal of this ICDM 2012 workshop is to help closing the gap between data mining practice and theory. To this end, we intend to explore what is the essence of exploratory data mining and how to formalize it in a useful but theoretically well-founded way.

The workshop is motivated by a widely perceived discrepancy between theoretical data mining prototypes and practitioners’ requirements. A notable example is frequent pattern mining. Despite its attractive theoretical foundations, the practical use of frequent pattern mining methods has been limited. This is due to a difficulty to overcome issues, such as the pattern explosion problem and a discrepancy between usefulness and frequency. These issues have been addressed to some extent in the past 15 years, through heuristic post-processing steps and through rigorously motivated adaptations. The multitude of possible solution strategies has unfortunately to a large extent undermined the original elegance, and made it hard for practitioners to understand how to use these techniques.

The problem is however not restricted to frequent pattern mining alone. The multitude of available methods for typical exploratory data mining problems such as (subspace) clustering and dimensionality reduction is such that practitioners face a daunting task in selecting a suitable method. Additionally to the usability issues, less attention has been given on pattern mining methods for relational databases. Although most real world databases are relational, most pattern mining research has focused on one-table data.

We believe the core reasons for these difficulties are:

Different users inevitably have different prior beliefs and goals, whereas most exploratory data mining algorithms have a rigid objective function and do not consider this.
Formally comparing the quality of different data mining patterns is hard due to their widely varying nature (e.g. comparing a dimensionality reduction with a frequent itemset), unless their 'interestingness' can be quantified in a comparable manner.
The iterative process of data mining is often not considered.
Data mining in complex relational data is hard to fit into standard data mining prototypes.
More generally, data mining methods tend to be rigid, defined for highly specific tasks, for highly specific and idealized data, and for very specific types of patterns.

The purpose of this workshop will be to serve as a forum of exchanging ideas on how to formalize exploratory data mining in order to make it useful in practice. This workshop will survey (through invited as well as contributed talks and posters) some existing attempts at addressing the problems mentioned above. We particularly encourage papers that present principled theoretical contributions motivated by real world requirements.

For more information please visit the workshop´s website.

Categories

Top » Computer Science » Data Mining


Opening Remarks
[syn] 2544 views, 17:34 IntroductionIntroduction Tijl De Bie Tijl De Bie
Keynote Talks
[syn] 3190 views, 41:19 Keynote Talk Network-based Data Integration for Computational Systems BiologyNetwork-based Data Integration for Computational Systems Biology Kathleen Marchal Kathleen Marchal	[syn] 2763 views, 58:29 Keynote Talk From Inductive Querying to Declarative Modeling for Data MiningFrom Inductive Querying to Declarative Modeling for Data Mining Luc De Raedt Luc De Raedt	[syn] 3150 views, 55:29 Keynote Talk The Use of Randomization and Statistical Significance in Data MiningThe Use of Randomization and Statistical Significance in Data Mining Kai Puolamäki Kai Puolamäki	[syn] 3214 views, 55:15 Keynote Talk Datamining "Looking backward, looking forward"Datamining "Looking backward, looking forward" Pieter Adriaans Pieter Adriaans
Lectures
[syn] 2348 views, 17:08 Thorough analysis of log data with dependency rules: Practical solutions and theoretical challengesThorough analysis of log data with dependency rules: Practical solutions ... Wilhelmiina Hämäläinen Wilhelmiina Hämäläinen	[syn] 2314 views, 13:29 Enhancing the Analysis of Large Multimedia Applications Execution Traces with FrameMinerEnhancing the Analysis of Large Multimedia Applications Execution Traces with ... Christiane Kamdem Kengne Christiane Kamdem Kengne	[syn] 2362 views, 16:33 Generalized Expansion DimensionGeneralized Expansion Dimension Michael Nett Michael Nett 1 comment	[syn] 2185 views, 18:08 Generating Diverse Realistic Data Sets for Episode MiningGenerating Diverse Realistic Data Sets for Episode Mining Albrecht Zimmermann Albrecht Zimmermann

Write your own review or comment:

Comment:
Name:
Email address:
URL:

make sure you have javascript enabled or clear this field:

View order

Topic taxonomy

Type of content

Language

Year

PTDM 2012 - Brussels

Practical Theories for Exploratory Data Mining (PTDM), Brussels 2012

Opening Remarks

Keynote Talks

Lectures

Write your own review or comment:

From:
To: