Pattern Mining

The session Pattern Mining will be held on thursday, 2019-09-19, from 11:00 to 12:40, at room 1.011. The session chair is Bruno Crémilleux.

Talks

11:20 - 11:40
Maximal Closed Set and Half-Space Separations in Finite Closure Systems (559)
Florian Seiffarth (University of Bonn), Tamás Horváth (University of Bonn; Fraunhofer IAIS, Sankt Augustin; Fraunhofer Center for Machine Learning, Sankt Augustin), Stefan Wrobel (University of Bonn; Fraunhofer IAIS, Sankt Augustin; Fraunhofer Center for Machine Learning, Sankt Augustin)

Motivated by various binary classification problems in structured data (e.g., graphs or other relational and algebraic structures), we investigate some algorithmic properties of closed set and half-space separation in abstract closure systems.Assuming that the underlying closure system is finite and given by the corresponding closure operator, we formulate some negative and positive complexity results for these two separation problems.In particular, we prove that deciding half-space separability in abstract closure systems is NP-complete in general.On the other hand, for the relaxed problem of maximal closed set separation we propose a simple greedy algorithm and show that it is efficient and has the best possible lower bound on the number of closure operator calls.As a second direction to overcome the negative result above, we consider Kakutani closure systems and show first that our greedy algorithm provides an algorithmic characterization of this kind of set systems.As one of the major potential application fields, we then focus on Kakutani closure systems over graphs and generalize a fundamental characterization result based on the Pasch axiom to graph structure partitioning of finite sets.Though the primary focus of this work is on the generality of the results obtained, we experimentally demonstrate the practical usefulness of our approach on vertex classification in different graph datasets.Motivated by various binary classification problems in structured data (e.g., graphs or other relational and algebraic structures), we investigate some algorithmic properties of closed set and half-space separation in abstract closure systems.Assuming that the underlying closure system is finite and given by the corresponding closure operator, we formulate some negative and positive complexity results for these two separation problems.In particular, we prove that deciding half-space separability in abstract closure systems is NP-complete in general.On the other hand, for the relaxed problem of maximal closed set separation we propose a simple greedy algorithm and show that it is efficient and has the best possible lower bound on the number of closure operator calls.As a second direction to overcome the negative result above, we consider Kakutani closure systems and show first that our greedy algorithm provides an algorithmic characterization of this kind of set systems.As one of the major potential application fields, we then focus on Kakutani closure systems over graphs and generalize a fundamental characterization result based on the Pasch axiom to graph structure partitioning of finite sets.Though the primary focus of this work is on the generality of the results obtained, we experimentally demonstrate the practical usefulness of our approach on vertex classification in different graph datasets.

11:00 - 11:20
DEvIANT: Discovering Significant Exceptional (Dis-)Agreement Within Groups (58)
Adnene Belfodil (INSA Lyon), Wouter Duivesteijn (Technische Universiteit Eindhoven), Marc Plantevit (Univ Lyon), Sylvie Cazalens (INSA Lyon), Philippe Lamarre (INSA Lyon)

We strive to find contexts (i.e., subgroups of entities) under which exceptional (dis-)agreement occurs among a group of individuals, in any type of data featuring individuals (e.g., parliamentarians, customers) performing observable actions (e.g., votes, ratings) on entities (e.g., legislative procedures, movies). To this end, we introduce the problem of discovering statistically significant exceptional contextual intra-group agreement patterns. To handle the sparsity inherent to voting and rating data, we use Krippendorff's Alpha measure for assessing the agreement among individuals. We devise a branch-and-bound algorithm, named DEvIANT, to discover such patterns. DEvIANT exploits both closure operators and tight optimistic estimates. We derive analytic approximations for the confidence intervals (CIs) associated with patterns for a computationally efficient significance assessment. We prove that these approximate CIs are nested along specialization of patterns. This allows to incorporate pruning properties in DEvIANT to quickly discard non-significant patterns. Empirical study on several datasets demonstrates the efficiency and the usefulness of DEvIANT.

Reproducible Research
11:40 - 12:00
Multi-location Visibility Query Processing Using Portion-based Transactional Modeling and Pattern Mining (J22)
Lakshmi Gangumalla, P. Krishna Reddy, Anirban Mondal


12:00 - 12:20
Mining Skypatterns in Fuzzy Tensors (J23)
Nicolas Nadisic, Aurélien Coussat, Loïc Cerf


Parallel Sessions