Retour

Séance TC5 - Classification / Clustering

Jour mardi, le 05 mai 2009
Salle Ordre des CGA
Président Sylvain Perron

Présentations

15h30-
15h55
Parallel Hyperplanes Separation Method for the Two-Groups Discrimination Problem
  Anthony Guillou, GERAD, HEC Montréal, Montréal, Québec, Canada
Pierre Hansen, HEC Montréal, GERAD et Méthodes quantitatives de gestion, 3000 Côte-Sainte-Catherine, Montréal, Québec, Canada, H3T 2A7
Sylvain Perron, GERAD, HEC Montréal, Méthodes quantitatives de gestion, 3000, chemin de la Côte-Sainte-Catherine, Montréal, Québec, Canada, H3T 2A7

We consider the problem of separating two non linearly separable sets of points A and B in the euclidean space, with two parallel hyperplanes that are respectively the boundaries of half-spaces containing A and B. Both heuristic and exact method are proposed to minimise the euclidean distance between such two hyperplanes.


15h55-
16h20
Optimal Clusterwise Multiple Linear Regression
  Réal Carbonneau, GERAD et HEC Montréal, Méthodes quantitatives de gestion
Gilles Caporossi, GERAD, HEC Montréal, Méthodes quantitatives de gestion, 3000, chemin de la Côte-Sainte-Catherine, Montréal, Québec, Canada, H3T 2A7
Pierre Hansen, HEC Montréal, GERAD et Méthodes quantitatives de gestion, 3000 Côte-Sainte-Catherine, Montréal, Québec, Canada, H3T 2A7

Clusterwise regression has been used as a data mining tool, however, there is no published research on identifying optimal solutions. In this research, both a quadratic programming formulation and a branch and bound algorithm are compared for identifying Optimal Clusterwise Multiple Linear Regression (OCMLR) solutions. The processing time increases with the number of clusters, observations, dimensions and amount of noise.


16h20-
16h45
A Column Generation Algorithm for 2 Groups Discrimination
  Gilles Caporossi, GERAD, HEC Montréal, Méthodes quantitatives de gestion, 3000, chemin de la Côte-Sainte-Catherine, Montréal, Québec, Canada, H3T 2A7
Sylvain Perron, GERAD, HEC Montréal, Méthodes quantitatives de gestion, 3000, chemin de la Côte-Sainte-Catherine, Montréal, Québec, Canada, H3T 2A7

A method of discrimination among two groups of data is proposed. This method involves a combination of linear classifiers, each of which aims at minimizing the number of (weighted) misclassified observations. The combination of these classifiers is achieved by weighted vote. The learning algorithm uses column generation and could be viewed as a mathematical programming approach for boosting.


Retour