A Matrix Factorization Approach for Integrating Multiple Data Views

Publication Type:

Conference Paper

Source:

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), Bled, Slovenia (2009)

URL:

http://mlg.ucd.ie/imf

Abstract:

In many domains there will exist different representations or “views” describing the same set of objects. Taken alone, these views will often be deficient or incomplete. Therefore a key problem for exploratory data analysis is the integration of multiple views to discover the under- lying structures in a domain. This problem is made more difficult when disagreement exists between views. We introduce a new unsupervised algorithm for combining information from related views, using a late in- tegration strategy. Combination is performed by applying an approach based on matrix factorization to group related clusters produced on indi- vidual views. This yields a projection of the original clusters in the form of a new set of “meta-clusters” covering the entire domain. We also pro- vide a novel model selection strategy for identifying the correct number of meta-clusters. Evaluations performed on a number of multi-view text clustering problems demonstrate the effectiveness of the algorithm.

Social Network Analysis Group

Search

User login

Subscribe to Clique

Sponsors