Digital Library

cab1

 
Title:      USER ASSISTED EXPLORATION AND SAMPLING OF THE SOLUTION SET OF NON-NEGATIVE MATRIX FACTORIZATIONS
Author(s):      Joachim Staib, Marcel Spehr, Stefan Gumhold
ISBN:      978-989-8704-10-8
Editors:      Ajith P. Abraham, Antonio Palma dos Reis and Jörg Roth
Year:      2014
Edition:      Single
Keywords:      Data Mining, Interactive Sampling, Non-Negative Matrix Factorization
Type:      Full Paper
First Page:      29
Last Page:      38
Language:      English
Cover:      cover          
Full Contents:      click to dowload Download
Paper Abstract:      The non-negative matrix factorization provides a valuable tool for the analysis of positive data by representing it as an additive linear superposition of a small number of non-negative base elements. This property allows the base elements to be interpreted in the same domain as the input data. The problem though lies in the ambiguity of equally valid solutions from which only one is obtained. Its selection depends on the initialization of the applied factorization algorithm or further constraints. We propose a new approach which is based on sampling the set of valid factorizations, given one initial solution. First we derive a parameterization of the ambiguity. A parameter tuple can be probed for membership through an oracle function that either returns true or false. Then we present an algorithm that explores and samples parts of the non-convex solution set. To assist the otherwise automatic process and to alleviate the drawbacks of sampling a non-convex space, we provide a graphical user interface that puts the human in the loop. From an initial set of samples the user is allowed to select elements that serve as the starting point for subsequent samplings. With this browser-like tool a steering of the sampling of the NMF can be performed without further knowledge on the underlying algorithm and without the need to express possibly hard to formulate constraints. An evaluation of the sampling procedure reveals promising results for a factorization with a rank up to 4.
   

Social Media Links

Search

Login