Multiview self-learning

  1. Get@NRC: Multiview self-learning (Opens in a new window)
DOIResolve DOI:
AuthorSearch for: ; Search for: ; Search for: ; Search for:
Journal titleNeurocomputing
Pages117127; # of pages: 11
SubjectClassification (of information); Information retrieval systems; Text processing; Document Classification; Image annotation; Labeled training data; Margin distributions; Multi-view learning; Multilingual documents; Self-learning; State-of-the-art techniques; Image classification; algorithm; Bayes theorem; classification; classifier; image processing; intermethod comparison; machine learning; multiview self learning
AbstractIn many applications, observations are available with different views. This is, for example, the case with image-text classification, multilingual document classification or document classification on the web. In addition, unlabeled multiview examples can be easily acquired, but assigning labels to these examples is usually a time consuming task. We describe a multiview self-learning strategy which trains different voting classifiers on different views. The margin distributions over the unlabeled training data, obtained with each view-specific classifier are then used to estimate an upper-bound on their transductive Bayes error. Minimizing this upper-bound provides an automatic margin-threshold which is used to assign pseudo-labels to unlabeled examples. Final class labels are then assigned to these examples, by taking a vote on the pool of the previous pseudo-labels. New view-specific classifiers are then trained using the labeled and pseudo-labeled training data. We consider applications to image-text classification and to multilingual document classification. We present experimental results on the NUS-WIDE collection and on Reuters RCV1-RCV2 which show that despite its simplicity, our approach is competitive with other state-of-the-art techniques.
Publication date
AffiliationNational Research Council Canada; Information and Communication Technologies
Peer reviewedYes
NPARC number21275648
Export citationExport as RIS
Report a correctionReport a correction
Record identifierd06cf560-26f6-4e99-8c68-5639436a4b6d
Record created2015-07-14
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)