Deep Discriminative Features Learning and Sampling for Imbalanced Data Problem

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

The imbalanced data problem occurs in many application domains and is considered to be a challenging problem in machine learning and data mining. Most resampling methods for synthetic data focus on minority class without considering the data distribution of major classes. In contrast to previous works, the proposed method considers both majority classes and minority classes to learn feature embeddings and utilizes appropriate loss functions to make feature embedding as discriminative as possible. The proposed method is a comprehensive framework and different deep learning feature extractors can be utilized for different domains. We conduct experiments utilizing seven numerical datasets and one image dataset based on multiclass classification tasks. The experimental results indicate that the proposed method provides accurate and stable results.

Original languageEnglish
Title of host publication2018 IEEE International Conference on Data Mining, ICDM 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1146-1151
Number of pages6
ISBN (Electronic)9781538691588
DOIs
StatePublished - 27 Dec 2018
Event18th IEEE International Conference on Data Mining, ICDM 2018 - Singapore, Singapore
Duration: 17 Nov 201820 Nov 2018

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
Volume2018-November
ISSN (Print)1550-4786

Conference

Conference18th IEEE International Conference on Data Mining, ICDM 2018
Country/TerritorySingapore
CitySingapore
Period17/11/1820/11/18

Keywords

  • Center loss
  • Feature embedding
  • Imbalanced data
  • Synthetic sampling
  • Triplet loss

Fingerprint

Dive into the research topics of 'Deep Discriminative Features Learning and Sampling for Imbalanced Data Problem'. Together they form a unique fingerprint.

Cite this