Replanting Your Forest: NVM-friendly Bagging Strategy for Random Forest

Yu Ting Ho, Chun Feng Wu, Ming Chang Yang, Tseng Yi Chen, Yuan Hao Chang

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

12 Scopus citations

Abstract

Random forest is effective and accurate in making predictions for classification and regression problems, which constitute the majority of machine learning applications and systems today. However, as data are generated explosively in this big data era, many machine learning algorithms, including the random forest algorithm, may have difficulty maintaining and processing all the required data in main memory. Instead, intensive data movements (i.e., data swapping) between the faster-but-smaller main memory and the slower-but-larger secondary storage may occur excessively and significantly degrade performance. To address this challenge, emerging non-volatile memory (NVM) technologies are widely expected to replace traditional random access memory (RAM) and to provide a larger-than-ever main memory space, because of their higher cell density, lower power consumption, and read performance comparable to that of traditional RAM. Nevertheless, the limited write endurance of NVM cells and the read-write asymmetry of NVMs may still limit the feasibility of running machine learning algorithms directly on NVMs. This dilemma inspires this study to develop an NVM-friendly bagging strategy for the random forest algorithm that trades the 'randomness' of the sampled data for reduced data movements in the memory hierarchy without hurting prediction accuracy. The evaluation results show that the proposed design could save up to 72% of the write accesses on the representative traces with nearly no degradation in prediction accuracy.
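The trade-off between sampling randomness and data movement can be illustrated with a small sketch. This is not the paper's algorithm, which the abstract does not specify. It is a hypothetical comparison, assuming the training set is laid out in fixed-size blocks (for example, memory pages), between standard bootstrap sampling and a locality-restricted variant that draws samples only from a few chosen blocks. The function names, the block size, and the number of blocks are all assumptions made for illustration.

```python
import random


def bootstrap_indices(n, rng):
    """Standard bagging: draw n row indices uniformly with replacement."""
    return [rng.randrange(n) for _ in range(n)]


def block_bootstrap_indices(n, block_size, num_blocks, rng):
    """Hypothetical locality-restricted bagging (an assumption, not the
    paper's method): pick num_blocks blocks first, then draw n row indices
    with replacement from inside those blocks only."""
    total_blocks = (n + block_size - 1) // block_size
    chosen = rng.sample(range(total_blocks), num_blocks)
    indices = []
    for _ in range(n):
        block = rng.choice(chosen)
        start = block * block_size
        end = min(start + block_size, n)
        indices.append(rng.randrange(start, end))
    return indices


def blocks_touched(indices, block_size):
    """Count distinct blocks referenced by a sample, a rough proxy for how
    much data must be brought into main memory to build one tree."""
    return len({i // block_size for i in indices})


if __name__ == "__main__":
    rng = random.Random(0)
    n, block_size, num_blocks = 10_000, 100, 25
    standard = bootstrap_indices(n, rng)
    local = block_bootstrap_indices(n, block_size, num_blocks, rng)
    print("standard bagging touches", blocks_touched(standard, block_size), "blocks")
    print("block-restricted bagging touches", blocks_touched(local, block_size), "blocks")
```

In this sketch the block-restricted sample can never reference more than `num_blocks` blocks, whereas a uniform bootstrap over the whole data set tends to reference almost all of them. Whether accuracy is preserved under such a restriction is an empirical question that the sketch does not address.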

Original language: English
Title of host publication: Proceedings - 2019 IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728138541
State: Published - Aug 2019
Event: 8th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2019 - Hangzhou, China
Duration: 18 Aug 2019 to 21 Aug 2019

Publication series

Name: Proceedings - 2019 IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2019

Conference

Conference: 8th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2019
Country/Territory: China
City: Hangzhou
Period: 18/08/19 to 21/08/19
