Abstract
Leveraging multithreading on embedded multicore platforms has been proven effective on handling the increasing resolutions of target stimuli of object detection. However, complex tradeoffs and correlated design impacts between a parallel application and the underlying multicore platform necessitate an effective and adaptable multithreaded design. This paper introduces a hybrid multithreaded object detection with high parallelism and extensive data reuse. A self adaptable flow is proposed to adjust the multithreaded object detection to fully exploit various embedded multicore architectures. The ARM-based cycle accurate simulations of multicore systems have shown the superior performance returned by the proposed design.
Original language | English |
---|---|
Pages (from-to) | 25-38 |
Number of pages | 14 |
Journal | Journal of Parallel and Distributed Computing |
Volume | 78 |
DOIs | |
State | Published - 1 Apr 2015 |
Keywords
- Cache memories
- Face and gesture recognition
- Multiprocessor systems