Extending Ldms To Enable Performance Monitoring In Multi-Core Applications
Keywords
Monitoring Performance; Multi-core
Abstract
Identifying design patterns that limit the performance of multi-core algorithms is a challenging task. There are many known methods by which threads synchronize their actions and each method may exhibit different behavior in different use cases. These use cases may vary in regards to the workload being executed, number of parallel tasks, dependencies between these tasks, and the behavior of the system scheduler. Restructuring algorithms to overcome performance limitations requires intimate knowledge on how these algorithms utilize the hardware. In our experience, we have found a lack of adequate tools to gain such knowledge. To address this, we have enhanced and implemented additional data sampler modules for OVIS's Lightweight Distributed Metric Service (LDMS) to enable scalable distributed collection of hardware performance counter data. These modules provide an interface by which LDMS can utilize the PAPI library, Linux perf tools, and RAPL to collect hardware performance data of interest. Using these samplers, we plan to monitor the intra-node behavior, including contention for node level shared resources, of multi-core applications for a diverse set of use cases. We are currently exploring how the values reported are affected by the level of concurrency, the synchronization methodologies, and progress guarantees. We hope to use this information to identify ways to restructure algorithms to increase their performance.
Publication Date
10-26-2015
Publication Title
Proceedings - IEEE International Conference on Cluster Computing, ICCC
Volume
2015-October
Number of Pages
717-720
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/CLUSTER.2015.125
Copyright Status
Unknown
Socpus ID
84959259300 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/84959259300
STARS Citation
Feldman, Steven; Zhang, Deli; Dechev, Damian; and Brandt, James, "Extending Ldms To Enable Performance Monitoring In Multi-Core Applications" (2015). Scopus Export 2015-2019. 2043.
https://stars.library.ucf.edu/scopus2015/2043