Enabling Real Time Instrumentation Using Reservoir Sampling and Binpacking

dc.contributor.advisorHill, James H.
dc.contributor.authorMeruga, Sai Pavan Kumar
dc.contributor.otherDurresi, Arjan
dc.contributor.otherZheng, Jiang Yu
dc.date.accessioned2023-08-31T16:42:08Z
dc.date.available2023-08-31T16:42:08Z
dc.date.issued2023-05
dc.degree.date2023en_US
dc.degree.disciplineComputer & Information Science
dc.degree.grantorPurdue Universityen_US
dc.degree.levelM.S.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractThis thesis investigates the overhead added by reservoir sampling algorithm at different levels of granularity in real-time instrumentation of a distributed software systems. Firstly, this thesis not only discusses the inconsistencies found in the implementation of the reservoir sampling pintool in paper [ 1 ] but also provides the correct implementation. Secondly, this thesis provides the design and implementation of pintools for different level of granularities i.e., thread level, image level and routine level. Additionally, we provide quantitative comparison of performance for different sampling techniques (including reservoir sampling) at different levels of granularity. Based on the insights obtained from the empirical results, to enable real time instrumentation, we need to scale and manage the resources in the best way possible. To scale the reservoir sampling algorithm on a real time software system we integrate the traditional bin packing approach with the instrumentation in such a way that there is a decrease in the memory usage and improve the performance. The results of this research show that percentage difference between overhead added by Reservoir and Constant Sampling at a Image level granularity is 1.74%, at a Routine level granularity is 0.3% percent, at a Thread level granularity is 0.035%. Additionally, when we use bin packing technique along with reservoir sampling it normalizes the memory usage/performance runtime for Reservoir Sampling across multiple threads and different system visibility levels.
dc.identifier.urihttps://hdl.handle.net/1805/35280
dc.language.isoen_USen_US
dc.subjectReservoir sampling
dc.subjectBin packing
dc.subjectSoftware instrumentation
dc.titleEnabling Real Time Instrumentation Using Reservoir Sampling and Binpacking
dc.typeThesisen
thesis.degree.disciplineComputer & Information Scienceen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_final-1.pdf
Size:
31.68 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: