Towards No-Penalty Control Hazard Handling in RISC Architecture Microcontrollers

dc.contributor.advisorRizkalla, Maher E.
dc.contributor.advisorLee, John J.
dc.contributor.authorBalasubramanian, Linknath Surya
dc.contributor.otherYtterdal, Trond
dc.contributor.otherKumar, Mukesh
dc.date.accessioned2024-09-03T12:46:12Z
dc.date.available2024-09-03T12:46:12Z
dc.date.issued2024-08
dc.degree.date2024
dc.degree.disciplineElectrical & Computer Engineeringen
dc.degree.grantorPurdue Universityen
dc.degree.levelPh.D.
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en
dc.description.abstractAchieving higher throughput is one of the most important requirements of a modern microcontroller. It is therefore not affordable for it to waste a considerable number of clock cycles in branch mispredictions. This paper proposes a hardware mechanism that makes microcontrollers forgo branch predictors, thereby removing branch mispredictions. The scope of this work is limited to low cost microcontroller cores that are applied in embedded systems. The proposed technique is implemented as five different modules which work together to forward required operands, resolve branches without prediction, and calculate the next instruction's address in the first stage of an in-order five stage pipelined micro-architecture. Since the address of successive instruction to a control transfer instruction is calculated in the first stage of pipeline, branch prediction is no longer necessary, thereby eliminating the clock cycle penalties occurred when using a branch predictor. The designed architecture was able to successfully calculate the address of next correct instruction and fetch it without any wastage of clock cycles except in cases where control transfer instructions are in true dependence with their immediate previous instructions. Further, we synthesized the proposed design with 7nm FinFET process and compared its latency with other designs to make sure that the microcontroller's operating frequency is not degraded by using this design. The critical path latency of instruction fetch stage integrated with the proposed architecture is 307 ps excluding the instruction cache access time.
dc.identifier.urihttps://hdl.handle.net/1805/43081
dc.language.isoen_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0
dc.subjectMicrocontrollers
dc.subjectBranch mispredictions
dc.subjectControl transfer instruction
dc.subjectVariable reference voltage
dc.subjectDegraded switches
dc.subjectTransmission gate
dc.subjectIn-memory computing
dc.subjectAdder
dc.subjectSum-comparator
dc.titleTowards No-Penalty Control Hazard Handling in RISC Architecture Microcontrollers
dc.typeThesisen
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Towards No-Penalty Control Hazard Handling in RISC architecture microcontrollers.pdf
Size:
8.85 MB
Format:
Adobe Portable Document Format
Description:
Achieving higher throughput is one of the most important requirements of a modern microcontroller. It is therefore not affordable for it to waste a considerable number of clock cycles in branch mispredictions. This paper proposes a hardware mechanism that makes microcontrollers forgo branch predictors, thereby removing branch mispredictions. The scope of this work is limited to low cost microcontroller cores that are applied in embedded systems. The proposed technique is implemented as five different modules which work together to forward required operands, resolve branches without prediction, and calculate the next instruction's address in the first stage of an in-order five stage pipelined micro-architecture. Since the address of successive instruction to a control transfer instruction is calculated in the first stage of pipeline, branch prediction is no longer necessary, thereby eliminating the clock cycle penalties occurred when using a branch predictor. The designed architecture was able to successfully calculate the address of next correct instruction and fetch it without any wastage of clock cycles except in cases where control transfer instructions are in true dependence with their immediate previous instructions. Further, we synthesized the proposed design with 7nm FinFET process and compared its latency with other designs to make sure that the microcontroller's operating frequency is not degraded by using this design. The critical path latency of instruction fetch stage integrated with the proposed architecture is 307 ps excluding the instruction cache access time.
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: