Rakesh Kumar
About
Rakesh Kumar is an Associate Professor in the Department of Computer Science at Norwegian University of Science and Technology (NTNU). He is affiliated with Computer Architecture Lab (CAL) in the Computing Unit. Before joining NTNU, he was a post-doctoral researcher at Uppsala University, Sweden and the University of Edinburgh, UK. He received his PhD from UPC, Barcelona in 2014.
Research
His current research focuses on improving the efficiency of large-scale datacenters through improvement in processor microarchitecture and memory systems. His previous work explored hardware/software co-designed processors (think of Nvidia Denver) as an energy-efficient alternative to conventional (hardware only) processors. He has also investigated dynamic code translation and optimizations, especially vectorization.
Teaching
- TDT4258 Low Level Programming, Autumn 2021, 2020, 2019, 2018
- TDT01 Architecture of Computing Systems, Autumn 2021, 2020, 2019, 2018
- TFE4208 Embedded Systems Design Project, Spring 2022, 2021, 2020, 2019
Competencies
Publications
2022
-
Asheim, Truls;
Ahmed Khan, Tanvir;
Kasikci, Baris;
Kumar, Rakesh.
(2022)
Impact of Microarchitectural State Reuse on Serverless Functions.
8th International Workshop on Serverless Computing (WoSC).
Academic chapter/article/Conference paper
-
Asheim, Truls;
Grot, Boris;
Kumar, Rakesh.
(2022)
A Specialized BTB Organization for Servers.
The 31st International Conference on Parallel Architectures and Compilation Techniques (PACT).
Academic chapter/article/Conference paper
-
Kumar, Rakesh;
Alipour, Mehdi;
Black-Schaffer, David.
(2022)
Dependence-aware Slice Execution to Boost MLP in Slice-out-of-order Cores.
ACM Transactions on Architecture and Code Optimization (TACO).
volum 19 (2).
Academic article
-
Kumar, Rakesh;
Alipour, Mehdi;
Black-Schaffer, David.
(2022)
Freeway to Memory Level Parallelism in Slice-Out-of-Order Cores.
arXiv.org.
Academic article
-
Kumar, Rakesh;
Grot, Boris.
(2022)
Shooting Down the Server Front-End Bottleneck.
ACM Transactions on Computer Systems.
volum 38 (3-4).
Academic article
-
Ugedal, Odin;
Kumar, Rakesh.
(2022)
Mitigating Unnecessary Throttling in Linux CFS Bandwidth Control.
IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)..
Academic chapter/article/Conference paper
2021
-
Ahmed Khan, Tanvir;
Brown, Nathan;
Sriraman, Akshitha;
Soundararajan, Niranjan;
Kumar, Rakesh;
Devietti, Joseph;
Subramoney, Sreenivas;
Pokam, Gilles;
Litz, Heiner;
Kasikci, Baris.
(2021)
Twig: Profile-Guided BTB Prefetching for Data Center Applications.
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture.
Academic chapter/article/Conference paper
-
Asheim, Truls;
Grot, Boris;
Kumar, Rakesh.
(2021)
BTB-X: A Storage-Effective BTB Organization.
IEEE computer architecture letters.
volum 20 (2).
Academic article
-
Kumar, Rakesh;
Martínez, Alejandro;
Gonzalez, Antonio.
(2021)
A Variable Vector Length SIMD Architecture for HW/SW Co-designed Processors.
arXiv.org.
Academic article
2020
-
Alipour, Mehdi;
Kumar, Rakesh;
Kaxiras, Stefanos;
Black-Schaffer, David.
(2020)
Delay and Bypass: Ready and Criticality Aware Instruction Scheduling in Out-of-Order Processors.
IEEE Symposium on High-Performance Computer Architecture (HPCA).
Academic article
-
Asheim, Truls;
Kumar, Rakesh;
Grot, Boris.
(2020)
Fetch-Directed Instruction Prefetching Revisited.
arXiv.org.
Academic article
2019
-
Alipour, Mehdi;
Kumar, Rakesh;
Kaxiras, Stefanos;
Black-Schaffer, David.
(2019)
FIFOrder MicroArchitecture: Ready-Aware Instruction Scheduling for OoO Processors.
Design, Automation and Test in Europe (DATE).
Academic article
-
Kumar, Rakesh;
Alipour, Mehdi;
Black-Schaffer, David.
(2019)
Freeway: Maximizing MLP for Slice-Out-of-Order Execution.
IEEE Symposium on High-Performance Computer Architecture (HPCA).
Academic article
Journal publications
-
Kumar, Rakesh;
Alipour, Mehdi;
Black-Schaffer, David.
(2022)
Dependence-aware Slice Execution to Boost MLP in Slice-out-of-order Cores.
ACM Transactions on Architecture and Code Optimization (TACO).
volum 19 (2).
Academic article
-
Kumar, Rakesh;
Alipour, Mehdi;
Black-Schaffer, David.
(2022)
Freeway to Memory Level Parallelism in Slice-Out-of-Order Cores.
arXiv.org.
Academic article
-
Kumar, Rakesh;
Grot, Boris.
(2022)
Shooting Down the Server Front-End Bottleneck.
ACM Transactions on Computer Systems.
volum 38 (3-4).
Academic article
-
Asheim, Truls;
Grot, Boris;
Kumar, Rakesh.
(2021)
BTB-X: A Storage-Effective BTB Organization.
IEEE computer architecture letters.
volum 20 (2).
Academic article
-
Kumar, Rakesh;
Martínez, Alejandro;
Gonzalez, Antonio.
(2021)
A Variable Vector Length SIMD Architecture for HW/SW Co-designed Processors.
arXiv.org.
Academic article
-
Alipour, Mehdi;
Kumar, Rakesh;
Kaxiras, Stefanos;
Black-Schaffer, David.
(2020)
Delay and Bypass: Ready and Criticality Aware Instruction Scheduling in Out-of-Order Processors.
IEEE Symposium on High-Performance Computer Architecture (HPCA).
Academic article
-
Asheim, Truls;
Kumar, Rakesh;
Grot, Boris.
(2020)
Fetch-Directed Instruction Prefetching Revisited.
arXiv.org.
Academic article
-
Alipour, Mehdi;
Kumar, Rakesh;
Kaxiras, Stefanos;
Black-Schaffer, David.
(2019)
FIFOrder MicroArchitecture: Ready-Aware Instruction Scheduling for OoO Processors.
Design, Automation and Test in Europe (DATE).
Academic article
-
Kumar, Rakesh;
Alipour, Mehdi;
Black-Schaffer, David.
(2019)
Freeway: Maximizing MLP for Slice-Out-of-Order Execution.
IEEE Symposium on High-Performance Computer Architecture (HPCA).
Academic article
Part of book/report
-
Asheim, Truls;
Ahmed Khan, Tanvir;
Kasikci, Baris;
Kumar, Rakesh.
(2022)
Impact of Microarchitectural State Reuse on Serverless Functions.
8th International Workshop on Serverless Computing (WoSC).
Academic chapter/article/Conference paper
-
Asheim, Truls;
Grot, Boris;
Kumar, Rakesh.
(2022)
A Specialized BTB Organization for Servers.
The 31st International Conference on Parallel Architectures and Compilation Techniques (PACT).
Academic chapter/article/Conference paper
-
Ugedal, Odin;
Kumar, Rakesh.
(2022)
Mitigating Unnecessary Throttling in Linux CFS Bandwidth Control.
IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)..
Academic chapter/article/Conference paper
-
Ahmed Khan, Tanvir;
Brown, Nathan;
Sriraman, Akshitha;
Soundararajan, Niranjan;
Kumar, Rakesh;
Devietti, Joseph;
Subramoney, Sreenivas;
Pokam, Gilles;
Litz, Heiner;
Kasikci, Baris.
(2021)
Twig: Profile-Guided BTB Prefetching for Data Center Applications.
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture.
Academic chapter/article/Conference paper
Media
2020
-
Academic lectureStrupe, Fredrik; Kumar, Rakesh. (2020) Uncovering Hidden Instructions in Armv8-A Implementations. 9th International Workshop on Hardware and Architectural Support for Security and Privacy (HASP) ; 2020-10-17 - 2020-10-17.
2018
-
PosterAlipour, Mehdi; Kumar, Rakesh; Kaxiras, Stefanos; Black-Schaffer, David. (2018) A Minimum Out-of-Order Core. Student Research Competition at International Symposium on Microarchitecture (MICRO) . IEEE/ACM; 2018-10-20 - 2018-10-24.