SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast-Aware Auto-Scaling
Shashwat Jaiswal*, Kunal Jain*, Yogesh Simmhan, Anjaly Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, et al. 'Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale'. arXiv [cs.DC], 2025. https://doi.org/10.48550/ARXIV.2502.14617.
