


default search action
Zihan Liu 0002
Person information
- affiliation: Shanghai Jiao Tong University, Department of Computer Science and Engineering, Shanghai, China
Other persons with the same name
- Zihan Liu (aka: Zi-Han Liu) — disambiguation page
- Zihan Liu 0001 (aka: Zihan (Johan) Liu) — Nvidia, Santa Clara, CA, USA (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c14]Yangjie Zhou
, Wenting Shen
, Jingwen Leng
, Shuwen Lu
, Zihan Liu
, Weihao Cui
, Zhendong Zhang
, Wencong Xiao
, Baole Ai
, Yong Li
, Wei Lin
, Deze Zeng
, Yun Liang
, Quan Chen
, Ning Liu
, Minyi Guo
:
Voyager: Input-Adaptive Algebraic Transformations for High-Performance Graph Neural Networks. ASPLOS (3) 2025: 247-263
[c13]Yu Feng
, Zheng Liu
, Weikai Lin
, Zihan Liu
, Jingwen Leng
, Minyi Guo
, Zhezhi He
, Jieru Zhao
, Yuhao Zhu
:
StreamGrid: Streaming Point Cloud Analytics via Compulsory Splitting and Deterministic Termination. ASPLOS (2) 2025: 1189-1202
[c12]Weiming Hu, Haoyan Zhang
, Cong Guo, Yu Feng, Renyang Guan, Zhendong Hua, Zihan Liu, Yue Guan, Minyi Guo, Jingwen Leng:
M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type. HPCA 2025: 1112-1126
[c11]Zihan Liu
, Xinhao Luo, Junxian Guo, Wentao Ni, Yangjie Zhou, Yue Guan, Cong Guo, Weihao Cui, Yu Feng, Minyi Guo, Yuhao Zhu, Minjia Zhang, Chen Jin, Jingwen Leng:
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference. HPCA 2025: 1496-1509
[c10]Xingyang Li, Jie Jiang, Yu Feng, Yiming Gan, Jieru Zhao, Zihan Liu, Jingwen Leng, Minyi Guo:
SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity. ICCAD 2025: 1-9
[c9]Yu Feng
, Weikai Lin
, Yuge Cheng
, Zihan Liu
, Jingwen Leng
, Minyi Guo
, Chen Chen
, Shixuan Sun
, Yuhao Zhu
:
Lumina: Real-Time Neural Rendering by Exploiting Computational Redundancy. ISCA 2025: 1925-1939
[c8]Yangjie Zhou
, Honglin Zhu
, Qian Qiu
, Weihao Cui
, Zihan Liu
, Peng Chen
, Mohamed Wahib
, Cong Guo
, Siyuan Feng
, Jintao Meng
, Haidong Lan
, Jingwen Leng
, Yun Lin
, Jin Song Dong
, Wenxi Zhu
, Minwen Deng
:
A Sample-Free Compilation Framework for Efficient Dynamic Tensor Computation. SC 2025: 167-184
[i23]Weiming Hu, Haoyan Zhang, Cong Guo, Yu Feng, Renyang Guan, Zhendong Hua, Zihan Liu, Yue Guan, Minyi Guo, Jingwen Leng:
M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type. CoRR abs/2502.18755 (2025)
[i22]Zihan Liu, Xinhao Luo, Junxian Guo, Wentao Ni, Yangjie Zhou, Yue Guan, Cong Guo, Weihao Cui, Yu Feng, Minyi Guo, Yuhao Zhu, Minjia Zhang, Jingwen Leng, Chen Jin:
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference. CoRR abs/2503.02236 (2025)
[i21]Xiaotong Huang, He Zhu, Zihan Liu, Weikai Lin, Xiaohong Liu, Zhezhi He, Jingwen Leng, Minyi Guo, Yu Feng:
SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting. CoRR abs/2503.05168 (2025)
[i20]Yu Feng, Zheng Liu, Weikai Lin, Zihan Liu, Jingwen Leng, Minyi Guo, Zhezhi He, Jieru Zhao, Yuhao Zhu:
StreamGrid: Streaming Point Cloud Analytics via Compulsory Splitting and Deterministic Termination. CoRR abs/2503.05197 (2025)
[i19]Haosong Liu, Yuge Cheng, Zihan Liu, Aiyue Chen, Jing Lin, Yiwu Yao, Chen Chen, Jingwen Leng, Yu Feng, Minyi Guo:
Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers. CoRR abs/2506.05096 (2025)
[i18]Yu Feng, Weikai Lin, Yuge Cheng, Zihan Liu, Jingwen Leng, Minyi Guo, Chen Chen, Shixuan Sun, Yuhao Zhu:
Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy. CoRR abs/2506.05682 (2025)
[i17]Jiale Xu, Rui Zhang, Yi Xiong, Cong Guo, Zihan Liu, Yangjie Zhou, Weiming Hu, Hao Wu, Changxu Shao, Ziqing Wang, Yongjie Yuan, Junping Zhao, Minyi Guo, Jingwen Leng:
eLLM: Elastic Memory Management Framework for Efficient LLM Serving. CoRR abs/2506.15155 (2025)
[i16]Xingyang Li, Jie Jiang, Yu Feng, Yiming Gan, Jieru Zhao, Zihan Liu, Jingwen Leng, Minyi Guo:
SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity. CoRR abs/2507.21499 (2025)
[i15]Xinhao Luo, Zihan Liu, Yangjie Zhou, Shihan Fang, Ziyu Huang, Yu Feng, Chen Zhang, Shixuan Sun, Zhenzhe Zheng, Jingwen Leng, Minyi Guo:
ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive. CoRR abs/2508.18850 (2025)
[i14]Xiaotong Huang, He Zhu, Tianrui Ma, Yuxiang Xiong, Fangxin Liu, Zhezhi He, Yiming Gan, Zihan Liu, Jingwen Leng, Yu Feng, Minyi Guo:
Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing. CoRR abs/2511.18755 (2025)
[i13]Ziyu Huang, Yangjie Zhou, Zihan Liu, Xinhao Luo, Yijia Diao, Minyi Guo, Jidong Zhai, Yu Feng, Chen Zhang, Anbang Wu, Jingwen Leng:
FlashFuser: Expanding the Scale of Kernel Fusion for Compute-Intensive Operators via Inter-Core Connection. CoRR abs/2512.12949 (2025)
[i12]Yue Guan, Changming Yu, Shihan Fang, Weiming Hu, Zaifeng Pan, Zheng Wang, Zihan Liu, Yangjie Zhou, Yufei Ding, Minyi Guo, Jingwen Leng:
Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding. CoRR abs/2512.23858 (2025)- 2024
[j2]Yu Feng
, Weikai Lin
, Zihan Liu
, Jingwen Leng
, Minyi Guo
, Han Zhao
, Xiaofeng Hou
, Jieru Zhao
, Yuhao Zhu
:
Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture. ACM Trans. Archit. Code Optim. 21(4): 80:1-80:25 (2024)
[c7]Cong Guo
, Rui Zhang
, Jiale Xu
, Jingwen Leng
, Zihan Liu
, Ziyu Huang
, Minyi Guo
, Hao Wu
, Shouren Zhao
, Junping Zhao
, Ke Zhang
:
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching. ASPLOS (2) 2024: 450-466
[c6]Zihan Liu
, Wentao Ni
, Jingwen Leng
, Yu Feng
, Cong Guo
, Quan Chen
, Chao Li
, Minyi Guo
, Yuhao Zhu
:
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping. ASPLOS (2) 2024: 549-565
[c5]Yu Feng, Zihan Liu
, Jingwen Leng, Minyi Guo, Yuhao Zhu:
Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations. ISCA 2024: 1293-1308
[i11]Cong Guo
, Rui Zhang, Jiale Xu, Jingwen Leng, Zihan Liu, Ziyu Huang, Minyi Guo, Hao Wu, Shouren Zhao, Junping Zhao, Ke Zhang:
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching. CoRR abs/2401.08156 (2024)
[i10]Yu Feng, Zihan Liu, Jingwen Leng, Minyi Guo, Yuhao Zhu:
Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations. CoRR abs/2404.11852 (2024)
[i9]Jiale Xu, Rui Zhang, Cong Guo
, Weiming Hu, Zihan Liu, Feiyang Wu, Yu Feng, Shixuan Sun, Changxu Shao, Yuhong Guo, Junping Zhao, Ke Zhang, Minyi Guo, Jingwen Leng:
vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving. CoRR abs/2407.15309 (2024)
[i8]Yu Feng, Weikai Lin, Zihan Liu, Jingwen Leng, Minyi Guo, Han Zhao, Xiaofeng Hou, Jieru Zhao, Yuhao Zhu:
Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture. CoRR abs/2408.06608 (2024)
[i7]Yangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Cong Guo
, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Wenxi Zhu, Minwen Deng:
Vortex: Efficient Sample-Free Dynamic Tensor Program Optimization via Hardware-aware Strategy Space Hierarchization. CoRR abs/2409.01075 (2024)- 2023
[c4]Yangjie Zhou
, Yaoxu Song
, Jingwen Leng
, Zihan Liu
, Weihao Cui
, Zhendong Zhang
, Cong Guo
, Quan Chen
, Li Li
, Minyi Guo
:
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs. CF 2023: 52-62
[i6]Yangjie Zhou, Yaoxu Song, Jingwen Leng, Zihan Liu, Weihao Cui, Zhendong Zhang, Cong Guo, Quan Chen, Li Li, Minyi Guo:
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs. CoRR abs/2305.17408 (2023)
[i5]Xiaoxiang Shi, Chao Li, Zijun Li, Zihan Liu, Dianmo Sheng
, Quan Chen, Jingwen Leng, Minyi Guo:
DFlow: Efficient Dataflow-based Invocation Workflow Execution for Function-as-a-Service. CoRR abs/2306.11043 (2023)
[i4]Zihan Liu, Wentao Ni, Jingwen Leng, Yu Feng, Cong Guo
, Quan Chen, Chao Li, Minyi Guo, Yuhao Zhu:
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping. CoRR abs/2312.01712 (2023)- 2022
[c3]Zihan Liu
, Jingwen Leng
, Zhihui Zhang
, Quan Chen
, Chao Li
, Minyi Guo
:
VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling. ASPLOS 2022: 388-401
[c2]Cong Guo
, Chen Zhang
, Jingwen Leng, Zihan Liu
, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization. MICRO 2022: 1414-1433
[i3]Zihan Liu, Jingwen Leng, Zhihui Zhang, Quan Chen, Chao Li, Minyi Guo:
VELTAIR: Towards High-Performance Multi-tenant Deep Learning Services via Adaptive Compilation and Scheduling. CoRR abs/2201.06212 (2022)
[i2]Cong Guo
, Chen Zhang, Jingwen Leng, Zihan Liu, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization. CoRR abs/2208.14286 (2022)- 2020
[j1]Zihan Liu
, Jingwen Leng, Guandong Lu, Chenhui Wang, Quan Chen, Minyi Guo
:
Survey and design of paleozoic: a high-performance compiler tool chain for deep learning inference accelerator. CCF Trans. High Perform. Comput. 2(4): 332-347 (2020)
[c1]Zihan Liu
, Jingwen Leng, Quan Chen, Chao Li, Wenli Zheng, Li Li, Minyi Guo:
DLFusion: An Auto-Tuning Compiler for Layer Fusion on Deep Neural Network Accelerator. ISPA/BDCloud/SocialCom/SustainCom 2020: 118-127
[i1]Zihan Liu, Jingwen Leng, Quan Chen, Chao Li, Wenli Zheng, Li Li, Minyi Guo:
DLFusion: An Auto-Tuning Compiler for Layer Fusion on Deep Neural Network Accelerator. CoRR abs/2011.05630 (2020)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-02-17 23:56 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







