


default search action
Weitai Kang
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c5]Weitai Kang, Mengxue Qu, Jyoti Kini, Yunchao Wei, Mubarak Shah, Yan Yan:
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention. ICLR 2025
[c4]Weitai Kang
, Luowei Zhou
, Junyi Wu
, Changchang Sun
, Yan Yan
:
Visual Grounding with Attention-Driven Constraint Balancing. ACM Multimedia 2025: 1637-1645
[i15]Wenxin Chen, Mengxue Qu, Weitai Kang, Yan Yan, Yao Zhao, Yunchao Wei:
3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation. CoRR abs/2504.12599 (2025)
[i14]Bin Lei, Weitai Kang, Zijian Zhang, Winson Chen, Xi Xie, Shan Zuo, Mimi Xie, Ali Payani, Mingyi Hong, Yan Yan, Caiwen Ding:
InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction. CoRR abs/2505.10887 (2025)
[i13]Weitai Kang, Bin Lei, Gaowen Liu, Caiwen Ding, Yan Yan:
GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning. CoRR abs/2508.04389 (2025)
[i12]Weitai Kang, Weiming Zhuang, Zhizhong Li, Yan Yan, Lingjuan Lyu:
ExpVG: Investigating the Design Space of Visual Grounding in Multimodal Large Language Model. CoRR abs/2508.08066 (2025)
[i11]Weitai Kang, Jason Kuen, Mengwei Ren, Zijun Wei, Yan Yan, Kangning Liu:
VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction. CoRR abs/2512.11099 (2025)
[i10]Jiachen Tao, Benjamin Planche, Van Nguyen Nguyen, Junyi Wu, Yuchun Liu, Haoxuan Wang, Zhongpai Gao, Gengyu Zhang, Meng Zheng, Feiran Wang, Anwesa Choudhuri, Zhenghao Zhao, Weitai Kang, Terrence Chen, Yan Yan, Ziyan Wu:
From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields. CoRR abs/2512.12459 (2025)- 2024
[c3]Junyi Wu, Bin Duan, Weitai Kang, Hao Tang
, Yan Yan:
Token Transformation Matters: Towards Faithful Post-Hoc Explanation for Vision Transformer. CVPR 2024: 10926-10935
[c2]Junyi Wu, Weitai Kang, Hao Tang
, Yuan Hong
, Yan Yan:
On the Faithfulness of Vision Transformer Explanations. CVPR 2024: 10936-10945
[c1]Weitai Kang
, Gaowen Liu, Mubarak Shah, Yan Yan:
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. ECCV (38) 2024: 57-75
[i9]Junyi Wu, Bin Duan, Weitai Kang, Hao Tang
, Yan Yan:
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer. CoRR abs/2403.14552 (2024)
[i8]Junyi Wu, Weitai Kang, Hao Tang
, Yuan Hong
, Yan Yan:
On the Faithfulness of Vision Transformer Explanations. CoRR abs/2404.01415 (2024)
[i7]Weitai Kang, Mengxue Qu, Jyoti Kini, Yunchao Wei, Mubarak Shah, Yan Yan:
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention. CoRR abs/2405.18295 (2024)
[i6]Weitai Kang, Gaowen Liu, Mubarak Shah, Yan Yan:
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. CoRR abs/2407.03200 (2024)
[i5]Weitai Kang, Luowei Zhou, Junyi Wu, Changchang Sun, Yan Yan:
Visual Grounding with Attention-Driven Constraint Balancing. CoRR abs/2407.03243 (2024)
[i4]Weitai Kang, Mengxue Qu, Yunchao Wei, Yan Yan:
ACTRESS: Active Retraining for Semi-supervised Visual Grounding. CoRR abs/2407.03251 (2024)
[i3]Yuzhang Shang
, Bingxin Xu, Weitai Kang, Mu Cai, Yuheng Li, Zehao Wen, Zhen Dong, Kurt Keutzer, Yong Jae Lee, Yan Yan:
Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner. CoRR abs/2409.12963 (2024)
[i2]Weitai Kang, Haifeng Huang, Yuzhang Shang
, Mubarak Shah, Yan Yan:
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning. CoRR abs/2410.00255 (2024)
[i1]Bin Lei, Yuchen Li, Yiming Zeng, Tao Ren, Yi Luo, Tianyu Shi, Zitian Gao, Zeyu Hu, Weitai Kang, Qiuwu Chen:
Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage. CoRR abs/2411.01114 (2024)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-02-26 23:18 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







