


default search action
Shuicheng Yan
- > Home > Persons > Shuicheng Yan
Publications
- 2025
- [c442]Shengqiong Wu, Hao Fei, Liangming Pan, William Yang Wang, Shuicheng Yan, Tat-Seng Chua:
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning. AAAI 2025: 8460-8468 - [c433]Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. ICLR 2025 - [i265]Shengqiong Wu, Weicai Ye, Jiahao Wang, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Shuicheng Yan, Hao Fei, Tat-Seng Chua:
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation. CoRR abs/2503.24379 (2025) - [i262]Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Weiming Wu, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. CoRR abs/2505.04620 (2025) - 2024
- [j310]Hao Fei
, Shengqiong Wu
, Meishan Zhang
, Min Zhang
, Tat-Seng Chua
, Shuicheng Yan
:
Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 7701-7719 (2024) - [c423]Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang:
Auto-Encoding Morph-Tokens for Multimodal LLM. ICML 2024 - [c418]Hao Fei, Shengqiong Wu, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing. NeurIPS 2024 - [i244]Kaihang Pan, Siliang Tang
, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang:
Auto-Encoding Morph-Tokens for Multimodal LLM. CoRR abs/2405.01926 (2024) - [i239]Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. CoRR abs/2406.05127 (2024) - [i235]Hao Fei, Shengqiong Wu, Meishan Zhang, Min Zhang, Tat-Seng Chua, Shuicheng Yan:
Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment. CoRR abs/2406.19255 (2024) - [i216]Shengqiong Wu, Hao Fei, Liangming Pan, William Yang Wang, Shuicheng Yan, Tat-Seng Chua:
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning. CoRR abs/2412.11124 (2024) - [i213]Hao Fei, Shengqiong Wu, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing. CoRR abs/2412.19806 (2024) - 2023
- [j297]Junbin Xiao
, Pan Zhou
, Angela Yao
, Yicong Li
, Richang Hong
, Shuicheng Yan
, Tat-Seng Chua
:
Contrastive Video Question Answering via Video Graph Transformer. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13265-13280 (2023) - [i203]Junbin Xiao, Pan Zhou, Angela Yao, Yicong Li, Richang Hong, Shuicheng Yan, Tat-Seng Chua:
Contrastive Video Question Answering via Video Graph Transformer. CoRR abs/2302.13668 (2023) - [i189]Keyu Duan, Qian Liu, Tat-Seng Chua, Shuicheng Yan, Wei Tsang Ooi, Qizhe Xie, Junxian He:
SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning. CoRR abs/2308.02565 (2023) - 2022
- [c379]Junbin Xiao, Pan Zhou
, Tat-Seng Chua, Shuicheng Yan:
Video Graph Transformer for Video Question Answering. ECCV (36) 2022: 39-58 - [i170]Junbin Xiao, Pan Zhou
, Tat-Seng Chua, Shuicheng Yan:
Video Graph Transformer for Video Question Answering. CoRR abs/2207.05342 (2022) - 2015
- [j143]Liqiang Nie, Meng Wang, Luming Zhang, Shuicheng Yan, Bo Zhang, Tat-Seng Chua:
Disease Inference from Health-Related Questions via Sparse Deep Learning. IEEE Trans. Knowl. Data Eng. 27(8): 2107-2119 (2015) - [c253]Xiangyu Chen, Yanwu Xu
, Shuicheng Yan, Tat-Seng Chua, Damon Wing Kee Wong
, Tien Yin Wong
, Jiang Liu
:
Discriminative Feature Selection for Multiple Ocular Diseases Classification by Sparse Induced Graph Regularized Group Lasso. MICCAI (2) 2015: 11-19 - 2014
- [j121]Hanwang Zhang
, Zheng-Jun Zha
, Yang Yang, Shuicheng Yan, Tat-Seng Chua:
Robust (Semi) Nonnegative Graph Embedding. IEEE Trans. Image Process. 23(7): 2996-3012 (2014) - [j109]Hanwang Zhang
, Zheng-Jun Zha
, Yang Yang, Shuicheng Yan, Yue Gao, Tat-Seng Chua:
Attribute-Augmented Semantic Hierarchy: Towards a Unified Framework for Content-Based Image Retrieval. ACM Trans. Multim. Comput. Commun. Appl. 11(1s): 21:1-21:21 (2014) - [c230]Hanwang Zhang
, Yang Yang, Huan-Bo Luan, Shuicheng Yan, Tat-Seng Chua:
Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes. ACM Multimedia 2014: 187-196 - [p4]Yi-Liang Zhao, Qiang Cheng, Shuicheng Yan, Daqing Zhang, Tat-Seng Chua:
Community Understanding in Location-based Social Networks. Human-Centered Social Media Analytics 2014: 43-74 - 2013
- [j104]Jinhui Tang
, Shuicheng Yan, Chunxia Zhao, Tat-Seng Chua, Ramesh C. Jain:
Label-specific training set construction from web resource for image annotation. Signal Process. 93(8): 2199-2204 (2013) - [j90]Yan-Tao Zheng, Shuicheng Yan, Zheng-Jun Zha
, Yiqun Li, Xiangdong Zhou, Tat-Seng Chua, Ramesh C. Jain:
GPSView: A scenic driving route planner. ACM Trans. Multim. Comput. Commun. Appl. 9(1): 3:1-3:18 (2013) - [j89]Jian Dong, Bin Cheng, Xiangyu Chen, Tat-Seng Chua, Shuicheng Yan, Xi Zhou:
Robust image annotation via simultaneous feature and sample outlier pursuit. ACM Trans. Multim. Comput. Commun. Appl. 9(4): 24:1-24:20 (2013) - [j87]Jinhui Tang
, Qiang Chen, Meng Wang, Shuicheng Yan, Tat-Seng Chua, Ramesh C. Jain:
Towards optimizing human labeling for interactive image tagging. ACM Trans. Multim. Comput. Commun. Appl. 9(4): 29:1-29:18 (2013) - [j86]Yi-Liang Zhao, Qiang Chen, Shuicheng Yan, Tat-Seng Chua, Daqing Zhang:
Detecting profilable and overlapping communities with user-generated multimedia contents in LBSNs. ACM Trans. Multim. Comput. Commun. Appl. 10(1): 3:1-3:22 (2013) - [j85]Xiangyu Chen, Yadong Mu, Hairong Liu, Shuicheng Yan, Yong Rui, Tat-Seng Chua:
Large-scale multilabel propagation based on efficient sparse graph construction. ACM Trans. Multim. Comput. Commun. Appl. 10(1): 6:1-6:20 (2013) - [c201]Hanwang Zhang
, Zheng-Jun Zha
, Yang Yang, Shuicheng Yan, Yue Gao, Tat-Seng Chua:
Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval. ACM Multimedia 2013: 33-42 - 2012
- [j83]Yadong Mu, Xiangyu Chen, Xianglong Liu
, Tat-Seng Chua, Shuicheng Yan:
Multimedia semantics-aware query-adaptive hashing with bits reconfigurability. Int. J. Multim. Inf. Retr. 1(1): 59-70 (2012) - [j77]Yue Gao, Jinhui Tang, Richang Hong, Shuicheng Yan, Qionghai Dai, Naiyao Zhang, Tat-Seng Chua:
Camera Constraint-Free View-Based 3-D Object Retrieval. IEEE Trans. Image Process. 21(4): 2269-2281 (2012) - [j70]Meng Wang, Richang Hong, Xiao-Tong Yuan, Shuicheng Yan, Tat-Seng Chua:
Movie2Comics: Towards a Lively Video Content Presentation. IEEE Trans. Multim. 14(3-2): 858-870 (2012) - [j69]Meng Wang, Richang Hong, Guangda Li, Zheng-Jun Zha
, Shuicheng Yan, Tat-Seng Chua:
Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification. IEEE Trans. Multim. 14(4): 975-985 (2012) - [j67]Xiaobai Liu, Shuicheng Yan, Tat-Seng Chua, Hai Jin:
Image label completion by pursuing contextual decomposability. ACM Trans. Multim. Comput. Commun. Appl. 8(2): 21:1-21:20 (2012) - [j66]Xiaobai Liu, Shuicheng Yan, Bin Cheng, Jinhui Tang
, Tat-Seng Chua, Hai Jin:
Label-to-region with continuity-biased bi-layer sparsity priors. ACM Trans. Multim. Comput. Commun. Appl. 8(4): 50:1-50:23 (2012) - [c185]Hanwang Zhang
, Zheng-Jun Zha
, Shuicheng Yan, Meng Wang, Tat-Seng Chua:
Robust Non-negative Graph Embedding: Towards noisy data, unreliable graphs, and noisy labels. CVPR 2012: 2464-2471 - [c172]Liqiang Nie, Shuicheng Yan, Meng Wang, Richang Hong, Tat-Seng Chua:
Harvesting visual concepts for image search with complex queries. ACM Multimedia 2012: 59-68 - [c171]Hanwang Zhang
, Zheng-Jun Zha
, Shuicheng Yan, Jingwen Bian, Tat-Seng Chua:
Attribute feedback. ACM Multimedia 2012: 79-88 - 2011
- [j54]Jinhui Tang, Richang Hong, Shuicheng Yan, Tat-Seng Chua, Guo-Jun Qi
, Ramesh C. Jain:
Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images. ACM Trans. Intell. Syst. Technol. 2(2): 14:1-14:15 (2011) - [j48]Richang Hong, Jinhui Tang, Hung-Khoon Tan, Chong-Wah Ngo
, Shuicheng Yan, Tat-Seng Chua:
Beyond search: Event-driven summarization for web videos. ACM Trans. Multim. Comput. Commun. Appl. 7(4): 35:1-35:18 (2011) - [j47]Richang Hong, Meng Wang, Xiao-Tong Yuan, Mengdi Xu, Jianguo Jiang, Shuicheng Yan, Tat-Seng Chua:
Video accessibility enhancement for hearing-impaired users. ACM Trans. Multim. Comput. Commun. Appl. 7(Supplement): 24 (2011) - [c144]Xiangyu Chen, Xiao-Tong Yuan, Qiang Chen, Shuicheng Yan, Tat-Seng Chua:
Multi-label visual classification with label exclusive context. ICCV 2011: 834-841 - [c136]Yadong Mu, Xiangyu Chen, Tat-Seng Chua, Shuicheng Yan:
Learning reconfigurable hashing for diverse semantics. ICMR 2011: 7 - [c135]Xiangyu Chen, Xiaotong Yuan, Shuicheng Yan, Jinhui Tang, Yong Rui, Tat-Seng Chua:
Towards multi-semantic image annotation with graph regularized exclusive group lasso. ACM Multimedia 2011: 263-272 - [i6]Jinhui Tang, Shuicheng Yan, Tat-Seng Chua, Ramesh C. Jain:
Label-Specific Training Set Construction from Web Resource for Image Annotation. CoRR abs/1107.2859 (2011) - 2010
- [c103]Xiangyu Chen, Yadong Mu, Shuicheng Yan, Tat-Seng Chua:
Efficient large-scale image annotation by probabilistic collaborative multi-label propagation. ACM Multimedia 2010: 35-44 - [c100]Richang Hong, Meng Wang, Mengdi Xu, Shuicheng Yan, Tat-Seng Chua:
Dynamic captioning: video accessibility enhancement for hearing impairment. ACM Multimedia 2010: 421-430 - [c98]Richang Hong, Xiaotong Yuan, Mengdi Xu, Meng Wang, Shuicheng Yan, Tat-Seng Chua:
Movie2Comics: a feast of multimedia artwork. ACM Multimedia 2010: 611-614 - [c96]Jinhui Tang, Qiang Chen, Shuicheng Yan, Tat-Seng Chua, Ramesh C. Jain:
One person labels one million images. ACM Multimedia 2010: 1019-1022 - [c95]Richang Hong, Meng Wang, Guangda Li, Xiaotong Yuan, Shuicheng Yan, Tat-Seng Chua:
iComics: automatic conversion of movie into comics. ACM Multimedia 2010: 1599-1602 - [c93]Guangda Li, Richang Hong, Yantao Zheng, Shuicheng Yan, Tat-Seng Chua:
Learning Cooking Techniques from YouTube. MMM 2010: 713-718 - [c89]Xiangyu Chen, Jin Yuan, Liqiang Nie, Zheng-Jun Zha, Shuicheng Yan, Tat-Seng Chua:
TRECVID 2010 Known-item Search by NUS. TRECVID 2010 - 2009
- [c84]Ju Sun
, Xiao Wu, Shuicheng Yan, Loong Fah Cheong, Tat-Seng Chua, Jintao Li:
Hierarchical spatio-temporal context modeling for action recognition. CVPR 2009: 2004-2011 - [c72]Richang Hong, Jinhui Tang, Hung-Khoon Tan, Shuicheng Yan, Chong-Wah Ngo
, Tat-Seng Chua:
Event driven summarization for web videos. WSM@MM 2009: 43-48 - [c70]Xiaobai Liu, Bin Cheng, Shuicheng Yan, Jinhui Tang, Tat-Seng Chua, Hai Jin:
Label to region by bi-layer sparsity priors. ACM Multimedia 2009: 115-124 - [c69]Jinhui Tang, Shuicheng Yan, Richang Hong, Guo-Jun Qi
, Tat-Seng Chua:
Inferring semantic concepts from community-contributed images and noisy tags. ACM Multimedia 2009: 223-232

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-08-21 01:41 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint