Image and Video Processing

Authors and titles for recent submissions

Total of 76 entries : 1-50 51-76

[1] arXiv:2602.05738 [pdf, html, other]: Title: Disc-Centric Contrastive Learning for Lumbar Spine Severity Grading

Sajjan Acharya, Pralisha Kansakar

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2] arXiv:2602.05453 [pdf, html, other]: Title: Towards Segmenting the Invisible: An End-to-End Registration and Segmentation Framework for Weakly Supervised Tumour Analysis

Budhaditya Mukhopadhyay, Chirag Mandal, Pavan Tummala, Naghmeh Mahmoodian, Andreas Nürnberger, Soumick Chatterjee

Comments: Accepted for AIBio at ECAI 2025

Journal-ref: Artificial Intelligence for Biomedical Data, AIBIO 2025, CCIS 2696, pp 1-14, 2026

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[3] arXiv:2602.05208 [pdf, html, other]: Title: Context-Aware Asymmetric Ensembling for Interpretable Retinopathy of Prematurity Screening via Active Query and Vascular Attention

Md. Mehedi Hassan, Taufiq Hasan

Comments: 16 pages, 6 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2602.05201 [pdf, html, other]: Title: Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Maojun Zhang, Haotian Wu, Richeng Jin, Deniz Gunduz, Krystian Mikolajczyk

Comments: Accepted by ICASSP 2026

Subjects: Image and Video Processing (eess.IV)
[5] arXiv:2602.05104 [pdf, other]: Title: Personalized White Matter Bundle Segmentation for Early Childhood

Elyssa M. McMaster, Michael E. Kim, Nancy R. Newlin, Gaurav Rudravaram, Adam M. Saunders, Aravind R. Krishnan, Jongyeon Yoon, Ji S. Kim, Bryce L. Geeraert, Meaghan V. Perdue, Catherine Lebel, Daniel Moyer, Kurt G. Schilling, Laurie E. Cutting, Bennett A. Landman

Subjects: Image and Video Processing (eess.IV)
[6] arXiv:2602.04983 [pdf, html, other]: Title: AI-Based Detection of In-Treatment Changes from Prostate MR-Linac Images

Seungbin Park, Peilin Wang, Ryan Pennell, Emily S. Weg, Himanshu Nagar, Timothy McClure, Mert R. Sabuncu, Daniel Margolis, Heejong Kim

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[7] arXiv:2602.04944 [pdf, other]: Title: Smart Diagnosis and Early Intervention in PCOS: A Deep Learning Approach to Women's Reproductive Health

Shayan Abrar, Samura Rahman, Ishrat Jahan Momo, Mahjabin Tasnim Samiha, B. M. Shahria Alam, Mohammad Tahmid Noor, Nishat Tasnim Niloy

Comments: 6 pages, 12 figures. This is the author's accepted manuscript of a paper accepted for publication in the Proceedings of the 16th International IEEE Conference on Computing, Communication and Networking Technologies (ICCCNT 2025). The final published version will be available via IEEE Xplore

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2602.05908 (cross-list from physics.app-ph) [pdf, html, other]: Title: Self-Portrait of the Focusing Process in Speckle: III. Tailoring Complex Spatio-Temporal Focusing Laws To Overcome Reverberations in Reflection Imaging

Elsa Giraudat, Flavien Bureau, William Lambert, Mathias Fink, Alexandre Aubry

Comments: 29 pages, 8 figures, 2 tables

Subjects: Applied Physics (physics.app-ph); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph); Optics (physics.optics)
[9] arXiv:2602.05078 (cross-list from cs.CV) [pdf, html, other]: Title: Food Portion Estimation: From Pixels to Calories

Gautham Vinod, Fengqing Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[10] arXiv:2602.04932 (cross-list from cs.LG) [pdf, html, other]: Title: Comparing Euclidean and Hyperbolic K-Means for Generalized Category Discovery

Mohamad Dalal, Thomas B. Moeslund, Joakim Bruslund Haurum

Comments: 11 pages, 4 figures. To be published in the VISAPP

Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[11] arXiv:2602.04904 (cross-list from cs.LG) [pdf, html, other]: Title: DCER: Dual-Stage Compression and Energy-Based Reconstruction

Yiwen Wang, Jiahao Qin

Comments: 13 pages, 2 figures, 8 tables. Submitted to ICML 2026. Code will be available on GitHub

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)

[12] arXiv:2602.04032 [pdf, html, other]: Title: MS-SCANet: A Multiscale Transformer-Based Architecture with Dual Attention for No-Reference Image Quality Assessment

Mayesha Maliha R. Mithila, Mylene C.Q. Farias

Comments: Published in ICASSP 2025, 5 pages, 3 figures

Journal-ref: Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2025

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[13] arXiv:2602.03998 [pdf, html, other]: Title: AtlasPatch: An Efficient and Scalable Tool for Whole Slide Image Preprocessing in Computational Pathology

Ahmed Alagha, Christopher Leclerc, Yousef Kotp, Omar Metwally, Calvin Moras, Peter Rentopoulos, Ghodsiyeh Rostami, Bich Ngoc Nguyen, Jumanah Baig, Abdelhakim Khellaf, Vincent Quoc-Huy Trinh, Rabeb Mizouni, Hadi Otrok, Jamal Bentahar, Mahdi S. Hosseini

Comments: Under review

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[14] arXiv:2602.03910 [pdf, other]: Title: CONRep: Uncertainty-Aware Vision-Language Report Drafting Using Conformal Prediction

Danial Elyassirad, Benyamin Gheiji, Mahsa Vatanparast, Amir Mahmoud Ahmadzadeh, Seyed Amir Asef Agah, Mana Moassefi, Meysam Tavakoli, Shahriar Faghani

Comments: 17 pages, 3 figures, 3 tables

Subjects: Image and Video Processing (eess.IV)
[15] arXiv:2602.03887 [pdf, other]: Title: To What Extent Do Token-Level Representations from Pathology Foundation Models Improve Dense Prediction?

Weiming Chen, Xitong Ling, Xidong Wang, Zhenyang Cai, Yijia Guo, Mingxi Fu, Ziyi Zeng, Minxi Ouyang, Jiawen Li, Yizhi Wang, Tian Guan, Benyou Wang, Yonghong He

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:2602.03870 [pdf, html, other]: Title: DINO-AD: Unsupervised Anomaly Detection with Frozen DINO-V3 Features

Jiayu Huo, Jingyuan Hong, Liyun Chen

Comments: Accepted by ISBI 2026, 4 pages, 2 figures, 3 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2602.04834 (cross-list from physics.optics) [pdf, html, other]: Title: ConvRML: High-Quality Lensless Imaging with Random Multi-Focal Lenslets

Leyla A. Kabuli, Clara S. Hung, Vasilisa Ponomarenko, Eric Markley, Laura Waller

Comments: 28 pages, 11 figures

Subjects: Optics (physics.optics); Image and Video Processing (eess.IV)
[18] arXiv:2602.04712 (cross-list from cs.CV) [pdf, other]: Title: SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation

David F. Ramirez, Tim Overman, Kristen Jaskie, Joe Marvin, Andreas Spanias

Comments: Submitted to 2026 IEEE Radar Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[19] arXiv:2602.04162 (cross-list from cs.CV) [pdf, html, other]: Title: Improving 2D Diffusion Models for 3D Medical Imaging with Inter-Slice Consistent Stochasticity

Chenhe Du, Qing Wu, Xuanyu Tian, Jingyi Yu, Hongjiang Wei, Yuyao Zhang

Comments: Accepted by ICLR 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

[20] arXiv:2602.02798 [pdf, html, other]: Title: Real-time topology-aware M-mode OCT segmentation for robotic deep anterior lamellar keratoplasty (DALK) guidance

Rosalinda Xiong, Jinglun Yu, Yaning Wang, Ziyi Huang, Jin U. Kang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[21] arXiv:2602.02795 [pdf, other]: Title: Super-Resolution and Denoising of Corneal B-Scan OCT Imaging Using Diffusion Model Plug-and-Play Priors

Yaning Wang, Jinglun Yu, Wenhan Guo, Ziyi Huang, Rosalinda Xiong, Yu Sun, Jin U. Kang

Journal-ref: Proceedings of SPIE, Vol. 13865, 13865-67 (2026)

Subjects: Image and Video Processing (eess.IV)
[22] arXiv:2602.02758 [pdf, other]: Title: Wide-field high-resolution microscopy via high-speed galvo scanning and real-time mosaicking

Ziyi Huang, Rosalinda Xiong, Yaning Wang, Jinglun Yu, Jin U. Kang

Subjects: Image and Video Processing (eess.IV)
[23] arXiv:2602.02755 [pdf, html, other]: Title: Physics-based generation of multilayer corneal OCT data via Gaussian modeling and MCML for AI-driven diagnostic and surgical guidance applications

Jinglun Yu, Yaning Wang, Rosalinda Xiong, Ziyi Huang, Kristina Irsch, Jin U. Kang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2602.02603 [pdf, html, other]: Title: EchoJEPA: A Latent Predictive Foundation Model for Echocardiography

Alif Munim, Adibvafa Fallahpour, Teodora Szasz, Ahmadreza Attarpour, River Jiang, Brana Sooriyakanthan, Maala Sooriyakanthan, Heather Whitney, Jeremy Slivnick, Barry Rubin, Wendy Tsang, Bo Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[25] arXiv:2602.02552 [pdf, html, other]: Title: Super-résolution non supervisée d'images hyperspectrales de télédétection utilisant un entraînement entièrement synthétique

Xinxin Xu, Yann Gousseau, Christophe Kervazo, Saïd Ladjal

Comments: in French language

Journal-ref: GRETSI 2025: XXXe Colloque Francophone de Traitement du Signal et des Images, Strasbourg, France, August 2025

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2602.03669 (cross-list from cs.CV) [pdf, other]: Title: Efficient Sequential Neural Network with Spatial-Temporal Attention and Linear LSTM for Robust Lane Detection Using Multi-Frame Images

Sandeep Patil, Yongqi Dong, Haneen Farah, Hans Hellendoorn

Comments: 14 pages, 9 figures, under review by IEEE T-ITS

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[27] arXiv:2602.03294 (cross-list from cs.CV) [pdf, html, other]: Title: LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices

Jonas Kühne, Christian Vogt, Michele Magno, Luca Benini

Comments: This article has been accepted for publication in the IEEE Sensors Journal (JSEN)

Journal-ref: IEEE Sensors Journal (Volume: 26, Issue: 3, 01 February 2026)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[28] arXiv:2602.03281 (cross-list from physics.app-ph) [pdf, html, other]: Title: Physics-Based Learning of the Wave Speed Landscape in Complex Media

Baptiste Hériard-Dubreuil, Emma Brenner, Benjamin Rio, William Lambert, Foucauld Chamming's, Mathias Fink, Alexandre Aubry

Comments: 40 pages, 8 figures, 1 table

Subjects: Applied Physics (physics.app-ph); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph); Optics (physics.optics)
[29] arXiv:2602.03264 (cross-list from cs.CV) [pdf, html, other]: Title: HypCBC: Domain-Invariant Hyperbolic Cross-Branch Consistency for Generalizable Medical Image Analysis

Francesco Di Salvo, Sebastian Doerrich, Jonas Alle, Christian Ledig

Comments: Accepted to Transactions on Machine Learning Research (TMLR)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[30] arXiv:2602.02713 (cross-list from physics.med-ph) [pdf, html, other]: Title: Perfusion Imaging and Single Material Reconstruction in Polychromatic Photon Counting CT

Namhoon Kim, Ashwin Pananjady, Amir Pourmorteza, Sara Fridovich-Keil

Comments: Code is available at this https URL

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[31] arXiv:2602.02567 (cross-list from cs.LG) [pdf, html, other]: Title: IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent Space

Jingyi Xu, Shengnan Wang, Weidong Yang, Siwei Tu, Lei Bai, Ben Fei

Comments: 9 pages, 6 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[32] arXiv:2602.02508 (cross-list from cs.IT) [pdf, html, other]: Title: Precoding-Oriented CSI Feedback Design with Mutual Information Regularized VQ-VAE

Xi Chen, Homa Esfahanizadeh, Foad Sohrabi

Comments: 5 pages, submitted to IEEE VTC conference

Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

[33] arXiv:2602.02031 [pdf, html, other]: Title: Edge-Aligned Initialization of Kernels for Steered Mixture-of-Experts

Martin Determann, Elvira Fleig

Subjects: Image and Video Processing (eess.IV)
[34] arXiv:2602.01681 [pdf, html, other]: Title: Hyperspectral Image Fusion with Spectral-Band and Fusion-Scale Agnosticism

Yu-Jie Liang, Zihan Cao, Liang-Jian Deng, Yang Yang, Malu Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[35] arXiv:2602.01513 [pdf, html, other]: Title: MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation

Xiaoxi Kong, Jieyu Yuan, Pengdi Chen, Yuanlin Zhang, Chongyi Li, Bin Li

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2602.01444 [pdf, other]: Title: A texture-based framework for foundational ultrasound models

Tal Grutman, Carmel Shinar, Tali Ilovitsh

Comments: This work has been submitted to the IEEE for possible publication

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2602.01325 [pdf, html, other]: Title: Unified ROI-based Image Compression Paradigm with Generalized Gaussian Model

Kai Hu, Junfu Tan, Fang Xu, Ramy Samy, Yu Liu

Comments: 14 pages, 18 figures

Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[38] arXiv:2602.01065 [pdf, html, other]: Title: Coordinate-conditioned Deconvolution for Scalable Spatially Varying High-Throughput Imaging

Qianwan Yang, Zhixiong Chen, Jiaqi Zhang, Ruipeng Guo, Guorong Hu, Lei Tian

Subjects: Image and Video Processing (eess.IV)
[39] arXiv:2602.00990 [pdf, other]: Title: Diagnostic Impact of Cine Clips for Thyroid Nodule Assessment on Ultrasound

Jichen Yang, Brian C. Allen, Kirti Magudia, Lisa M. Ho, Chad M. Miller, Maciej A. Mazurowski, Benjamin Wildman-Tobriner

Comments: 17 pages, 5 tables

Subjects: Image and Video Processing (eess.IV)
[40] arXiv:2602.00863 [pdf, html, other]: Title: Lightweight Super Resolution-enabled Coding Model for the JPEG Pleno Learning-based Point Cloud Coding Standard

André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira

Comments: 32 pages, 8 figures, submitted to Signal Processing: Image Communication

Subjects: Image and Video Processing (eess.IV)
[41] arXiv:2602.00483 [pdf, html, other]: Title: Recent Advances of End-to-End Video Coding Technologies for AVS Standard Development

Xihua Sheng, Xiongzhuang Liang, Chuanbo Tang, Zhirui Zuo, Yifan Bian, Yutao Xie, Zhuoyuan Li, Yuqi Li, Hui Xiang, Li Li, Dong Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[42] arXiv:2602.00221 [pdf, other]: Title: Benchmarking Vanilla GAN, DCGAN, and WGAN Architectures for MRI Reconstruction: A Quantitative Analysis

Humaira Mehwish, Hina Shakir, Muneeba Rashid, Asarim Aamir, Reema Qaiser Khan

Comments: 20 pages

Journal-ref: Edelweiss Applied Science and Technology January 2026

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2602.00220 [pdf, html, other]: Title: Advanced Geometric Correction Algorithms for 3D Medical Reconstruction: Comparison of Computed Tomography and Macroscopic Imaging

Tomasz Les, Tomasz Markiewicz, Malgorzata Lorent, Miroslaw Dziekiewicz, Krzysztof Siwek

Comments: 24 pages, 9 figures, submitted to Applied Sciences (MDPI)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[44] arXiv:2602.00215 [pdf, html, other]: Title: A Renderer-Enabled Framework for Computing Parameter Estimation Lower Bounds in Plenoptic Imaging Systems

Abhinav V. Sambasivan, Liam J. Coulter, Richard G. Paxman, Jarvis D. Haupt

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[45] arXiv:2602.00198 [pdf, other]: Title: SCALED: Surrogate-gradient for Codec-Aware Learning of Downsampling in ABR Streaming

Esteban Pesnel (COMPACT), Julien Le Tanou, Michael Ropert, Thomas Maugey (COMPACT), Aline Roumy (COMPACT)

Journal-ref: PCS 2025 - Picture Coding Symposium, IEEE Signal Processing Society, Dec 2025, Aachen (Aix la Chapelle), Germany

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[46] arXiv:2602.00186 [pdf, html, other]: Title: SurfelSoup: Learned Point Cloud Geometry Compression With a Probabilistic SurfelTree Representation

Tingyu Fan, Ran Gong, Yueyu Hu, Yao Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2602.00184 [pdf, html, other]: Title: Visible Singularities Guided Correlation Network for Limited-Angle CT Reconstruction

Yiyang Wen, Liu Shi, Zekun Zhou, WenZhe Shan, Qiegen Liu

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2602.00136 [pdf, other]: Title: Toward a Unified Semantic Loss Model for Deep JSCC-based Transmission of EO Imagery

Ti Ti Nguyen, Thanh-Dung Le, Vu Nguyen Ha, Duc-Dung Tran, Hung Nguyen-Kha, Dinh-Hieu Tran, Carlos L. Marcos-Rojas, Juan C. Merlano-Duncan, Symeon Chatzinotas

Comments: 5 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2602.00102 [pdf, html, other]: Title: Radiomics in Medical Imaging: Methods, Applications, and Challenges

Fnu Neha, Deepak kumar Shukla

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[50] arXiv:2602.00100 [pdf, html, other]: Title: Frequent Pattern Mining approach to Image Compression

Avinash Kadimisetty, C. Oswald, B. Sivalselvan

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Fri, 6 Feb 2026 (showing 11 of 11 entries )

Thu, 5 Feb 2026 (showing 8 of 8 entries )

Wed, 4 Feb 2026 (showing 13 of 13 entries )

Tue, 3 Feb 2026 (showing first 18 of 28 entries )