Image and Video Processing

Authors and titles for recent submissions

  • Fri, 6 Feb 2026
  • Thu, 5 Feb 2026
  • Wed, 4 Feb 2026
  • Tue, 3 Feb 2026
  • Mon, 2 Feb 2026

Total of 76 entries; showing entries 1-50

Fri, 6 Feb 2026 (showing 11 of 11 entries)

[1] arXiv:2602.05738 [pdf, html, other]
Title: Disc-Centric Contrastive Learning for Lumbar Spine Severity Grading
Sajjan Acharya, Pralisha Kansakar
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2] arXiv:2602.05453 [pdf, html, other]
Title: Towards Segmenting the Invisible: An End-to-End Registration and Segmentation Framework for Weakly Supervised Tumour Analysis
Budhaditya Mukhopadhyay, Chirag Mandal, Pavan Tummala, Naghmeh Mahmoodian, Andreas Nürnberger, Soumick Chatterjee
Comments: Accepted for AIBio at ECAI 2025
Journal-ref: Artificial Intelligence for Biomedical Data, AIBIO 2025, CCIS 2696, pp 1-14, 2026
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[3] arXiv:2602.05208 [pdf, html, other]
Title: Context-Aware Asymmetric Ensembling for Interpretable Retinopathy of Prematurity Screening via Active Query and Vascular Attention
Md. Mehedi Hassan, Taufiq Hasan
Comments: 16 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2602.05201 [pdf, html, other]
Title: Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance
Maojun Zhang, Haotian Wu, Richeng Jin, Deniz Gunduz, Krystian Mikolajczyk
Comments: Accepted by ICASSP 2026
Subjects: Image and Video Processing (eess.IV)
[5] arXiv:2602.05104 [pdf, other]
Title: Personalized White Matter Bundle Segmentation for Early Childhood
Elyssa M. McMaster, Michael E. Kim, Nancy R. Newlin, Gaurav Rudravaram, Adam M. Saunders, Aravind R. Krishnan, Jongyeon Yoon, Ji S. Kim, Bryce L. Geeraert, Meaghan V. Perdue, Catherine Lebel, Daniel Moyer, Kurt G. Schilling, Laurie E. Cutting, Bennett A. Landman
Subjects: Image and Video Processing (eess.IV)
[6] arXiv:2602.04983 [pdf, html, other]
Title: AI-Based Detection of In-Treatment Changes from Prostate MR-Linac Images
Seungbin Park, Peilin Wang, Ryan Pennell, Emily S. Weg, Himanshu Nagar, Timothy McClure, Mert R. Sabuncu, Daniel Margolis, Heejong Kim
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[7] arXiv:2602.04944 [pdf, other]
Title: Smart Diagnosis and Early Intervention in PCOS: A Deep Learning Approach to Women's Reproductive Health
Shayan Abrar, Samura Rahman, Ishrat Jahan Momo, Mahjabin Tasnim Samiha, B. M. Shahria Alam, Mohammad Tahmid Noor, Nishat Tasnim Niloy
Comments: 6 pages, 12 figures. This is the author's accepted manuscript of a paper accepted for publication in the Proceedings of the 16th International IEEE Conference on Computing, Communication and Networking Technologies (ICCCNT 2025). The final published version will be available via IEEE Xplore
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2602.05908 (cross-list from physics.app-ph) [pdf, html, other]
Title: Self-Portrait of the Focusing Process in Speckle: III. Tailoring Complex Spatio-Temporal Focusing Laws To Overcome Reverberations in Reflection Imaging
Elsa Giraudat, Flavien Bureau, William Lambert, Mathias Fink, Alexandre Aubry
Comments: 29 pages, 8 figures, 2 tables
Subjects: Applied Physics (physics.app-ph); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph); Optics (physics.optics)
[9] arXiv:2602.05078 (cross-list from cs.CV) [pdf, html, other]
Title: Food Portion Estimation: From Pixels to Calories
Gautham Vinod, Fengqing Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[10] arXiv:2602.04932 (cross-list from cs.LG) [pdf, html, other]
Title: Comparing Euclidean and Hyperbolic K-Means for Generalized Category Discovery
Mohamad Dalal, Thomas B. Moeslund, Joakim Bruslund Haurum
Comments: 11 pages, 4 figures. To be published in the VISAPP
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[11] arXiv:2602.04904 (cross-list from cs.LG) [pdf, html, other]
Title: DCER: Dual-Stage Compression and Energy-Based Reconstruction
Yiwen Wang, Jiahao Qin
Comments: 13 pages, 2 figures, 8 tables. Submitted to ICML 2026. Code will be available on GitHub
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)

Thu, 5 Feb 2026 (showing 8 of 8 entries)

[12] arXiv:2602.04032 [pdf, html, other]
Title: MS-SCANet: A Multiscale Transformer-Based Architecture with Dual Attention for No-Reference Image Quality Assessment
Mayesha Maliha R. Mithila, Mylene C.Q. Farias
Comments: Published in ICASSP 2025, 5 pages, 3 figures
Journal-ref: Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[13] arXiv:2602.03998 [pdf, html, other]
Title: AtlasPatch: An Efficient and Scalable Tool for Whole Slide Image Preprocessing in Computational Pathology
Ahmed Alagha, Christopher Leclerc, Yousef Kotp, Omar Metwally, Calvin Moras, Peter Rentopoulos, Ghodsiyeh Rostami, Bich Ngoc Nguyen, Jumanah Baig, Abdelhakim Khellaf, Vincent Quoc-Huy Trinh, Rabeb Mizouni, Hadi Otrok, Jamal Bentahar, Mahdi S. Hosseini
Comments: Under review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[14] arXiv:2602.03910 [pdf, other]
Title: CONRep: Uncertainty-Aware Vision-Language Report Drafting Using Conformal Prediction
Danial Elyassirad, Benyamin Gheiji, Mahsa Vatanparast, Amir Mahmoud Ahmadzadeh, Seyed Amir Asef Agah, Mana Moassefi, Meysam Tavakoli, Shahriar Faghani
Comments: 17 pages, 3 figures, 3 tables
Subjects: Image and Video Processing (eess.IV)
[15] arXiv:2602.03887 [pdf, other]
Title: To What Extent Do Token-Level Representations from Pathology Foundation Models Improve Dense Prediction?
Weiming Chen, Xitong Ling, Xidong Wang, Zhenyang Cai, Yijia Guo, Mingxi Fu, Ziyi Zeng, Minxi Ouyang, Jiawen Li, Yizhi Wang, Tian Guan, Benyou Wang, Yonghong He
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:2602.03870 [pdf, html, other]
Title: DINO-AD: Unsupervised Anomaly Detection with Frozen DINO-V3 Features
Jiayu Huo, Jingyuan Hong, Liyun Chen
Comments: Accepted by ISBI 2026, 4 pages, 2 figures, 3 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2602.04834 (cross-list from physics.optics) [pdf, html, other]
Title: ConvRML: High-Quality Lensless Imaging with Random Multi-Focal Lenslets
Leyla A. Kabuli, Clara S. Hung, Vasilisa Ponomarenko, Eric Markley, Laura Waller
Comments: 28 pages, 11 figures
Subjects: Optics (physics.optics); Image and Video Processing (eess.IV)
[18] arXiv:2602.04712 (cross-list from cs.CV) [pdf, other]
Title: SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation
David F. Ramirez, Tim Overman, Kristen Jaskie, Joe Marvin, Andreas Spanias
Comments: Submitted to 2026 IEEE Radar Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[19] arXiv:2602.04162 (cross-list from cs.CV) [pdf, html, other]
Title: Improving 2D Diffusion Models for 3D Medical Imaging with Inter-Slice Consistent Stochasticity
Chenhe Du, Qing Wu, Xuanyu Tian, Jingyi Yu, Hongjiang Wei, Yuyao Zhang
Comments: Accepted by ICLR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Wed, 4 Feb 2026 (showing 13 of 13 entries)

[20] arXiv:2602.02798 [pdf, html, other]
Title: Real-time topology-aware M-mode OCT segmentation for robotic deep anterior lamellar keratoplasty (DALK) guidance
Rosalinda Xiong, Jinglun Yu, Yaning Wang, Ziyi Huang, Jin U. Kang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[21] arXiv:2602.02795 [pdf, other]
Title: Super-Resolution and Denoising of Corneal B-Scan OCT Imaging Using Diffusion Model Plug-and-Play Priors
Yaning Wang, Jinglun Yu, Wenhan Guo, Ziyi Huang, Rosalinda Xiong, Yu Sun, Jin U. Kang
Journal-ref: Proceedings of SPIE, Vol. 13865, 13865-67 (2026)
Subjects: Image and Video Processing (eess.IV)
[22] arXiv:2602.02758 [pdf, other]
Title: Wide-field high-resolution microscopy via high-speed galvo scanning and real-time mosaicking
Ziyi Huang, Rosalinda Xiong, Yaning Wang, Jinglun Yu, Jin U. Kang
Subjects: Image and Video Processing (eess.IV)
[23] arXiv:2602.02755 [pdf, html, other]
Title: Physics-based generation of multilayer corneal OCT data via Gaussian modeling and MCML for AI-driven diagnostic and surgical guidance applications
Jinglun Yu, Yaning Wang, Rosalinda Xiong, Ziyi Huang, Kristina Irsch, Jin U. Kang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2602.02603 [pdf, html, other]
Title: EchoJEPA: A Latent Predictive Foundation Model for Echocardiography
Alif Munim, Adibvafa Fallahpour, Teodora Szasz, Ahmadreza Attarpour, River Jiang, Brana Sooriyakanthan, Maala Sooriyakanthan, Heather Whitney, Jeremy Slivnick, Barry Rubin, Wendy Tsang, Bo Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[25] arXiv:2602.02552 [pdf, html, other]
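Title: Super-résolution non supervisée d'images hyperspectrales de télédétection utilisant un entraînement entièrement synthétique (English: Unsupervised super-resolution of hyperspectral remote sensing images using fully synthetic training)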
Xinxin Xu, Yann Gousseau, Christophe Kervazo, Saïd Ladjal
Comments: in French language
Journal-ref: GRETSI 2025: XXXe Colloque Francophone de Traitement du Signal et des Images, Strasbourg, France, August 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2602.03669 (cross-list from cs.CV) [pdf, other]
Title: Efficient Sequential Neural Network with Spatial-Temporal Attention and Linear LSTM for Robust Lane Detection Using Multi-Frame Images
Sandeep Patil, Yongqi Dong, Haneen Farah, Hans Hellendoorn
Comments: 14 pages, 9 figures, under review by IEEE T-ITS
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[27] arXiv:2602.03294 (cross-list from cs.CV) [pdf, html, other]
Title: LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices
Jonas Kühne, Christian Vogt, Michele Magno, Luca Benini
Comments: This article has been accepted for publication in the IEEE Sensors Journal (JSEN)
Journal-ref: IEEE Sensors Journal (Volume: 26, Issue: 3, 01 February 2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[28] arXiv:2602.03281 (cross-list from physics.app-ph) [pdf, html, other]
Title: Physics-Based Learning of the Wave Speed Landscape in Complex Media
Baptiste Hériard-Dubreuil, Emma Brenner, Benjamin Rio, William Lambert, Foucauld Chamming's, Mathias Fink, Alexandre Aubry
Comments: 40 pages, 8 figures, 1 table
Subjects: Applied Physics (physics.app-ph); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph); Optics (physics.optics)
[29] arXiv:2602.03264 (cross-list from cs.CV) [pdf, html, other]
Title: HypCBC: Domain-Invariant Hyperbolic Cross-Branch Consistency for Generalizable Medical Image Analysis
Francesco Di Salvo, Sebastian Doerrich, Jonas Alle, Christian Ledig
Comments: Accepted to Transactions on Machine Learning Research (TMLR)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[30] arXiv:2602.02713 (cross-list from physics.med-ph) [pdf, html, other]
Title: Perfusion Imaging and Single Material Reconstruction in Polychromatic Photon Counting CT
Namhoon Kim, Ashwin Pananjady, Amir Pourmorteza, Sara Fridovich-Keil
Comments: Code is available at this https URL
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[31] arXiv:2602.02567 (cross-list from cs.LG) [pdf, html, other]
Title: IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent Space
Jingyi Xu, Shengnan Wang, Weidong Yang, Siwei Tu, Lei Bai, Ben Fei
Comments: 9 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[32] arXiv:2602.02508 (cross-list from cs.IT) [pdf, html, other]
Title: Precoding-Oriented CSI Feedback Design with Mutual Information Regularized VQ-VAE
Xi Chen, Homa Esfahanizadeh, Foad Sohrabi
Comments: 5 pages, submitted to IEEE VTC conference
Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Tue, 3 Feb 2026 (showing first 18 of 28 entries)

[33] arXiv:2602.02031 [pdf, html, other]
Title: Edge-Aligned Initialization of Kernels for Steered Mixture-of-Experts
Martin Determann, Elvira Fleig
Subjects: Image and Video Processing (eess.IV)
[34] arXiv:2602.01681 [pdf, html, other]
Title: Hyperspectral Image Fusion with Spectral-Band and Fusion-Scale Agnosticism
Yu-Jie Liang, Zihan Cao, Liang-Jian Deng, Yang Yang, Malu Zhang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[35] arXiv:2602.01513 [pdf, html, other]
Title: MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation
Xiaoxi Kong, Jieyu Yuan, Pengdi Chen, Yuanlin Zhang, Chongyi Li, Bin Li
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2602.01444 [pdf, other]
Title: A texture-based framework for foundational ultrasound models
Tal Grutman, Carmel Shinar, Tali Ilovitsh
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2602.01325 [pdf, html, other]
Title: Unified ROI-based Image Compression Paradigm with Generalized Gaussian Model
Kai Hu, Junfu Tan, Fang Xu, Ramy Samy, Yu Liu
Comments: 14 pages, 18 figures
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[38] arXiv:2602.01065 [pdf, html, other]
Title: Coordinate-conditioned Deconvolution for Scalable Spatially Varying High-Throughput Imaging
Qianwan Yang, Zhixiong Chen, Jiaqi Zhang, Ruipeng Guo, Guorong Hu, Lei Tian
Subjects: Image and Video Processing (eess.IV)
[39] arXiv:2602.00990 [pdf, other]
Title: Diagnostic Impact of Cine Clips for Thyroid Nodule Assessment on Ultrasound
Jichen Yang, Brian C. Allen, Kirti Magudia, Lisa M. Ho, Chad M. Miller, Maciej A. Mazurowski, Benjamin Wildman-Tobriner
Comments: 17 pages, 5 tables
Subjects: Image and Video Processing (eess.IV)
[40] arXiv:2602.00863 [pdf, html, other]
Title: Lightweight Super Resolution-enabled Coding Model for the JPEG Pleno Learning-based Point Cloud Coding Standard
André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira
Comments: 32 pages, 8 figures, submitted to Signal Processing: Image Communication
Subjects: Image and Video Processing (eess.IV)
[41] arXiv:2602.00483 [pdf, html, other]
Title: Recent Advances of End-to-End Video Coding Technologies for AVS Standard Development
Xihua Sheng, Xiongzhuang Liang, Chuanbo Tang, Zhirui Zuo, Yifan Bian, Yutao Xie, Zhuoyuan Li, Yuqi Li, Hui Xiang, Li Li, Dong Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[42] arXiv:2602.00221 [pdf, other]
Title: Benchmarking Vanilla GAN, DCGAN, and WGAN Architectures for MRI Reconstruction: A Quantitative Analysis
Humaira Mehwish, Hina Shakir, Muneeba Rashid, Asarim Aamir, Reema Qaiser Khan
Comments: 20 pages
Journal-ref: Edelweiss Applied Science and Technology January 2026
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2602.00220 [pdf, html, other]
Title: Advanced Geometric Correction Algorithms for 3D Medical Reconstruction: Comparison of Computed Tomography and Macroscopic Imaging
Tomasz Les, Tomasz Markiewicz, Malgorzata Lorent, Miroslaw Dziekiewicz, Krzysztof Siwek
Comments: 24 pages, 9 figures, submitted to Applied Sciences (MDPI)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[44] arXiv:2602.00215 [pdf, html, other]
Title: A Renderer-Enabled Framework for Computing Parameter Estimation Lower Bounds in Plenoptic Imaging Systems
Abhinav V. Sambasivan, Liam J. Coulter, Richard G. Paxman, Jarvis D. Haupt
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[45] arXiv:2602.00198 [pdf, other]
Title: SCALED : Surrogate-gradient for Codec-Aware Learning of Downsampling in ABR Streaming
Esteban Pesnel (COMPACT), Julien Le Tanou, Michael Ropert, Thomas Maugey (COMPACT), Aline Roumy (COMPACT)
Journal-ref: PCS 2025 - Picture Coding Symposium, IEEE Signal Processing Society, Dec 2025, Aachen (Aix la Chapelle), Germany
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[46] arXiv:2602.00186 [pdf, html, other]
Title: SurfelSoup: Learned Point Cloud Geometry Compression With a Probablistic SurfelTree Representation
Tingyu Fan, Ran Gong, Yueyu Hu, Yao Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2602.00184 [pdf, html, other]
Title: Visible Singularities Guided Correlation Network for Limited-Angle CT Reconstruction
Yiyang Wen, Liu Shi, Zekun Zhou, WenZhe Shan, Qiegen Liu
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2602.00136 [pdf, other]
Title: Toward a Unified Semantic Loss Model for Deep JSCC-based Transmission of EO Imagery
Ti Ti Nguyen, Thanh-Dung Le, Vu Nguyen Ha, Duc-Dung Tran, Hung Nguyen-Kha, Dinh-Hieu Tran, Carlos L. Marcos-Rojas, Juan C. Merlano-Duncan, Symeon Chatzinotas
Comments: 5 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2602.00102 [pdf, html, other]
Title: Radiomics in Medical Imaging: Methods, Applications, and Challenges
Fnu Neha, Deepak kumar Shukla
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[50] arXiv:2602.00100 [pdf, html, other]
Title: Frequent Pattern Mining approach to Image Compression
Avinash Kadimisetty, C. Oswald, B. Sivalselvan
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)