publications
publications by categories in reversed chronological order.
2026
- CVPRRestore-R1: Efficient Image Restoration Agents via Reinforcement Learning with Multimodal LLM Perceptual FeedbackIn The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings track, 2026
- CVPRTraining-Free Cross-Modal Alignment via Anchor Profiles with Statistical Significance TestingIn The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings track, 2026
- CVPRDUALVISION: RGB–Infrared Multimodal Large Language Models for Robust Visual ReasoningIn The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings track, 2026
- ICLRSeeing Through Words: Controlling Visual Retrieval Quality with Language Models2026arXiv preprint, 2026
2025
- EMNLPRepresentation Potentials of Foundation Models for Multimodal Alignment: A SurveyThe 2025 Conference on Empirical Methods in Natural Language Processing, 2025Cited by Prof. Yoshua Bengio in the International AI Safety Report 2026