publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- CVPRVQ-VA World: Towards High-Quality Visual Question-Visual AnsweringIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
, 2026
- CVPRAn Empirical Study on How Video-LLMs Answer Video QuestionsIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
- AAAIWhere and What Matters: Sensitivity-Aware Task Vectors for Many-Shot Multimodal In-Context LearningIn AAAI Conference on Artificial Intelligence (AAAI), 2026
- ICLRSparsity Forcing: Reinforcing Token Sparsity of MLLMsIn International Conference on Learning Representations (ICLR), 2026
- CVPREvaluating and Advancing Multimodal Large Language Models in Ability LensIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026
2025
- Tech Report
- PreprintLightBagel: A Light-Weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation2025
- PreprintUniMedVL: Unifying Medical Multimodal Understanding and Generation Through Observation-Knowledge-Analysis. Contributor. , 2025
- CVPRDrVideo: Document Retrieval Based Long Video UnderstandingIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
- EMNLPInfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video UnderstandingIn Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
- Preprint
- CVPRPoint-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud AnalysisIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
2024
- Preprint
- Preprint
- CVPRJRDB-PanoTrack: An Open-World Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human EnvironmentsIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
2023
- Preprint
2022
- NeurIPSRTFormer: Efficient Design for Real-Time Semantic Segmentation with TransformerIn Advances in Neural Information Processing Systems (NeurIPS). Spotlight Presentation , 2022