Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

Published in CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025