PhD Student in Computer Science at The Hong Kong Polytechnic University. Research interests: Multimodal Large Language Models, Embodied AI.