Submitted by tianchez 3 VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs Om AI Lab 2