MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 14 days ago • 154
MiroThinker-v1.0 Collection Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling • 7 items • Updated 10 days ago • 39
InternVL3.5-Core Collection This collection includes only the InternVL3.5 checkpoints that have completed the full training pipeline (i.e., Pretraining, SFT, MPO, and Cascade RL). • 30 items • Updated Sep 28 • 12
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents Paper • 2507.19478 • Published Jul 25 • 31
OmniCorpus Collection A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text • 6 items • Updated Sep 28 • 3
ZeroGUI: Automating Online GUI Learning at Zero Human Cost Paper • 2505.23762 • Published May 29 • 45
A Simple Aerial Detection Baseline of Multimodal Language Models Paper • 2501.09720 • Published Jan 16 • 2
Scalable Vision Language Model Training via High Quality Data Curation Paper • 2501.05952 • Published Jan 10 • 5
OmniCorpus 🐳 Collection [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text (https://github.com/OpenGVLab/OmniCorpus) • 5 items • Updated May 14 • 1
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing Paper • 2504.02826 • Published Apr 3 • 68
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Paper • 2406.08418 • Published Jun 12, 2024 • 31
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 14 items • Updated Oct 22 • 62
InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language Paper • 2305.05662 • Published May 9, 2023 • 4