Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wei Liu's picture
47 15

Wei Liu

lefutonku
·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago
SAM Audio: Segment Anything in Audio
upvoted a paper 8 days ago
Few-Step Distillation for Text-to-Image Generation: A Practical Guide
upvoted a paper 8 days ago
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers
View all activity

Organizations

None yet

Collections 1

topic_vlm_mllm
  • InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

    Paper • 2504.10479 • Published Apr 14 • 306
  • Qwen3 Technical Report

    Paper • 2505.09388 • Published May 14 • 320
  • InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

    Paper • 2508.18265 • Published Aug 25 • 211
  • How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

    Paper • 2509.18905 • Published Sep 23 • 29
topic_vlm_mllm
  • InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

    Paper • 2504.10479 • Published Apr 14 • 306
  • Qwen3 Technical Report

    Paper • 2505.09388 • Published May 14 • 320
  • InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

    Paper • 2508.18265 • Published Aug 25 • 211
  • How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

    Paper • 2509.18905 • Published Sep 23 • 29

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs