Baifeng Shi PRO

bfshi

https://bfshi.github.io

AI & ML interests

computer vision

Recent Activity

liked a dataset about 4 hours ago

VisGym/visgym_data

upvoted a paper about 4 hours ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

published a dataset 7 days ago

bfshi/HLVid

Organizations

upvoted a paper about 4 hours ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 3 days ago • 20

upvoted a paper 12 days ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published 16 days ago • 190

upvoted a paper about 2 months ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

upvoted a collection 3 months ago

NVILA (HuggingFace)

Collection

HuggingFace Transformers can load us. • 5 items • Updated Sep 13, 2025 • 5

upvoted 2 papers 3 months ago

Learning to Grasp Anything by Playing with Random Toys

Paper • 2510.12866 • Published Oct 14, 2025 • 6

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 179

upvoted a paper 7 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

upvoted 4 papers 9 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 63

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17, 2025 • 39

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93

upvoted 3 papers 10 months ago

upvoted a paper 12 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

upvoted a paper about 1 year ago

An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published Jan 9, 2025 • 41

upvoted a collection about 1 year ago

NVILA

Collection

11 items • Updated Sep 13, 2025 • 16

upvoted 2 papers about 1 year ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 59

Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Dataset

Paper • 2410.22325 • Published Oct 29, 2024 • 10

upvoted a paper over 1 year ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 69
