128 67 57

Kaicheng Yang

Kaichengalex

https://kaichengyang0828.github.io/Kaicheng-Yang0828.github.io/

kaichengyang0828

AI & ML interests

Multimodal Representation Learning/ Vision-Language Pretraining/DeepResearch

Recent Activity

upvoted a paper 3 days ago

Latent Implicit Visual Reasoning

liked a model 4 days ago

lmms-lab-encoder/onevision-encoder-large

upvoted a collection 4 days ago

Molmo2 Data

View all activity

Organizations

upvoted a paper 3 days ago

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published 5 days ago • 56

liked a model 4 days ago

lmms-lab-encoder/onevision-encoder-large

0.3B • Updated 4 days ago • 97 • 7

upvoted a collection 4 days ago

Molmo2 Data

Collection

Artifacts for the Molmo2 data release • 16 items • Updated 6 days ago • 26

liked a model 6 days ago

zai-org/GLM-4.7

Text Generation • 358B • Updated 6 days ago • 28.6k • • 1.21k

upvoted a paper 11 days ago

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices

Paper • 2512.14052 • Published 13 days ago • 39

upvoted a paper 13 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published 14 days ago • 97

updated a collection 17 days ago

SFT Dataset

Collection

6 items • Updated 17 days ago

liked a dataset 17 days ago

OneThink/OneThinker-train-data

Preview • Updated 6 days ago • 25.1k • 12

liked a model 18 days ago

lmms-lab/LLaVA-OneVision-1.5-4B-Instruct

Image-Text-to-Text • 5B • Updated Oct 21 • 3.39k • 15

liked a Space 20 days ago

Rex Omni

🏃

Analyze images to detect objects, points, keypoints, or text

liked a model 21 days ago

zai-org/GLM-4.6V

Image-Text-to-Text • 108B • Updated 20 days ago • 151k • • 350

upvoted a paper 25 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26 • 144

upvoted a paper 26 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 27 days ago • 237

upvoted a paper 27 days ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Paper • 2512.01342 • Published 28 days ago • 15

upvoted an article 27 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

29 days ago

•

258

upvoted a paper about 1 month ago

HunyuanOCR Technical Report

Paper • 2511.19575 • Published Nov 24 • 22

updated a collection about 1 month ago

Vision-Language Dataset

Collection

3 items • Updated Nov 21

published a dataset about 1 month ago

Kaichengalex/DanQing100M

Updated Nov 21 • 3

upvoted a paper about 1 month ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published Nov 16 • 103

upvoted a paper about 2 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7 • 42

Kaicheng Yang

AI & ML interests

Recent Activity

Organizations

Kaichengalex's activity

Rex Omni

Transformers v5: Simple model definitions powering the AI ecosystem