AI & ML interests
Natural Language Processing, Image Generation
Papers
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing
Stress Testing Image Generation Models
Critique-Coder
MMLU-Pro
Reasoning in the pixel space
Advancing LLMs' general reasoning capabilities
Video Mamba
A collection of models and datasets from ABC: Achieving Better Control of Multimodal Embeddings using VLMs.
PixelWorld
The dataset and models for CritiqueFineTuning
- TIGER-Lab/WebInstruct-CFT
  Viewer • Updated • 654k • 128 • 56
- TIGER-Lab/Qwen2.5-Math-7B-CFT
  Text Generation • 8B • Updated • 55 • 8
- TIGER-Lab/Qwen2.5-32B-Instruct-CFT
  Text Generation • 33B • Updated • 7 • 6
- Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
  Paper • 2501.17703 • Published • 58
The generalist image editing model
The VLM2Vec embedding models.
The datasets and models for the MAmmoTH project
ImagenHub
The structure knowledge grounded language model
Mantis model family optimized for multi-image reasoning with interleaved text/image format
Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
BrowserAgent: An agent that can interact with a browser to complete tasks
VideoScore2
Web explorer model
Converting a VLM to a general embedding model
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem
The pioneering work in Dialogue-driven Movie Shot Generation
SoTA VLM for Reasoning
- TIGER-Lab/VL-Rethinker-72B
  Visual Question Answering • 73B • Updated • 76 • 5
- VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
  Paper • 2504.08837 • Published • 43
- TIGER-Lab/VL-Rethinker-7B
  Image-Text-to-Text • 8B • Updated • 1.23k • 13
- TIGER-Lab/VL-Reasoner-72B
  Visual Question Answering • 73B • Updated • 4 • 3
Scaling up multimodal data
- TIGER-Lab/VisualWebInstruct-Recall
  Viewer • Updated • 361k • 371 • 4
- TIGER-Lab/VisualWebInstruct-Seed
  Viewer • Updated • 60.3k • 182 • 18
- TIGER-Lab/VisualWebInstruct
  Viewer • Updated • 1.91M • 1.2k • 38
- VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search
  Paper • 2503.10582 • Published • 24
The dataset and model for MAmmoTH-VL
Video Augmentation for Synthetic Video Instruction-following Data Generation
- TIGER-Lab/VISTA-LongVA
  Video-Text-to-Text • 8B • Updated • 1 • 2
- TIGER-Lab/VISTA-Mantis
  Video-Text-to-Text • 8B • Updated
- TIGER-Lab/VISTA-VideoLLaVA
  Video-Text-to-Text • 7B • Updated • 1
- VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation
  Paper • 2412.00927 • Published • 29
Model variants of TIGERScore checkpoints and the associated dataset
- TIGER-Lab/TIGERScore-13B
  Text Generation • 13B • Updated • 420 • 18
- TIGER-Lab/MetricInstruct
  Viewer • Updated • 42.5k • 83 • 13
- TIGER-Lab/TIGERScore-7B
  Text Generation • 7B • Updated • 26 • 2
- TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks
  Paper • 2310.00752 • Published • 3
The dataset and model for the UniIR project
ConsistI2V Image-to-Video generation models
Scaling up instruction data from the web to build better LLMs
Long-context research projects