15 8 28

Yebowen Hu

huuuyeah

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4

authored a paper about 2 months ago

MeetingBank: A Benchmark Dataset for Meeting Summarization

authored a paper about 2 months ago

InFoBench: Evaluating Instruction Following Ability in Large Language Models

View all activity

Organizations

None yet

authored 7 papers about 2 months ago

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4

Paper • 2305.14702 • Published May 24, 2023 • 1

MeetingBank: A Benchmark Dataset for Meeting Summarization

Paper • 2305.17529 • Published May 27, 2023 • 1

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Paper • 2401.03601 • Published Jan 7, 2024 • 7

SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs

Paper • 2402.10979 • Published Feb 15, 2024

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Paper • 2406.12084 • Published Jun 17, 2024

Complex Logical Instruction Generation

Paper • 2508.09125 • Published Aug 12 • 39

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published Aug 28 • 21

upvoted a paper about 2 months ago

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published Aug 28 • 21

upvoted 2 papers 2 months ago

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs

Paper • 2508.18264 • Published Aug 25 • 25

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published Aug 21 • 46

authored a paper 2 months ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published Aug 21 • 46

New activity in huuuyeah/meetingbank 2 months ago

Update README.md

#2 opened 3 months ago by

parvezshah

liked 2 datasets 3 months ago

casehold/casehold

Viewer • Updated Oct 4, 2023 • 585k • 761 • 19

huuuyeah/DeFine

Viewer • Updated Jul 26 • 587 • 12 • 1

updated a dataset 3 months ago

huuuyeah/DeFine

Viewer • Updated Jul 26 • 587 • 12 • 1

published a dataset 3 months ago

huuuyeah/DeFine

Viewer • Updated Jul 26 • 587 • 12 • 1

upvoted a paper 9 months ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 25

liked a dataset about 1 year ago

huuuyeah/DecipherPref

Viewer • Updated Oct 3, 2024 • 8.31k • 11 • 2

updated a dataset about 1 year ago

huuuyeah/DecipherPref

Viewer • Updated Oct 3, 2024 • 8.31k • 11 • 2

liked a dataset about 1 year ago

huuuyeah/SportsGen

Viewer • Updated Oct 3, 2024 • 70k • 79 • 5

Yebowen Hu

AI & ML interests

Recent Activity

Organizations

huuuyeah's activity

Update README.md