meta-llama/Llama-4-Maverick-17B-128E-Instruct • Image-Text-to-Text • 402B params • Updated May 22 • 20.5k downloads • 418 likes
meta-llama/Llama-4-Scout-17B-16E-Instruct • Image-Text-to-Text • 109B params • Updated May 22 • 210k downloads • 1.13k likes
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster • Paper • arXiv:2503.09662 • Published Mar 12 • 33 upvotes
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • Paper • arXiv:2503.09641 • Published Mar 12 • 40 upvotes
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models • Paper • arXiv:2503.09573 • Published Mar 12 • 73 upvotes
New Trends for Modern Machine Translation with Large Reasoning Models • Paper • arXiv:2503.10351 • Published Mar 13 • 25 upvotes
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning • Paper • arXiv:2503.09516 • Published Mar 12 • 36 upvotes
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning • Paper • arXiv:2503.07572 • Published Mar 10 • 47 upvotes
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning • Paper • arXiv:2503.10480 • Published Mar 13 • 55 upvotes
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories • Paper • arXiv:2503.08625 • Published Mar 11 • 27 upvotes
Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning • Paper • arXiv:2503.07002 • Published Mar 10 • 39 upvotes
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning • Paper • arXiv:2503.10291 • Published Mar 13 • 36 upvotes
Unified Reward Model for Multimodal Understanding and Generation • Paper • arXiv:2503.05236 • Published Mar 7 • 123 upvotes
Implicit Reasoning in Transformers is Reasoning through Shortcuts • Paper • arXiv:2503.07604 • Published Mar 10 • 23 upvotes
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training • Paper • arXiv:2503.08525 • Published Mar 11 • 17 upvotes
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL • Paper • arXiv:2503.07536 • Published Mar 10 • 88 upvotes