Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Recent Activity

commented on a paper about 2 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

upvoted a paper about 2 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

commented on a paper about 2 months ago

Why Language Models Hallucinate

Organizations

None yet

commented on a paper about 2 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 673

upvoted a paper about 2 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 673

commented on a paper about 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

upvoted a paper about 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

commented on a paper about 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

upvoted a paper about 2 months ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25 • 14

commented on 2 papers 2 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15 • 8

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15 • 8

upvoted a paper 2 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 36

commented on a paper 2 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 36

commented on a paper 3 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 236

upvoted 8 papers 3 months ago

Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe

Paper • 2508.01691 • Published Aug 3 • 9

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3 • 13

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published Aug 4 • 18

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

Paper • 2508.01059 • Published Aug 1 • 33

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 258

commented on a paper 3 months ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 120

