Ruizhe Li
rzdiversity
ยท
AI & ML interests
Mechanistic Interpretability, Multimodal LLMs
Recent Activity
authored
a paper
1 day ago
Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs
submitted
a paper
1 day ago
Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs
Organizations
None yet