AI & ML interests
None defined yet.
Recent Activity
Papers
LLM Safety From Within: Detecting Harmful Content with Internal Representations
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
models 15
UofTCSSLab/C1-SFT-4B
Text Generation • 4B • Updated • 22
UofTCSSLab/C1-4B
Text Generation • 4B • Updated • 23
UofTCSSLab/Maia3-23M-ponder
Updated
UofTCSSLab/Maia3-79M
Updated • 14
UofTCSSLab/Maia3-ablate-3M
Updated • 2
UofTCSSLab/Maia3-23M
Updated • 1
UofTCSSLab/Maia3-5M
Updated • 3
UofTCSSLab/Maia3-ablate-relpos
Updated
UofTCSSLab/Maia3-ablate-abspos
Updated
UofTCSSLab/Maia3-ablate-hist1
Updated