arxiv:2604.27263
théo gigant
AI & ML interests
multimodal
Recent Activity
authored a paper about 1 month ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation upvoted a paper about 1 month ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation submitted a paper about 1 month ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation