Yuxian Gu's picture

Yuxian Gu

t1101675

·

https://t1101675.github.io/

AI & ML interests

Efficient methods for language models

Recent Activity

upvoted a paper 24 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

updated a Space about 2 months ago

t1101675/trackio

published a Space about 2 months ago

t1101675/trackio

View all activity

Organizations

New activity in allenai/social_i_qa 3 months ago

Convert dataset to Parquet

#4 opened 5 months ago by

New activity in MiniLLM/MiniLLM-gpt2-340M 9 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

New activity in MiniLLM/SFT-gpt2-120M 9 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

New activity in MiniLLM/SFT-gpt2-760M 9 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

New activity in Data-Selection/PDS-470M 9 months ago

Adding `safetensors` variant of this model

#1 opened 11 months ago by

New activity in Data-Selection/PDS-160M 9 months ago

Adding `safetensors` variant of this model

#1 opened 11 months ago by

Add link to paper

#2 opened 9 months ago by

New activity in Data-Selection/PDS-470M 9 months ago

Clarify Model Description and Add Project Page Link

#2 opened 9 months ago by

New activity in Data-Selection/PDS-1B 9 months ago

Add link to code repository

#2 opened 9 months ago by

New activity in Data-Selection/PDS-1.7B 9 months ago

Add link to Github and improve description

#2 opened 9 months ago by

New activity in Data-Selection/BSL-1.7B 9 months ago

Add link to code

#2 opened 9 months ago by

New activity in MiniLLM/MiniPLM-Qwen-500M 9 months ago

Improve model card: add paper abstract and link to paper

#1 opened 9 months ago by

New activity in MiniLLM/MiniPLM-llama3.1-212M 9 months ago

Add library name and link to code repository

#1 opened 9 months ago by

New activity in MiniLLM/MiniPLM-Mamba-130M 9 months ago

Improve MiniPLM-Mamba-130M model card

#1 opened 9 months ago by

New activity in MiniLLM/MiniPLM-Qwen-1.2B 9 months ago

Add link to code

#1 opened 9 months ago by

New activity in MiniLLM/Ref-Pretrain-Qwen-104M 9 months ago

Add link to code

#1 opened 9 months ago by

New activity in MiniLLM/Pretrain-Qwen-1.2B 9 months ago

Add link to code

#1 opened 9 months ago by

New activity in MiniLLM/Pretrain-Qwen-500M 9 months ago

No changes needed

#1 opened 9 months ago by

New activity in MiniLLM/Pretrain-Qwen-200M 9 months ago

Add link to code

#1 opened 9 months ago by

New activity in MiniLLM/VanillaKD-Pretrain-Qwen-200M 9 months ago

Add link to code and base model tag

#1 opened 9 months ago by