update files to remove sparse attention usage for the 0.5B model 5253c7f verified xcjthu commited on 18 days ago