YangPan
Akeeper
				AI & ML interests
None yet
		Recent Activity
liked a model about 2 months ago
baidu/ERNIE-4.5-21B-A3B-Thinking
upvoted a paper 8 months ago
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
upvoted a paper 9 months ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling