- 
	
	
	The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-DeterminismPaper • 2407.10457 • Published • 24
- 
	
	
	Adding Error Bars to Evals: A Statistical Approach to Language Model EvaluationsPaper • 2411.00640 • Published • 3
- 
	
	
	Law of the Weakest Link: Cross Capabilities of Large Language ModelsPaper • 2409.19951 • Published • 54
Vignesh
Vigneshwaran
		AI & ML interests
None yet
		Recent Activity
						liked
								a dataset
							
						about 2 months ago
						
					
						
						
						
						HuggingFaceFW/finepdfs
						
						updated 
								a collection
							
						5 months ago
						
					RLHF
						
						updated 
								a collection
							
						6 months ago
						
					RLHF
						 
								