Base Model
- microsoft/Phi-3.5-mini-instruct
Dataset
- amazon-science/esci-data: uses the Japanese-language data from shopping_queries_dataset_products.parquet (339,059 records)
 
Parameters
- max_length: 1024
- learning_rate: 1e-5
- scheduler_type: WarmupCosineLR
- num_train_epochs: 3
- per_device_train_batch_size: 64
- per_device_eval_batch_size: 64
- gradient_accumulation_steps: 1
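
To make these settings concrete, the sketch below estimates how many optimizer steps they imply and what a warmup-plus-cosine learning-rate curve looks like. It is a generic illustration, not the training code used for this model and not DeepSpeed's exact `WarmupCosineLR` implementation. The device count (1), the warmup length (500 steps), the minimum LR ratio (0), and the use of all 339,059 records for training are assumptions that the card does not state.

```python
import math


def steps_per_epoch(num_examples, per_device_batch_size,
                    grad_accum_steps=1, num_devices=1):
    """Optimizer steps in one pass over the data (last partial batch kept)."""
    effective_batch = per_device_batch_size * grad_accum_steps * num_devices
    return math.ceil(num_examples / effective_batch)


def warmup_cosine_lr(step, total_steps, peak_lr, warmup_steps, min_ratio=0.0):
    """Linear warmup to peak_lr, then cosine decay to peak_lr * min_ratio."""
    if warmup_steps > 0 and step < warmup_steps:
        return peak_lr * (step + 1) / warmup_steps
    decay_steps = max(1, total_steps - warmup_steps)
    progress = min(1.0, (step - warmup_steps) / decay_steps)
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return peak_lr * (min_ratio + (1.0 - min_ratio) * cosine)


# Values taken from the card.
NUM_EXAMPLES = 339_059
PER_DEVICE_TRAIN_BATCH_SIZE = 64
GRADIENT_ACCUMULATION_STEPS = 1
NUM_TRAIN_EPOCHS = 3
LEARNING_RATE = 1e-5

# Assumptions (not stated in the card).
NUM_DEVICES = 1
WARMUP_STEPS = 500

per_epoch = steps_per_epoch(NUM_EXAMPLES, PER_DEVICE_TRAIN_BATCH_SIZE,
                            GRADIENT_ACCUMULATION_STEPS, NUM_DEVICES)
total_steps = per_epoch * NUM_TRAIN_EPOCHS

print(per_epoch)    # 5298 under the single-device assumption
print(total_steps)  # 15894 under the single-device assumption
for step in (0, WARMUP_STEPS, total_steps // 2, total_steps):
    print(step, warmup_cosine_lr(step, total_steps, LEARNING_RATE, WARMUP_STEPS))
```

With more devices the effective batch size grows and the step counts shrink proportionally, so the figures above are an upper bound for this configuration.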
Model tree for kasys/Phi-3.5_CPT_ESCI-v0.2.1
Base model
microsoft/Phi-3.5-mini-instruct