Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
NeelNanda
/
Attn_Only_3L512W_C4_Code
like
0
Transformers
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Attn_Only_3L512W_C4_Code
36 GB
1 contributor
History:
2 commits
NeelNanda
Auto Commit
d4bef94
about 3 years ago
checkpoints
Auto Commit
about 3 years ago
.gitattributes
Safe
1.43 kB
initial commit
about 3 years ago
config.json
1.28 kB
Auto Commit
about 3 years ago
model_final.pth
pickle
Detected Pickle imports (4)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.BoolStorage"
,
"torch.FloatStorage"
What is a pickle import?
216 MB
xet
Auto Commit
about 3 years ago
model_init.pth
pickle
Detected Pickle imports (4)
"torch.BoolStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
216 MB
xet
Auto Commit
about 3 years ago
optimizer_state_dict.pth
pickle
Detected Pickle imports (6)
"numpy.dtype"
,
"numpy.core.multiarray.scalar"
,
"_codecs.encode"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
How to fix it?
425 MB
xet
Auto Commit
about 3 years ago
scheduler_state_dict.pth
pickle
Detected Pickle imports (3)
"_codecs.encode"
,
"numpy.dtype"
,
"numpy.core.multiarray.scalar"
How to fix it?
751 Bytes
xet
Auto Commit
about 3 years ago