Aleph-Alpha/tfree-hat-pretrained-7b-base
Safetensors
PyTorch
English
German
hierarchical_autoregressive_transformer
Aleph Alpha Research
Hierarchical Autoregressive Transformer
HAT
custom_code
arxiv: 2501.10322
License: open-aleph-license
Commit History
c4797ee  Fix model weight (verified; maxmeuer committed on Oct 22)
6a30f3b  Update code for latest transformer (verified; maxmeuer committed on Oct 22)
61d4a54  Fixed cached generation (verified; nvedant07 committed on Oct 10)
0094d6d  Update model.safetensors.index.json (verified; nvedant07 committed on Sep 15)
ad478dd  Upload bfloat16 model weight files (verified; nvedant07 committed on Sep 15)
453521c  Delete float32 model weight files (verified; nvedant07 committed on Sep 15)
93ac6c6  Fix chat template (verified; nvedant07 committed on Sep 11)
127a86b  Clarify long-context vs. pretraining checkpoints (verified; janmetzen-aa committed on Aug 20)
65de10b  Added LC checkpoint (verified; nvedant07 committed on Aug 20)
ec773c0  Fix type (word batch size) (verified; janmetzen-aa committed on Aug 12)
980fb52  Remove table and add direct link to DPO model (verified; janmetzen-aa committed on Aug 4)
ef77140  Update README.md (verified; nvedant07 committed on Aug 1)
e7b4c95  Update README.md (verified; nvedant07 committed on Aug 1)
6748bec  Update README.md (verified; janmetzen-aa committed on Aug 1)
6a69d29  Upload config.py with huggingface_hub (verified; nvedant07 committed on Jul 31)
4b9bdee  Upload folder using huggingface_hub (verified; nvedant07 committed on Jul 31)
9242856  initial commit (verified; nvedant07 committed on Jul 31)