Aleph-Alpha/tfree-hat-pretrained-7b-base
Safetensors
PyTorch
English
German
hierarchical_autoregressive_transformer
Aleph Alpha Research
Hierarchical Autoregressive Transformer
HAT
custom_code
arxiv: 2501.10322
License: open-aleph-license
main / tfree-hat-pretrained-7b-base / README.md
Commit History
Fix model weight
c4797ee
verified
maxmeuer
committed on
Oct 22
Update code for latest transformer
6a30f3b
verified
maxmeuer
committed on
Oct 22
Clarify long-context vs. pretraining checkpoints
127a86b
verified
janmetzen-aa
committed on
Aug 20
Fix type (word batch size)
ec773c0
verified
janmetzen-aa
committed on
Aug 12
Remove table and add direct link to DPO model
980fb52
verified
janmetzen-aa
committed on
Aug 4
Update README.md
ef77140
verified
nvedant07
committed on
Aug 1
Update README.md
e7b4c95
verified
nvedant07
committed on
Aug 1
Update README.md
6748bec
verified
janmetzen-aa
committed on
Aug 1
Upload folder using huggingface_hub
4b9bdee
verified
nvedant07
committed on
Jul 31