Improve paper link display title
This PR improves the model card by updating the display title of the paper link in the content to the paper's full official title, "TPTT: Transforming Pretrained Transformers into Titans", while keeping the existing Hugging Face Papers URL. This change improves clarity and consistency with the paper's official documentation.
No other changes are made to the metadata, badges (including the arXiv link, per the guidelines), or usage examples, as they are already well documented.
README.md CHANGED

@@ -1,15 +1,15 @@
 ---
-language: en
-license: apache-2.0
-library_name: transformers
-tags:
-- tptt
-- peft
-- trust_remote_code
-pipeline_tag: text-generation
 base_model: allenai/OLMoE-1B-7B-0924
 datasets:
 - yahma/alpaca-cleaned
+language: en
+library_name: transformers
+license: apache-2.0
+pipeline_tag: text-generation
+tags:
+- tptt
+- peft
+- trust_remote_code
 ---
 
 # Titans-v2-OLMoE-1B-7B-0924
@@ -34,20 +34,20 @@ datasets:
 
 Titanesque version of `allenai/OLMoE-1B-7B-0924` with parallel linearized attention (TPTT 😊) and PEFT.
 
-The architecture was presented in the paper [TPTT](https://huggingface.co/papers/2506.17671).
+The architecture was presented in the paper [TPTT: Transforming Pretrained Transformers into Titans](https://huggingface.co/papers/2506.17671).
 
 
 ## Model list
 
 Classic model parameter with LiZA injection :
 
-| Subfolder
-|
-| delta_rule
-| delta_rule_gelu | 8192 (default) | 0.5
-| delta_product
-| delta_product_r
-| delta_product_c
+| Subfolder | Max Self Attn Length | Mag Weight | Cross Gate | Max Chunk Size | Bidirectional | LoRA | Description |
+|---|---|---|---|---|---|---|---|
+| delta_rule | 8192 (default) | 0.5 | False | 64 | False | Yes | Parallel linearized attention with delta_rule operator |
+| delta_rule_gelu | 8192 (default) | 0.5 | False | 64 | False | Yes | Non-linear operator with gelu activation |
+| delta_product | 8192 (default) | 0.5 | False | 64 | False | Yes | Second order operator with derivative trick |
+| delta_product_r | 8192 (default) | 0.5 | False | 64 | False | Yes | Second order operator with rotative trick |
+| delta_product_c | 8192 (default) | 0.5 | False | 64 | False | Yes | Second order operator with combined trick |
 
 ## Usage
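For context on what the Model list table describes, here is a minimal loading sketch, not part of this PR: the repository id is a placeholder (the card's actual repo id is not shown in this diff), the subfolder names come from the table above, only standard `transformers` arguments (`subfolder`, `trust_remote_code`) are used, and the tokenizer is assumed to be the unchanged base-model tokenizer.

```python
# Hypothetical usage sketch (not part of this PR). Assumptions are marked inline.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "<org>/Titans-v2-OLMoE-1B-7B-0924"  # placeholder: actual repo id not shown in this diff

# Each row of the "Model list" table corresponds to a subfolder in the repo,
# e.g. delta_rule, delta_rule_gelu, delta_product, delta_product_r, delta_product_c.
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    subfolder="delta_rule",   # pick any variant from the table
    trust_remote_code=True,   # the card is tagged trust_remote_code
)

# Assumption: the tokenizer of the base model is reused unchanged.
tokenizer = AutoTokenizer.from_pretrained("allenai/OLMoE-1B-7B-0924")

inputs = tokenizer("Explain linearized attention in one sentence.", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

The card's actual `## Usage` section lies outside this diff hunk and is left untouched by the PR.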