Update README.md

README.md (CHANGED)

````diff
@@ -36,7 +36,7 @@ question = 'When was Obama inaugurated?'
 text = f'Text: {passage}.\nQuestion: {question}\nAnswer:{tokenizer.additional_special_tokens[0]}.'
 encoded_input = tokenizer(text, return_tensors='pt')
 output_ids = model.generate(input_ids=encoded_input.input_ids, attention_mask=encoded_input.attention_mask,
-                            eos_token_id=tokenizer.additional_special_tokens_ids[1])
+                            eos_token_id=tokenizer.additional_special_tokens_ids[1], num_beams=1, max_length=512, min_length=3)
 tokenizer.decode(output_ids[0])
 ```
 The generated answer is then `"<pad><extra_id_0> 2009<extra_id_1>"`, while the one generated by the original [T5-v1.1-large](https://huggingface.co/google/t5-v1_1-large) is `"<pad><extra_id_0> On January 20, 2009<extra_id_1>"` - a correct yet non-extractive answer.
@@ -59,6 +59,22 @@ The gap between the two models diminishes as more training examples are introduced
 
 ### BibTeX entry and citation info
 ```bibtex
+@inproceedings{ram-etal-2021-shot,
+    title = "Few-Shot Question Answering by Pretraining Span Selection",
+    author = "Ram, Ori and
+      Kirstain, Yuval and
+      Berant, Jonathan and
+      Globerson, Amir and
+      Levy, Omer",
+    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)",
+    month = aug,
+    year = "2021",
+    address = "Online",
+    publisher = "Association for Computational Linguistics",
+    url = "https://aclanthology.org/2021.acl-long.239",
+    doi = "10.18653/v1/2021.acl-long.239",
+    pages = "3066--3079",
+},
 @misc{castel2021optimal,
       title={How Optimal is Greedy Decoding for Extractive Question Answering?},
       author={Or Castel and Ori Ram and Avia Efrat and Omer Levy},
@@ -66,9 +82,6 @@ The gap between the two models diminishes as more training examples are introduced
       eprint={2108.05857},
       archivePrefix={arXiv},
       primaryClass={cs.CL}
-}
-<a href="https://huggingface.co/exbert/?model=distilbert-base-uncased">
-<img width="300px" src="https://cdn-media.huggingface.co/exbert/button.png">
-</a>
-
+}
 
+```
````
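For reference, here is how the README's example reads once this change is applied. This is a minimal, self-contained sketch rather than the model card's exact code: the checkpoint id `<model-repo>`, the example `passage`, and the `AutoTokenizer`/`AutoModelForSeq2SeqLM` loading are assumptions; only the prompt format and the `generate(...)` arguments come from the diff above. Setting `num_beams=1` makes the decoding greedy, which appears to correspond to the setting studied in the `castel2021optimal` entry cited in the BibTeX section.

```python
# Minimal sketch of the updated snippet. The repo id below is a placeholder and the
# passage is an illustrative example -- neither is taken from the model card itself.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_id = '<model-repo>'  # hypothetical placeholder for this model's Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

passage = 'Barack Obama was inaugurated as president of the United States in 2009.'  # example text
question = 'When was Obama inaugurated?'

# The answer slot is marked with the first sentinel token (<extra_id_0> in T5-style vocabularies).
text = f'Text: {passage}.\nQuestion: {question}\nAnswer:{tokenizer.additional_special_tokens[0]}.'
encoded_input = tokenizer(text, return_tensors='pt')

# num_beams=1 selects greedy decoding; generation stops at the second sentinel token
# (<extra_id_1>), and min_length=3 keeps the model from closing the answer immediately.
output_ids = model.generate(input_ids=encoded_input.input_ids,
                            attention_mask=encoded_input.attention_mask,
                            eos_token_id=tokenizer.additional_special_tokens_ids[1],
                            num_beams=1, max_length=512, min_length=3)
print(tokenizer.decode(output_ids[0]))  # e.g. "<pad><extra_id_0> 2009<extra_id_1>"
```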