Mizuiro-sakura/t5-CAMERA-title-generation


Mizuiro-sakura committed on Mar 21, 2023

Commit d60a422 · 1 Parent(s): 888d961

Update README.md

Files changed (1)

README.md +21 -1


@@ -11,7 +11,27 @@ datasets: shunk031/CAMERA
 pipeline_tag: text2text-generation
 ---
-# Title generation
+# This model is sonoisa/t5-base-japanese fine-tuned for title generation.
+Given an input text, the model performs abstractive summarization and outputs a title for that text.
+# What is sonoisa/t5-base-japanese?
+>This is a T5 (Text-to-Text Transfer Transformer) model pretrained on the following Japanese corpora (about 100 GB):
+>Japanese Wikipedia dump (as of July 6, 2020)
+>OSCAR Japanese corpus
+>CC-100 Japanese corpus
+>This model has only been pretrained; it must be fine-tuned before it can be used for a specific task.
+>Like any language model trained on a large corpus, this model can potentially produce skewed output (unethical, harmful, or biased) that stems from imbalances in its training data. Keep this possibility in mind and use the model only for purposes where it cannot cause harm.
+>The SentencePiece tokenizer was trained on the full Wikipedia data listed above.
+Quoted from https://huggingface.co/sonoisa/t5-base-japanese/blob/main/README.md
 # How to use
 Install transformers, datasets, and sentencepiece, then run the code below.
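
The hunk ends before the README's own usage code, so that code is not visible in this diff. What follows is only a minimal sketch of how a fine-tuned T5 checkpoint like this one is typically called. It is not the author's script. It assumes PyTorch is installed alongside transformers and sentencepiece, that the checkpoint loads under the repository name shown on this page, and that no task prefix is required. The helper names `clean_text` and `generate_title`, the token limits, and the beam-search settings are all hypothetical choices made for illustration.

```python
def clean_text(text: str) -> str:
    """Collapse every run of whitespace (spaces, tabs, newlines) into one space."""
    return " ".join(text.split())


def generate_title(
    text: str,
    model_name: str = "Mizuiro-sakura/t5-CAMERA-title-generation",  # repo name from this page
    max_input_tokens: int = 512,   # assumed input limit, not taken from the README
    max_title_tokens: int = 32,    # assumed title length limit, not taken from the README
) -> str:
    """Generate a title for `text` with a fine-tuned T5 checkpoint (illustrative sketch)."""
    # Imported lazily so that clean_text stays usable without the heavy dependencies.
    from transformers import T5ForConditionalGeneration, T5Tokenizer

    tokenizer = T5Tokenizer.from_pretrained(model_name)
    model = T5ForConditionalGeneration.from_pretrained(model_name)
    model.eval()

    # Tokenize the cleaned input, truncating anything beyond the assumed limit.
    inputs = tokenizer(
        clean_text(text),
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )

    # Beam search is an illustrative decoding choice, not the README's setting.
    output_ids = model.generate(
        **inputs,
        max_length=max_title_tokens,
        num_beams=4,
        early_stopping=True,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_title(article_text)` downloads the checkpoint on first use and returns the generated title as a string. For the exact settings the author used, refer to the code in the full README.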