pszemraj
/

griffin-v0.01-c3t-8layer-simplewiki

Text Generation

recurrent_gemma

Generated from Trainer

Model card Files Files and versions

pszemraj commited on Apr 25, 2024

Commit

c21eff8

·

verified ·

1 Parent(s): f60ce31

Update README.md

Files changed (1) hide show

README.md +12 -5

README.md CHANGED Viewed

@@ -3,21 +3,28 @@ tags:
 - generated_from_trainer
 metrics:
 - accuracy
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# griffin-1024-c3t-8layer-simple_wikipedia_LM-vN
 This model is a fine-tuned version of [./griffin-1024-c3t-8layer](https://huggingface.co/./griffin-1024-c3t-8layer) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 4.1928
 - Accuracy: 0.4084
-## Model description
-More information needed
 ## Intended uses & limitations
@@ -61,4 +68,4 @@ The following hyperparameters were used during training:
 - Transformers 4.40.1
 - Pytorch 2.2.0+cu121
 - Datasets 2.19.0
-- Tokenizers 0.19.1

 - generated_from_trainer
 metrics:
 - accuracy
+license: apache-2.0
+datasets:
+- pszemraj/simple_wikipedia_LM
+language:
+- en
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# pszemraj/griffin-v0.01-c3t-8layer-simplewiki
+- griffin/recurrent_gemma arch
+- claude3 tokenizer (as an HF gpt2 tokenizer)
+## Model description
 This model is a fine-tuned version of [./griffin-1024-c3t-8layer](https://huggingface.co/./griffin-1024-c3t-8layer) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 4.1928
 - Accuracy: 0.4084
 ## Intended uses & limitations
 - Transformers 4.40.1
 - Pytorch 2.2.0+cu121
 - Datasets 2.19.0
+- Tokenizers 0.19.1