pszemraj commited on
Commit
c21eff8
·
verified ·
1 Parent(s): f60ce31

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -5
README.md CHANGED
@@ -3,21 +3,28 @@ tags:
3
  - generated_from_trainer
4
  metrics:
5
  - accuracy
 
 
 
 
 
6
  ---
7
 
8
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
9
  should probably proofread and complete it, then remove this comment. -->
10
 
11
- # griffin-1024-c3t-8layer-simple_wikipedia_LM-vN
 
 
 
 
 
12
 
13
  This model is a fine-tuned version of [./griffin-1024-c3t-8layer](https://huggingface.co/./griffin-1024-c3t-8layer) on an unknown dataset.
14
  It achieves the following results on the evaluation set:
15
  - Loss: 4.1928
16
  - Accuracy: 0.4084
17
 
18
- ## Model description
19
-
20
- More information needed
21
 
22
  ## Intended uses & limitations
23
 
@@ -61,4 +68,4 @@ The following hyperparameters were used during training:
61
  - Transformers 4.40.1
62
  - Pytorch 2.2.0+cu121
63
  - Datasets 2.19.0
64
- - Tokenizers 0.19.1
 
3
  - generated_from_trainer
4
  metrics:
5
  - accuracy
6
+ license: apache-2.0
7
+ datasets:
8
+ - pszemraj/simple_wikipedia_LM
9
+ language:
10
+ - en
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
+ # pszemraj/griffin-v0.01-c3t-8layer-simplewiki
17
+
18
+ - griffin/recurrent_gemma arch
19
+ - claude3 tokenizer (as an HF gpt2 tokenizer)
20
+
21
+ ## Model description
22
 
23
  This model is a fine-tuned version of [./griffin-1024-c3t-8layer](https://huggingface.co/./griffin-1024-c3t-8layer) on an unknown dataset.
24
  It achieves the following results on the evaluation set:
25
  - Loss: 4.1928
26
  - Accuracy: 0.4084
27
 
 
 
 
28
 
29
  ## Intended uses & limitations
30
 
 
68
  - Transformers 4.40.1
69
  - Pytorch 2.2.0+cu121
70
  - Datasets 2.19.0
71
+ - Tokenizers 0.19.1