penfever commited on
Commit
74c00c8
·
verified ·
1 Parent(s): e28269e

Model save

Browse files
Files changed (2) hide show
  1. README.md +6 -7
  2. generation_config.json +1 -1
README.md CHANGED
@@ -4,7 +4,6 @@ license: apache-2.0
4
  base_model: Qwen/Qwen2.5-7B-Instruct
5
  tags:
6
  - llama-factory
7
- - full
8
  - generated_from_trainer
9
  model-index:
10
  - name: a1_math_formulas
@@ -16,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # a1_math_formulas
18
 
19
- This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on the mlfoundations-dev/a1_math_formulas dataset.
20
 
21
  ## Model description
22
 
@@ -44,7 +43,7 @@ The following hyperparameters were used during training:
44
  - gradient_accumulation_steps: 8
45
  - total_train_batch_size: 128
46
  - total_eval_batch_size: 128
47
- - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: cosine
49
  - lr_scheduler_warmup_ratio: 0.1
50
  - num_epochs: 5.0
@@ -55,7 +54,7 @@ The following hyperparameters were used during training:
55
 
56
  ### Framework versions
57
 
58
- - Transformers 4.46.1
59
- - Pytorch 2.5.1
60
- - Datasets 3.1.0
61
- - Tokenizers 0.20.3
 
4
  base_model: Qwen/Qwen2.5-7B-Instruct
5
  tags:
6
  - llama-factory
 
7
  - generated_from_trainer
8
  model-index:
9
  - name: a1_math_formulas
 
15
 
16
  # a1_math_formulas
17
 
18
+ This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on an unknown dataset.
19
 
20
  ## Model description
21
 
 
43
  - gradient_accumulation_steps: 8
44
  - total_train_batch_size: 128
45
  - total_eval_batch_size: 128
46
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
47
  - lr_scheduler_type: cosine
48
  - lr_scheduler_warmup_ratio: 0.1
49
  - num_epochs: 5.0
 
54
 
55
  ### Framework versions
56
 
57
+ - Transformers 4.55.0
58
+ - Pytorch 2.7.0+cu128
59
+ - Datasets 3.6.0
60
+ - Tokenizers 0.21.1
generation_config.json CHANGED
@@ -10,5 +10,5 @@
10
  "temperature": 0.7,
11
  "top_k": 20,
12
  "top_p": 0.8,
13
- "transformers_version": "4.46.1"
14
  }
 
10
  "temperature": 0.7,
11
  "top_k": 20,
12
  "top_p": 0.8,
13
+ "transformers_version": "4.55.0"
14
  }