Update README.md
Browse files
README.md
CHANGED
|
@@ -10,6 +10,9 @@ This network uses the optimal transport flow matching objective outlined [Flow M
|
|
| 10 |
xATGLU Layers are used instead of linears for entry into the transformer MLP layer [Expanded Gating Ranges
|
| 11 |
Improve Activation Functions](https://arxiv.org/pdf/2405.20768)
|
| 12 |
|
|
|
|
|
|
|
|
|
|
| 13 |

|
| 14 |

|
| 15 |

|
|
|
|
| 10 |
xATGLU Layers are used instead of linears for entry into the transformer MLP layer [Expanded Gating Ranges
|
| 11 |
Improve Activation Functions](https://arxiv.org/pdf/2405.20768)
|
| 12 |
|
| 13 |
+
`python train.py` will train a new image network on the provided dataset. A pretrained model is provided here, step_1799.safetensors. To test, use:
|
| 14 |
+
`python test_sample.py step_1799.safetensors`
|
| 15 |
+
|
| 16 |

|
| 17 |

|
| 18 |

|