Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -11,13 +11,25 @@ license: mit
|
|
| 11 |
short_description: Torch Transformers Diffusion SFT for Computer Vision
|
| 12 |
---
|
| 13 |
|
| 14 |
-
# SFT Tiny Titans 🚀
|
| 15 |
Tune NLP 🧠 or CV 🎨 fast! Texts 📝 or pics 📸, SFT shines ✨. `pip install -r requirements.txt`, `streamlit run app.py`. Snap cams 📷, craft art—AI’s lean & mean! 🎉 #SFTSpeed
|
| 16 |
|
|
|
|
| 17 |
|
| 18 |
-
|
| 19 |
|
| 20 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
## Abstract
|
| 23 |
TorchTransformers Diffusion SFT Titans harnesses `torch`, `transformers`, and `diffusers` for cutting-edge NLP and CV, powered by supervised fine-tuning (SFT). Dual `st.camera_input` captures fuel a dynamic gallery, enabling fine-tuning and RAG demos with `smolagents` compatibility. Key papers illuminate the stack:
|
|
|
|
| 11 |
short_description: Torch Transformers Diffusion SFT for Computer Vision
|
| 12 |
---
|
| 13 |
|
|
|
|
| 14 |
Tune NLP 🧠 or CV 🎨 fast! Texts 📝 or pics 📸, SFT shines ✨. `pip install -r requirements.txt`, `streamlit run app.py`. Snap cams 📷, craft art—AI’s lean & mean! 🎉 #SFTSpeed
|
| 15 |
|
| 16 |
+
# SFT Tiny Titans 🚀 (Small Diffusion Delight!)
|
| 17 |
|
| 18 |
+
A Streamlit app for Supervised Fine-Tuning (SFT) of small diffusion models, featuring multi-camera capture, model testing, and agentic RAG demos with a playful UI.
|
| 19 |
|
| 20 |
+
## Features 🎉
|
| 21 |
+
- **Build Titan 🌱**: Spin up tiny diffusion models from Hugging Face (Micro Diffusion, Latent Diffusion, FLUX.1 Distilled).
|
| 22 |
+
- **Camera Snap 📷**: Snap pics with 6 cameras using a 4-column grid UI per cam—witty, emoji-packed controls for device, label, hint, and visibility! 📸✨
|
| 23 |
+
- **Fine-Tune Titan (CV) 🔧**: Tune models with 3 use cases—denoising, stylization, multi-angle generation—using your camera captures, with CSV/MD exports.
|
| 24 |
+
- **Test Titan (CV) 🧪**: Generate images from prompts with your tuned diffusion titan.
|
| 25 |
+
- **Agentic RAG Party (CV) 🌐**: Craft superhero party visuals from camera-inspired prompts.
|
| 26 |
+
- **Media Gallery 🎨**: View, download, or zap captured images with flair.
|
| 27 |
+
|
| 28 |
+
## Installation 🛠️
|
| 29 |
+
1. Clone the repo:
|
| 30 |
+
```bash
|
| 31 |
+
git clone <repository-url>
|
| 32 |
+
cd sft-tiny-titans
|
| 33 |
|
| 34 |
## Abstract
|
| 35 |
TorchTransformers Diffusion SFT Titans harnesses `torch`, `transformers`, and `diffusers` for cutting-edge NLP and CV, powered by supervised fine-tuning (SFT). Dual `st.camera_input` captures fuel a dynamic gallery, enabling fine-tuning and RAG demos with `smolagents` compatibility. Key papers illuminate the stack:
|