Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
|
@@ -551,12 +551,6 @@ with gr.Blocks(css=css) as demo:
|
|
| 551 |
## 📝 Quantization Options
|
| 552 |
|
| 553 |
### Quantization Types
|
| 554 |
-
"Int4WeightOnly",
|
| 555 |
-
"GemliteUIntXWeightOnly"
|
| 556 |
-
"Int8WeightOnly",
|
| 557 |
-
"Int8DynamicActivationInt8Weight",
|
| 558 |
-
"Float8WeightOnly",
|
| 559 |
-
"Float8DynamicActivationFloat8Weight",
|
| 560 |
- **Int4WeightOnly**: 4-bit weight-only quantization
|
| 561 |
- **GemliteUIntXWeightOnly**: uintx gemlite quantization (default to 4 bit only for now)
|
| 562 |
- **Int8WeightOnly**: 8-bit weight-only quantization
|
|
|
|
| 551 |
## 📝 Quantization Options
|
| 552 |
|
| 553 |
### Quantization Types
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 554 |
- **Int4WeightOnly**: 4-bit weight-only quantization
|
| 555 |
- **GemliteUIntXWeightOnly**: uintx gemlite quantization (default to 4 bit only for now)
|
| 556 |
- **Int8WeightOnly**: 8-bit weight-only quantization
|