Update README.md
Browse files
    	
        README.md
    CHANGED
    
    | 
         @@ -60,7 +60,7 @@ This is a fine-tuned model for Korean speech-to-text translation, from [microsof 
     | 
|
| 60 | 
         | 
| 61 | 
         
             
            Total 35K samples. Each sample is a pair of Korean speech and its transcription. Dataset was sampled 16kHz.
         
     | 
| 62 | 
         | 
| 63 | 
         
            -
            The model was trained on a single A100 80GB GPU for  
     | 
| 64 | 
         | 
| 65 | 
         
             
            Note that this model is just a PoC/experimental purpose, and not intended to be used in production. More high-quality data, tuning, ablation studies, and experiments are needed.
         
     | 
| 66 | 
         | 
| 
         | 
|
| 60 | 
         | 
| 61 | 
         
             
            Total 35K samples. Each sample is a pair of Korean speech and its transcription. Dataset was sampled 16kHz.
         
     | 
| 62 | 
         | 
| 63 | 
         
            +
            The model was trained on a single A100 80GB GPU for 4 epochs with a batch size of 16 using the `sample_finetune_speech.py` script from [microsoft/Phi-4-multimodal-instruct](https://huggingface.co/microsoft/Phi-4-multimodal-instruct)
         
     | 
| 64 | 
         | 
| 65 | 
         
             
            Note that this model is just a PoC/experimental purpose, and not intended to be used in production. More high-quality data, tuning, ablation studies, and experiments are needed.
         
     | 
| 66 | 
         |