porhan committed (verified)
Commit 919a76d · Parent(s): 713909e

Update README.md

Files changed (1): README.md (+4, -2)
README.md CHANGED
@@ -19,10 +19,12 @@ This work was granted access to the HPC resources of IDRIS under the allocations

  Models currently available are:

- - Wav2vec2 base model () pretrained (no fine-tuning) on Librispeech (English speech), FMA (music), subset of audioset, or all of them together. It also includes a model pretrained on VoxPopuli french dataset.
+ - Wav2vec2 base model (https://huggingface.co/facebook/wav2vec2-base), but pretrained (no fine-tuning) on Librispeech (English speech), FMA (music), a subset of AudioSet, or all of them together. It also includes a model pretrained on the French VoxPopuli dataset.
  - Wav2vec2 tiny model, where we used only 3 transformer layers. Models' performances are surprisingly high.

  Scientific papers using the models provided in this repository:
  Orhan, P., Boubenec, Y., & King, J.-R. (2024). Algebraic structures emerge from the self-supervised learning of natural sounds. https://doi.org/10.1101/2024.03.13.584776

- Models are pretrained using HuggingFace's trainer.
+ Models are pretrained using HuggingFace's Trainer.
+ The pretraining of these models is often shorter (100,000 steps compared to 400,000) than the original pretraining because of resource scarcity.
+ In my experience, most of the emergences I studied had already happened before 100,000 steps.
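
For readers of this commit, a minimal sketch of loading one of these pretrained checkpoints for feature extraction with the transformers library. It assumes the repository's checkpoints follow the standard Wav2Vec2 layout; the model id shown is the reference base model linked above, not one of this repository's checkpoints, so substitute the appropriate model id.

```python
# Minimal sketch (not part of the commit): extracting hidden states from a
# pretrained Wav2vec2 checkpoint. The model id below is the reference base
# model, not one of this repository's checkpoints; swap in the repo's own id.
import torch
from transformers import AutoFeatureExtractor, Wav2Vec2Model

model_id = "facebook/wav2vec2-base"
feature_extractor = AutoFeatureExtractor.from_pretrained(model_id)
model = Wav2Vec2Model.from_pretrained(model_id)
model.eval()

# One second of silent 16 kHz audio stands in for a real waveform.
waveform = torch.zeros(16000).numpy()
inputs = feature_extractor(waveform, sampling_rate=16000, return_tensors="pt")

with torch.no_grad():
    hidden_states = model(**inputs).last_hidden_state  # (batch, frames, hidden_size)

print(hidden_states.shape)  # e.g. torch.Size([1, 49, 768]) for the base model
```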