Spaces:

OpenSound
/

FlexSED

Runtime error

OpenSound commited on Oct 12

Commit

49309df

verified ·

1 Parent(s): f7ad1a2

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,60 +1,13 @@
-# FlexSED: Towards Open-Vocabulary Sound Event Detection
-[![arXiv](https://img.shields.io/badge/arXiv-2409.10819-brightgreen.svg?style=flat-square)](https://arxiv.org/abs/2509.18606)
-[![Hugging Face Models](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-blue)](https://huggingface.co/Higobeatz/FlexSED/tree/main)
-## News
-- Oct 2025: 📦 Released code and pretrained checkpoint
-- Sep 2025: 🎉 FlexSED Spotlighted at WASPAA 2025
-## Installation
-Clone the repository:
-```
-git clone git@github.com:JHU-LCAP/FlexSED.git
-```
-Install the dependencies:
-```
-cd FlexSED
-pip install -r requirements.txt
-```
-## Usage
-```python
-from api import FlexSED
-import torch
-import soundfile as sf
-# load model
-flexsed = FlexSED(device='cuda')
-# run inference
-events = ["Dog"]
-preds = flexsed.run_inference("example.wav", events)
-# visualize prediciton
-flexsed.to_multi_plot(preds, events, fname="example2")
-# (Optional) visualize prediciton by video
-# flexsed.to_multi_video(preds, events, audio_path="example2.wav", fname="example2")
-```
-## Training
-WIP
-## Reference
-If you find the code useful for your research, please consider citing:
-```bibtex
-@article{hai2025flexsed,
-  title={FlexSED: Towards Open-Vocabulary Sound Event Detection},
-  author={Hai, Jiarui and Wang, Helin and Guo, Weizhe and Elhilali, Mounya},
-  journal={arXiv preprint arXiv:2509.18606},
-  year={2025}
-}
-```

+---
+title: FlexSED
+emoji: 🎧
+colorFrom: green
+colorTo: indigo
+sdk: gradio
+sdk_version: 5.31.0
+app_file: app.py
+pinned: false
+license: mit
+short_description: State-of-the-art target speech extractor
+tags: ["sound-event-detection"]
+---