|
|
--- |
|
|
title: README |
|
|
emoji: π¨ |
|
|
colorFrom: gray |
|
|
colorTo: gray |
|
|
sdk: static |
|
|
pinned: false |
|
|
--- |
|
|
|
|
|
# MAAP LAB π΅ β Music AI & Audio Research |
|
|
> βWhy are there so few labs in Korea dedicated to Music AI? We built one.β |
|
|
|
|
|
<div align="left"> |
|
|
|
|
|
**Focus areas:** |
|
|
πΌ Audio Generation Β· π·οΈ Music Tagging Β· π£οΈ Voice Conversion Β· π§ Transformers Β· π¨ Diffusion |
|
|
|
|
|
</div> |
|
|
|
|
|
## Mission |
|
|
Advance the foundations of **Music AI** through practical research in tagging, generation, and dataset-centric methods β then share our results openly with the community. β¨ |
|
|
|
|
|
## Open Science |
|
|
We aim to publish at top venues (e.g., **ICASSP**, **ISMIR**, **AAAI**) and release code, models, and datasets whenever possible. π’ |
|
|
|
|
|
--- |
|
|
|
|
|
## Latest News ποΈ |
|
|
- β
**NeurIPS Workshop 2025**: |
|
|
- Accepted π **AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion** |
|
|
- βοΈ **ICASSP 2026 submission**: |
|
|
- Resubmitted and extended version of **Jamendo-QA** |
|
|
- Preprint available on [arXiv (2509.15662)](https://arxiv.org/abs/2509.15662) |
|
|
- π§° GPU resources via university support: **NVIDIA A100**, **A6000**, **RTX 4090** βοΈ |
|
|
|
|
|
--- |
|
|
|
|
|
## Our Activities π― |
|
|
|
|
|
### Project 1 β Music Tagging (Completed) π·οΈ |
|
|
- Built a tagging & augmentation pipeline with **CLAP**, **Beam Search**, **Stable Audio** |
|
|
- Focus: dataset **augmentation/creation** for future work |
|
|
- Targets: short-term **word generation** β long-term **sentence generation** with **LLMs** |
|
|
- **Outcome:** 2 **NeurIPS Workshop submissions** (1 accepted, 1 rejected β resubmitted to ICASSP) |
|
|
<sub>[AIBA β Accepted β
](#) Β· [Jamendo-QA β Arxiv π](https://arxiv.org/abs/2509.15662)</sub> |
|
|
|
|
|
### Project 2 β Efficient Music Generation (In Progress) πΆ |
|
|
- Exploring **Diffusion** & **DiT (e.g., Flux)** |
|
|
- **LoRA/Adapters** to avoid full fine-tuning |
|
|
- Goal: robust generation for **data-scarce** genres/instruments/domains |
|
|
- Roadmap: dataset curation β baseline reproduction β adapter experiments β ablations β release |
|
|
|
|
|
--- |
|
|
|
|
|
## Publications & Submissions π |
|
|
- **AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion** β *Accepted at NeurIPS Workshop 2025* |
|
|
- **Jamendo-QA: A Large-Scale Music Question Answering Dataset** β *Submitted to ICASSP 2026 Β· Preprint on [arXiv](https://arxiv.org/abs/2509.15662)* |
|
|
|
|
|
--- |
|
|
|
|
|
## Get Involved π€ |
|
|
Interested in collaborating on **Music AI**? We welcome discussions on datasets, evaluation, and model design. |
|
|
**Contact:** _arsol970812@gmail.com_ βοΈ |
|
|
|
|
|
--- |
|
|
|
|
|
<sub>Β© 2025 MAAP LAB β’ Built with β€οΈ for music & AI.</sub> |