File size: 2,610 Bytes
cc0ffcc 1f33dfe 8d94fb5 1f33dfe 8d94fb5 1f33dfe 8d94fb5 1f33dfe |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 |
---
title: README
emoji: π¨
colorFrom: gray
colorTo: gray
sdk: static
pinned: false
---
# MAAP LAB π΅ β Music AI & Audio Research
> βWhy are there so few labs in Korea dedicated to Music AI? We built one.β
<div align="left">
**Focus areas:**
πΌ Audio Generation Β· π·οΈ Music Tagging Β· π£οΈ Voice Conversion Β· π§ Transformers Β· π¨ Diffusion
</div>
## Mission
Advance the foundations of **Music AI** through practical research in tagging, generation, and dataset-centric methods β then share our results openly with the community. β¨
## Open Science
We aim to publish at top venues (e.g., **ICASSP**, **ISMIR**, **AAAI**) and release code, models, and datasets whenever possible. π’
---
## Latest News ποΈ
- β
**NeurIPS Workshop 2025**:
- Accepted π **AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion**
- βοΈ **ICASSP 2026 submission**:
- Resubmitted and extended version of **Jamendo-QA**
- Preprint available on [arXiv (2509.15662)](https://arxiv.org/abs/2509.15662)
- π§° GPU resources via university support: **NVIDIA A100**, **A6000**, **RTX 4090** βοΈ
---
## Our Activities π―
### Project 1 β Music Tagging (Completed) π·οΈ
- Built a tagging & augmentation pipeline with **CLAP**, **Beam Search**, **Stable Audio**
- Focus: dataset **augmentation/creation** for future work
- Targets: short-term **word generation** β long-term **sentence generation** with **LLMs**
- **Outcome:** 2 **NeurIPS Workshop submissions** (1 accepted, 1 rejected β resubmitted to ICASSP)
<sub>[AIBA β Accepted β
](#) Β· [Jamendo-QA β Arxiv π](https://arxiv.org/abs/2509.15662)</sub>
### Project 2 β Efficient Music Generation (In Progress) πΆ
- Exploring **Diffusion** & **DiT (e.g., Flux)**
- **LoRA/Adapters** to avoid full fine-tuning
- Goal: robust generation for **data-scarce** genres/instruments/domains
- Roadmap: dataset curation β baseline reproduction β adapter experiments β ablations β release
---
## Publications & Submissions π
- **AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion** β *Accepted at NeurIPS Workshop 2025*
- **Jamendo-QA: A Large-Scale Music Question Answering Dataset** β *Submitted to ICASSP 2026 Β· Preprint on [arXiv](https://arxiv.org/abs/2509.15662)*
---
## Get Involved π€
Interested in collaborating on **Music AI**? We welcome discussions on datasets, evaluation, and model design.
**Contact:** _arsol970812@gmail.com_ βοΈ
---
<sub>Β© 2025 MAAP LAB β’ Built with β€οΈ for music & AI.</sub> |