File size: 2,610 Bytes
cc0ffcc
 
 
 
 
 
 
 
 
1f33dfe
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8d94fb5
 
 
 
 
1f33dfe
 
 
 
 
 
 
 
 
 
8d94fb5
 
1f33dfe
 
 
 
 
 
 
 
 
 
8d94fb5
 
1f33dfe
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
---
title: README
emoji: 🐨
colorFrom: gray
colorTo: gray
sdk: static
pinned: false
---

# MAAP LAB 🎡 β€” Music AI & Audio Research
> β€œWhy are there so few labs in Korea dedicated to Music AI? We built one.”

<div align="left">

**Focus areas:**  
🎼 Audio Generation Β· 🏷️ Music Tagging Β· πŸ—£οΈ Voice Conversion Β· 🧠 Transformers Β· πŸ’¨ Diffusion

</div>

## Mission
Advance the foundations of **Music AI** through practical research in tagging, generation, and dataset-centric methods β€” then share our results openly with the community. ✨

## Open Science
We aim to publish at top venues (e.g., **ICASSP**, **ISMIR**, **AAAI**) and release code, models, and datasets whenever possible. πŸ“’

---

## Latest News πŸ—žοΈ
- βœ… **NeurIPS Workshop 2025**:  
  - Accepted πŸŽ‰ **AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion**  
- ✍️ **ICASSP 2026 submission**:  
  - Resubmitted and extended version of **Jamendo-QA**  
  - Preprint available on [arXiv (2509.15662)](https://arxiv.org/abs/2509.15662)  
- 🧰 GPU resources via university support: **NVIDIA A100**, **A6000**, **RTX 4090** βš™οΈ

---

## Our Activities 🎯

### Project 1 β€” Music Tagging (Completed) 🏷️
- Built a tagging & augmentation pipeline with **CLAP**, **Beam Search**, **Stable Audio**  
- Focus: dataset **augmentation/creation** for future work  
- Targets: short-term **word generation** β†’ long-term **sentence generation** with **LLMs**  
- **Outcome:** 2 **NeurIPS Workshop submissions** (1 accepted, 1 rejected β†’ resubmitted to ICASSP)  
<sub>[AIBA β€” Accepted βœ…](#) Β· [Jamendo-QA β€” Arxiv πŸ”—](https://arxiv.org/abs/2509.15662)</sub>

### Project 2 β€” Efficient Music Generation (In Progress) 🎢
- Exploring **Diffusion** & **DiT (e.g., Flux)**  
- **LoRA/Adapters** to avoid full fine-tuning  
- Goal: robust generation for **data-scarce** genres/instruments/domains  
- Roadmap: dataset curation β†’ baseline reproduction β†’ adapter experiments β†’ ablations β†’ release

---

## Publications & Submissions πŸ“š
- **AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion** β€” *Accepted at NeurIPS Workshop 2025*  
- **Jamendo-QA: A Large-Scale Music Question Answering Dataset** β€” *Submitted to ICASSP 2026 Β· Preprint on [arXiv](https://arxiv.org/abs/2509.15662)*

---

## Get Involved 🀝
Interested in collaborating on **Music AI**? We welcome discussions on datasets, evaluation, and model design.  
**Contact:** _arsol970812@gmail.com_ βœ‰οΈ

---

<sub>Β© 2025 MAAP LAB β€’ Built with ❀️ for music & AI.</sub>