F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data Paper • 2510.02294 • Published 30 days ago • 43
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding Paper • 2409.04183 • Published Sep 6, 2024 • 2
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code Paper • 2311.07989 • Published Nov 14, 2023 • 26
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning Paper • 2311.02303 • Published Nov 4, 2023 • 12
CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model Paper • 2310.06266 • Published Oct 10, 2023 • 2
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models Paper • 2410.06741 • Published Oct 9, 2024 • 3
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions Paper • 2410.06577 • Published Oct 9, 2024 • 14
Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM Paper • 2503.17793 • Published Mar 22 • 23
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Paper • 2505.16901 • Published May 22 • 47