FeYuan 
posted an update 19 days ago
Meet LLaMAX2! A lightweight pipeline: SFT on Qwen3-Instruct models without catastrophic forgetting!
✨Highlights:
🔹 SOTA Translation: State-of-the-art translation performance across both high- and low-resource trained languages.
🔹 Lightweight Pipeline: Engineered for efficiency, our pipeline uses minimal parallel data and applies layer-selective tuning to a powerful instruct model.
🔹 Strong Reasoning Capabilities: Exhibits reasoning abilities that are competitive with top-tier models like Qwen3-Instruct.
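
To make "layer-selective tuning" concrete, here is a minimal sketch of the general idea: train only a chosen subset of decoder layers and freeze everything else. The post does not say which layers LLaMAX2 tunes or how they are picked, so the function `select_trainable`, the layer indices, and the `model.layers.N.` parameter naming are all assumptions for illustration, not the authors' method.

```python
import re

# Matches the decoder-layer index in names such as "model.layers.12.mlp.up_proj.weight".
# This naming pattern is an assumption for illustration.
LAYER_RE = re.compile(r"\.layers\.(\d+)\.")


def select_trainable(param_names, trainable_layers):
    """Map each parameter name to True (train it) or False (freeze it)."""
    chosen = set(trainable_layers)
    plan = {}
    for name in param_names:
        match = LAYER_RE.search(name)
        # Parameters outside any decoder layer (embeddings, LM head) stay frozen.
        plan[name] = bool(match) and int(match.group(1)) in chosen
    return plan


# Hypothetical parameter names and layer choice, for demonstration only.
names = [
    "model.embed_tokens.weight",
    "model.layers.0.self_attn.q_proj.weight",
    "model.layers.1.mlp.up_proj.weight",
    "model.layers.2.mlp.down_proj.weight",
    "lm_head.weight",
]
plan = select_trainable(names, trainable_layers=[0, 2])
for name, train in plan.items():
    print(f"{'train ' if train else 'freeze'} {name}")
```

With a PyTorch model, a plan like this could be applied by setting `param.requires_grad = plan[name]` for each `name, param` in `model.named_parameters()`. Freezing most of the network is one common way to limit forgetting during SFT.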

You are welcome to try our models. More details:
🎉 Paper: LLaMAX2: Your Translation-Enhanced Model also Performs Well in Reasoning (2510.09189)
🎉 Code: https://github.com/CONE-MT/LLaMAX2.0
🎉 Model: LLaMAX/llamax20-68ad1c154fcf2623b75a068c
