Loading...
Multilingual Iterative Model Pruning: What Matters?
Wibowo, Haryo Akbarianto ; Song, Haiyue ; Tanaka, Hideki ; Utiyama, Masao ; Aji, Alham Fikri ; Dabre, Raj
Wibowo, Haryo Akbarianto
Song, Haiyue
Tanaka, Hideki
Utiyama, Masao
Aji, Alham Fikri
Dabre, Raj
Files
Loading...
2025.ijcnlp-long.32.pdf
Adobe PDF, 12.31 MB
Supervisor
Department
Natural Language Processing
Embargo End Date
Type
Conference proceeding
Date
License
http://creativecommons.org/licenses/by/4.0/
Language
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Pruning techniques have been studied to construct small models for efficiency, yet the effect of cross-lingual, which shows language performance transferability, is understudied in this field. In this work, we investigate cross-lingual effects in multilingual large language model compression using iterative pruning and recovery. We employ structured layer pruning with LoRA-based recovery and knowledge distillation, testing whether calibration languages different from target evaluation languages can preserve multilingual performance. Experiments on Qwen2.5-7B and Llama3.1-8B demonstrate that any recovery language consistently outperforms no-recovery baselines, with even low-resource languages like Swahili providing ~5% improvements. In contrast to expectations, dominant pretraining languages do not always yield the best results, where Indonesian achieves the highest performance in Llama3.1-8B, while Japanese performs the best in Qwen2.5-7B. Our findings reveal that cross-lingual calibration effectively maintains multilingual capabilities in the iterative pruning.
Citation
H.A. Wibowo, H. Song, H. Tanaka, M. Utiyama, A.F. Aji, R. Dabre, "Multilingual Iterative Model Pruning: What Matters?," 2025, pp. 543-571.
Source
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Conference
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Keywords
Subjects
Source
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Publisher
Association for Computational Linguistics
