WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Authors

Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang

Publication date

2023/6/14

Journal

arXiv preprint arXiv:2306.08568

Description

Code Large Language Models (Code LLMs), such as StarCoder, have demonstrated exceptional performance in code-related tasks. However, most existing models are solely pre-trained on extensive raw code data without instruction fine-tuning. In this paper, we introduce WizardCoder, which empowers Code LLMs with complex instruction fine-tuning, by adapting the Evol-Instruct method to the domain of code. Through comprehensive experiments on four prominent code generation benchmarks, namely HumanEval, HumanEval+, MBPP, and DS-1000, we unveil the exceptional capabilities of our model. It surpasses all other open-source Code LLMs by a substantial margin. Moreover, our model even outperforms the largest closed LLMs, Anthropic's Claude and Google's Bard, on HumanEval and HumanEval+. Our code, model weights, and data are public at https://github.com/nlpxucan/WizardLM

Total citations

Cited by 303

Citations per year: 2022: 1, 2023: 87, 2024: 213
