SciDeBERTa: Learning DeBERTa for science technology documents and fine-tuning information extraction tasks

Authors

Yuna Jeong, Eunhui Kim

Publication date

2022/6/8

Journal

IEEE Access

Volume

Pages

60805-60813

Publisher

IEEE

Description

Deep learning-based language models (LMs) surpassed the gold standard (human baseline) on the SQuAD 1.1 and GLUE benchmarks in April and July 2019, respectively. As of 2022, the top five LMs on the SuperGLUE benchmark leaderboard have exceeded the gold standard. Even people with good general knowledge struggle to solve problems in specialized fields such as medicine and artificial intelligence. Just as humans learn specialized knowledge through bachelor’s, master’s, and doctoral courses, LMs also require a process to develop the ability to understand domain-specific knowledge. Thus, this study proposes SciDeBERTa and SciDeBERTa(CS) as pre-trained LMs (PLMs) specialized in the science and technology domain. We further pretrain DeBERTa, which was trained on a general corpus, on a science and technology domain corpus. Experiments verified that SciDeBERTa(CS …

Total citations

Cited by 12

2023: 6, 2024: 6
