UW Interactive Data Lab
IDL logo

SimpleScience: Lexical Simplification of Scientific Terminology

Yea-Seul Kim, Jessica Hullman, Matthew Burgess, Eytan Adar. Empirical Methods in Natural Language Processing, 2016
Abstract
Lexical simplification of scientific terms represents a unique challenge due to the lack of a standard parallel corpora and fast rate at which vocabulary shift along with research. We introduce SimpleScience, a lexical simplification approach for scientific terminology. We use word embeddings to extract simplification rules from a parallel corpora containing scientific publications and Wikipedia. To evaluate our system we construct SimpleSciGold, a novel gold standard set for science-related simplifications. We find that our approach outperforms prior context-aware approaches at generating simplifications for scientific terms.
BibTeX
@inproceedings{2016-simple-science,
  title = {SimpleScience: Lexical Simplification of Scientific Terminology},
  author = {Kim, Yea-Seul AND Hullman, Jessica AND Burgess, Matthew AND Adar, Eytan},
  booktitle = {Empirical Methods in Natural Language Processing},
  year = {2016},
  url = {https://uwdata.github.io/papers/simple-science},
  doi = {10.18653/v1/d16-1114}
}