Expanding paraphrase lexicons by exploiting lexical variants

DOI: http://doi.org/10.3115/v1/N15-1065
Type: Article
Proceedings title: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Conference: 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, May 31-June 5, 2015, Denver, Colorado, USA
ISBN: 978-1-941643-49-5
Article number: N15-1065
Pages: 630–640
Abstract: This study tackles the problem of paraphrase acquisition: achieving high coverage as well as accuracy. Our method first induces paraphrase patterns from given seed paraphrases, exploiting the generality of paraphrases exhibited by pairs of lexical variants, e.g., “amendment” and “amending,” in a fully empirical way. It then searches monolingual corpora for new paraphrases that match the patterns. This can extract paraphrases comprising words that are completely different from those of the given seeds. In experiments, our method expanded seed sets by factors of 42 to 206, gaining 84% to 208% more coverage than a previous method that generalizes only identical word forms. Human evaluation through a paraphrase substitution test demonstrated that the newly acquired paraphrases retained reasonable quality, given substantially high-quality seeds.
Publication date
Publisher: Association for Computational Linguistics
Language: English
Affiliation: Information and Communication Technologies; National Research Council Canada
Peer reviewed: Yes
NPARC number: 23000029
Record identifier: 17c01587-4974-485b-916e-e67059fde624
Record created: 2016-05-30
Record modified: 2016-05-30
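
The two steps summarized in the abstract (inducing a pattern from a seed paraphrase, then searching a corpus for new pairs that match it) can be illustrated with a small sketch. This is not the authors' implementation. The abstract says the paper identifies lexical variants in a fully empirical way, whereas this sketch substitutes a simple shared-prefix heuristic. The function names, the `min_stem` threshold, the `{X}`/`{Y}` slot notation, and the toy corpus are all assumptions made for illustration, and the sketch assumes each seed has one variant pair and at most one shared word.

```python
import re
from os.path import commonprefix


def induce_pattern(left, right, min_stem=4):
    """Generalize a seed paraphrase into a pair of templates.

    Assumption: a "lexical variant" is approximated as two different words
    sharing a prefix of at least `min_stem` characters. Identical words on
    both sides become the slot {Y}; the shared stem becomes the slot {X}.
    """
    lw, rw = left.split(), right.split()
    lt, rt = list(lw), list(rw)
    found_variant = False
    for i, a in enumerate(lw):
        for j, b in enumerate(rw):
            if a == b:
                lt[i] = rt[j] = "{Y}"
            elif not found_variant:
                stem = commonprefix([a, b])
                if len(stem) >= min_stem:
                    lt[i] = "{X}" + a[len(stem):]
                    rt[j] = "{X}" + b[len(stem):]
                    found_variant = True
    return (" ".join(lt), " ".join(rt)) if found_variant else None


def template_to_regex(template):
    """Turn a template such as '{X}ment of {Y}' into a compiled regex."""
    pattern = re.escape(template)
    pattern = pattern.replace(re.escape("{X}"), r"(?P<X>\w+?)")
    pattern = pattern.replace(re.escape("{Y}"), r"(?P<Y>\w+)")
    return re.compile(pattern)


def harvest(pattern, corpus_phrases):
    """Return (phrase, candidate) pairs where both sides occur in the corpus."""
    left_template, right_template = pattern
    left_re = template_to_regex(left_template)
    attested = set(corpus_phrases)
    pairs = []
    for phrase in corpus_phrases:
        match = left_re.fullmatch(phrase)
        if not match:
            continue
        # Fill the right-hand template with the slot values found on the left.
        candidate = right_template
        for name, value in match.groupdict().items():
            candidate = candidate.replace("{" + name + "}", value)
        if candidate in attested:
            pairs.append((phrase, candidate))
    return pairs


# Toy data, invented for illustration only.
seed = ("amendment of regulation", "amending regulation")
corpus = [
    "investment of resources",
    "investing resources",
    "development of software",
    "shipment of goods",
    "amending regulation",
]
pattern = induce_pattern(*seed)
new_pairs = harvest(pattern, corpus)
```

In this toy run the new pair shares no words with the seed, which is the property the abstract highlights. Requiring the generated right-hand side to be attested in the corpus filters out ill-formed candidates such as the one produced from "shipment of goods".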