Enlarging paraphrase collections through generalization and instantiation

AuthorSearch for: ; Search for: ; Search for:
TypeArticle
Proceedings titleProceedings of the Joint Conference on Empirical Methods on Natural Language Processing and Computational Natural Language Learning
ConferenceJoint Conference on Empirical Methods on Natural Language Processing and Computational Natural Language Learning (EMNLP-CONLL 2012), Jeju Island, Korea, July 12-14, 2012
Pages631642; # of pages: 12
AbstractThis paper presents a paraphrase acquisition method that uncovers and exploits generalities underlying paraphrases: paraphrase patterns are first induced and then used to collect novel instances. Unlike existing methods, ours uses both bilingual parallel and monolingual corpora. While the former are regarded as a source of high-quality seed paraphrases, the latter are searched for paraphrases that match patterns learned from the seed paraphrases. We show how one can use monolingual corpora, which are far more numerous and larger than bilingual corpora, to obtain paraphrases that rival in quality those derived directly from bilingual corpora. In our experiments, the number of paraphrase pairs obtained in this way from monolingual corpora was a large multiple of the number of seed paraphrases. Human evaluation through a paraphrase substitution test demonstrated that the newly acquired paraphrase pairs are of reasonable quality. Remaining noise can be further reduced by filtering seed paraphrases.
Publication date
Linkhttp://aclweb.org/anthology-new/D/D12/D12-1058.pdf
LanguageEnglish
AffiliationInformation and Communication Technologies; National Research Council Canada
Peer reviewedYes
NPARC number20494942
Export citationExport as RIS
Report a correctionReport a correction
Record identifierc56f83ba-76e7-4660-8a6f-89f069ee2c44
Record created2012-08-17
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)
Date modified: