Coh-metrix: analysis of text on cohesion and language

Arthur C Graesser, Danielle S McNamara, Max M Louwerse, Zhiqiang Cai

Research output: Contribution to journalArticleScientificpeer-review

668 Citations (Scopus)

Abstract

Advances in computational linguistics and discourse processing have made it possible to automate many language- and text-processing mechanisms. We have developed a computer tool called Coh-Metrix, which analyzes texts on over 200 measures of cohesion, language, and readability. Its modules use lexicons, part-of-speech classifiers, syntactic parsers, templates, corpora, latent semantic analysis, and other components that are widely used in computational linguistics. After the user enters an English text, CohMetrix returns measures requested by the user. In addition, a facility allows the user to store the results of these analyses in data files (such as Text, Excel, and SPSS). Standard text readability formulas scale texts on difficulty by relying on word length and sentence length, whereas Coh-Metrix is sensitive to cohesion relations, world knowledge, and language and discourse characteristics.

Original languageEnglish
Pages (from-to)193-202
Number of pages10
JournalBehavior Research Methods, Instruments & Computers
Volume36
Issue number2
Publication statusPublished - 2004
Externally publishedYes

Keywords

  • Comprehension
  • Humans
  • Language
  • Linguistics
  • Natural Language Processing
  • Reading
  • Software
  • User-Computer Interface

Fingerprint Dive into the research topics of 'Coh-metrix: analysis of text on cohesion and language'. Together they form a unique fingerprint.

  • Cite this