What measurements should linguists use when comparing texts written by different writers? We report aspects of a systematic evaluation of 381 different language measures derived from 200 analytic tools, carried out during the pilot for a study exploring genetic contributions to language variation. The measures covered lexis, structure, meaning, and discourse features, and were evaluated with a focus on capturing numerically the qualitative features that linguists consider central to differentiating one text from another. We review principles for selecting analytic tools and the choices faced by the researcher in processing and analysing data. We then identify and demonstrate five of the measures, which between them provide a useful profile of different linguistic features, and note correlations with psychometric measures taken for each writer. We conclude with some caveats regarding general issues of validity and some indications about potential links between our work and research into authorship attribution for forensic purposes.

Original publication

Journal article
International Journal of Corpus Linguistics
John Benjamins Publishing Company

Publication Date

Pages
429 - 473