CASS

ESRC Centre for Corpus Approaches to Social Science

BNC2014

The British National Corpus 2014

The British National Corpus 2014 (BNC2014) is a major project led by Lancaster University. We created a 100-million-word corpus (a large collection of ‘real life’ language) of present-day British English. This corpus can be used by researchers to understand more about how language works and how it is evolving. Educators, dictionary compilers and the interested public will also be able to access the corpus to find usage examples of modern British English in different genres.

The whole corpus is now available for research (non-commercial) purposes.

The project has been supported by ESRC grants no. EP/P001559/1, ES/K002155/1 and ES/R008906/1.

Contact: v.brezina@lancaster.ac.uk

How to get access?

BNC2014 Written

The corpus is freely available (together with the spoken part) via #LancsBox X. All major research functionalities are available via this tool.

We are also looking into the possibility of releasing the BNC2014 Written via other popular platforms; due to copyright reasons the full texts of the written corpus cannot be released at this stage.

BNC2014 Spoken