Spoken BNC2014

Compiling a new, publicly accessible corpus of British English conversation

This project is a collaboration between CASS and Cambridge University Press. Together, we are collecting samples of real-life, informal, spoken interactions between speakers of British English from across the United Kingdom. These will then be transcribed and made available publicly as the Spoken BNC2014.

We are interested in audio recordings of face-to-face conversations between people who are native British English speakers. The recordings can be on any subject, but should be natural conversation (rather than, e.g. monologues or speeches).

We encourage people from all over the UK to participate. Recordings should be made in a digital format (rather than on e.g. analogue tape players) and sent in MP3 format. For each hour of good quality recordings we receive, along with all associated consent forms and information sheets completed correctly, we will pay £18. Each recording does not have to be 1 hour in length; you may submit two 30 minute recordings, or three 20 minute recordings, but for each hour in total, you will receive £18.

To register your interest in this exciting new project please contact corpus(Replace this parenthesis with the @ sign)cambridge.org


Team:

Co-Investigator: Tony McEnery

Co-Investigator: Claire Dembry (Cambridge University Press)

Co-Investigator: Andrew Hardie

Senior Research Associate: Vaclav Brezina

Research Student: Robbie Love


Read the latest updates on this project:

  • Spoken BNC2014 book announcement (5 August 2016)

    We are excited to announce a forthcoming book which will be published as part of the Routledge Advances in Corpus Linguistics series. “Corpus Approaches to Contemporary British Speech: Sociolinguistic Studies of the Spoken BNC2014” (edited by Vaclav Brezina, Robbie Love and Karin Aijmer) will feature a collection of research which is currently being undertaken by ...

  • The Spoken BNC2014 early access projects: Part 4 (16 March 2016)

    In January, we announced the recipients of the Spoken BNC2014 Early Access Data Grants. Over the next several months, they will use exclusive access to the first five million words of Spoken BNC2014 data to carry out a total of thirteen research projects. In this series of blogs, we are excited to share more information about ...

  • The Spoken BNC2014 early access projects: Part 3 (7 March 2016)

    In January, we announced the recipients of the Spoken BNC2014 Early Access Data Grants. Over the next several months, they will use exclusive access to the first five million words of Spoken BNC2014 data to carry out a total of thirteen research projects. In this series of blogs, we are excited to share more information about ...

  • The Spoken BNC2014 early access projects: Part 2 (4 March 2016)

    In January, we announced the recipients of the Spoken BNC2014 Early Access Data Grants. Over the next several months, they will use exclusive access to the first five million words of Spoken BNC2014 data to carry out a total of thirteen research projects. In this series of blogs, we are excited to share more information about ...

  • The Spoken BNC2014 early access projects: Part 1 (1 March 2016)

    In January, we announced the recipients of the Spoken BNC2014 Early Access Data Grants. Over the next several months, they will use exclusive access to the first five million words of Spoken BNC2014 data to carry out a total of thirteen research projects. In this series of blogs, we are excited to share more information about ...

  • What’s wrong with “a bunch of migrants”? Looking at the linguistic evidence (28 January 2016)

    This week at Prime Minister’s Questions, David Cameron used the term “a bunch of migrants” to describe refugees at a camp in Calais. He was subsequently criticised by Labour MPs and members of the general public on Twitter, and the story was reported on in mainstream newspapers like the Guardian and the Telegraph. Critics described ...

  • Spoken BNC2014 Early Access Data Grant Scheme – winning proposals (13 January 2016)

    Lancaster University’s ESRC funded Centre for Corpus Approaches to Social Science (CASS) and Cambridge University Press are pleased to announce the recipients of the Spoken BNC2014 Early Access Data Grants. These successful applicants will receive exclusive early access to approximately five million words of the Spoken BNC2014 via CQPweb. They will be the first to ...

  • Spoken BNC2014 Early Access Data Grant Scheme – Applications now open (5 November 2015)

    Lancaster University’s ESRC funded Centre for Corpus Approaches to Social Science (CASS) and Cambridge University Press are excited to announce the Spoken British National Corpus 2014 Early Access Data Grant scheme. Applications are now open for researchers at any level in the field of corpus linguistics and beyond to gain early access to a large subset ...

  • The Spoken British National Corpus 2014 – project update (11 August 2015)

    It has been little over a year since CASS and Cambridge University Press announced a collaboration to compile a successor to the spoken component of the British National Corpus, the Spoken BNC2014. This will be the largest corpus of spoken British English since the original, with the advantage of being collected in the 2010s rather ...

  • The Spoken BNC2014 project features in the Daily Mail (26 August 2014)

    The recently announced collaboration between Cambridge University Press and CASS, the Spoken BNC2014 project, has made headlines in the Daily Mail. The article, entitled, “No longer marvellous – now we’re all awesome: Britons are using more American words because traditional English is in decline”, describes the preliminary findings of the project, which is in its early ...