Sat15Nov2014University of Reading
The aim of this workshop is to bring together researchers who adopt the tools and approaches of Corpus Linguistics to study communication in online environments especially social media sites and interactive online and comment forums. We invite papers that:
- look at the interplay of public/hegemonic and private/grassroots discourse in social media and online forums that feature voices of ‘ordinary people’
- deal with methodological questions of mining and annotating data from online and social media platforms
- present case studies based on corpora of social media and online interactive communication
- discuss specific linguistic features, practices and phenomena of social media and online interactive communication
Wed26Nov20144:00 pmCounty South C89, Lancaster University, UK
Dora Alexopoulou (Cambridge)
Joint work with J Geertzen, A Korhonen and D Meurers
The emergence of online EFL teaching platforms offering teaching and learning to students around the globe results in unprecedented amounts of learner production data: data can come from rich task sets across the proficiency spectrum and learners from a variety of linguistic, educational and cultural backgrounds. Exploiting such datasets opens important opportunities for SLA research and, in particular, linking SLA findings to second language teaching. But at the same time, such datasets have all the pitfalls of big data: a range of variables standardly controlled for in carefully designed data collections (e.g. task sets) are not considered. Access to unprecedented numbers of learners is set against lack of rich learner metadata targeted in typical data collections. In addition, the very context of production poses arbitrary constraints (e.g. word limits on writings). Last, but not least, the size of such datasets brings new challenges for extracting information and addressing the noisy aspects of the data.
Can we then use such data for SLA research, crucially, to link SLA findings to teaching second languages? I will argue that Natural Language Processing (NLP) tools can help us address many of the methodological issues and will show that we can obtain valuable information for SLA research. I will use the EF-Cambridge Open Language Database (EFCAMDAT) as an example of a big data resource. I will focus on the the developmental trajectory of Relative Clauses (RCs) as a study case and consider specific issues that can affect the developmental picture, such as task effects, formulaic language and national language effects. I will conclude by showing that not only we can arrive at reliable generalisations about RC development based on a resource like EFCAMDAT, but we can also obtain new generalisations, a fact strongly indicating the potential of big educational data for SLA research.
Words, words, words – corpora and lexis
The following scholars have far confirmed their participation in the conference as plenary speakers:
- Kate Burridge, Monash University
- Thomas Herbst, Erlangen University
- Graeme Trousdale, Edinburgh University
- Edmund Weiner, Oxford University / OED
The conference will follow the regular ICAME format: On Wednesday afternoon, we will start with one or several pre-conference workshops, and the day will end with a first plenary and a reception at the "Electoral Palace" (Kurfürstliches Palais) - one of the most beautiful rococo palaces in the world. On Friday afternoon, the conference excursion will take us to Bernkastel and its surroundings, and will of course also include a boat trip. The conference will end at lunchtime on Sunday.
Trier is Germany's oldest city. It boasts a number of impressive UNESCO World Heritage sites dating back to Roman times, and accompanying partners will find plenty of interesting things to explore. The greater Trier region is also known for its excellent wine and beautiful scenery.
Official website: ICAME36