You are here: University Home > English > Conference home > Workshops and colloquia

Workshops and colloquia

The following workshops and colloquia will be offered on Monday 20 July (a provisional timetable is given below).

You can download participant lists for the workshops with room information here. Places on the workshops are limited. If you didn’t manage to get a place, we recommend you check at the registration desk on the day to see whether there have been any cancellations. We would hope that all participants who have registered for a workshop stay for the full session and do not disturb the running of the workshops by switching rooms.
For colloquia we have booked rooms that are big enough, so we there are no fixed participant lists. Colloquia will be held in the mathematics lecture block (see also our travel advice page).


WORKSHOPS

Please note: All Workshops are now full

Michael Barlow "Using Parallel Corpora for Language Analysis and Language Teaching"

Chris Greaves, Elena Tognini Bonelli, Martin Warren "Exploring phraseological variation and aboutness with ConcGram" Click here for further details 

Patrick Hanks "Mapping Meaning onto Use using Corpus Pattern Analysis"

Adam Kilgarriff "The Sketch Engine"

Dawn Knight, Svenja Adolphs, Ronald Carter and Paul Tennent "A multi-modal approach to the construction and analysis of spoken corpora" Click here for further details


COLLOQUIA

Renata Condi de Souza, Márcia Veirano Pinto, Marilisa Shimazumi, Jose Lopes Moreira Filho, Tony Berber Sardinha, Tania Shepherd "Applications of Corpus Linguistics for EFL classrooms"

Oliver Čulo, Silvia Hansen-Schirra, Stella Neumann "Beyond corpus construction: exploitation and maintenance of parallel corpora" Click here for further details

Dagmar Divjak, Stefan Th. Gries "Converging and diverging evidence: corpora and other (cognitive) phenomena?" Click here for further details

Bettina Fischer-Starcke and Martin Wynne, "Corpus Linguistics and Literature" Click here for further details

Giovanni Parodi, Romualdo Ibañez, René Venegas and Cristián González "Working with the Spanish Corpus PUCV-2006: Academic and professional genre identification and variations"

Alan Scott Partington, Alison Duguid, Anna Marchi, Charlotte Taylor, Caroline Clark "Using parallel newspaper corpora for Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS)" Click here for further details

Alan Wallington, John Barnden, Mark Lee, Rosamund Moon, Gill Philip, Jeanette Littlemore "Corpus Based Approaches to Figurative Language (Endorsed by RaAM)" Click here for further details


PROVISIONAL TIMETABLE

 

Colloquia

Workshops

10.00

Registration

11.00 Partington et. al. Using parallel newspaper corpora for Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS) Wallington et. al. Corpus Based Approaches to Figurative Language (Endorsed by RaAM) Parodi et. al. Working with the Spanish Corpus PUCV-2006: Academic and professional genre identification and variations de Souza et. al. Applications of Corpus Linguistics for EFL classrooms    
12.00 Barlow Using Parallel Corpora for Language Analysis and Language Teaching
13.00

Lunch

14.00 Fischer-Starcke et. al.Corpus Linguistics and Literature Corpus Based Approaches to Figurative Language (continued) Divjak et. al. Converging and diverging evidence: corpora and other (cognitive) phenomena? Čulo et. al. Beyond corpus construction: exploitation and maintenance of parallel corpora Greaves et. al. Exploring phraseological variation and aboutness with ConcGram Kilgarriff The Sketch Engine
16.00

Coffee

16.30 Corpus Linguistics and Literature (continued) Corpus Based Approaches to Figurative Language (continued) Converging and diverging evidence (continued) Beyond corpus construction (continued) Knight et. al. A multi-modal approach to the construction and analysis of spoken corpora Hanks Mapping Meaning onto Use using Corpus Pattern Analysis
18.30

Main Conference Registration

19.00

Official opening of the conference and wine reception

 

FURTHER DETAILS

   
Chris Greaves, Elena Tognini Bonelli, Martin Warren "Exploring phraseological variation and aboutness with ConcGram"

This workshop demonstrates ConcGram (Greaves, 2009) which is designed to automatically identify phraseology and, in particular, phraseological variation in texts and corpora. ConcGram makes a significant contribution to better uncovering phraseology which is underpins Sinclair’s (1987) idiom principle.

The idea behind the program was to fully automatically retrieve the co-selections which comprise lexical items (Sinclair, 1996 and 1998; Cheng, Greaves, Sinclair and Warren, 2009) and other forms of phraseology from a text or corpus. Single word frequencies are not the best indicators of frequent phraseologies in a corpus, and n-grams miss phraseologies that have constituency (AB, A*B) and/or positional (AB, BA) variation. ConcGram identifies all such co-occurrences of two or more words fully automatically with no prior search parameters and so supports corpus-driven research (Tognini-Bonelli, 2001).

The workshop is devoted to a hands-on introduction to the program and begins with how to concgram a corpus. Examples of concgram concordances are viewed and the functions which enable the user to review concgram configurations are explained.

The importance of ‘list management’ is described and how to use ConcGram in ‘user-nominated’ mode. Lastly, a procedure is described to arrive at the aboutness of a text or corpus based on its most frequent concgrams (Sinclair, 2006).

References

Cheng, W., Greaves, C., Sinclair, J. McH. and Warren, M. in press, 2009. Uncovering the extent of the phraseological tendency: towards a systematic analysis of concgrams. Applied Linguistics.

Greaves, C. 2009. ConcGram 1.0: A phraseological Search Engine. Amsterdam: John Benjamins.

Sinclair, J. McH. 1987. Collocation: A Progress Report.  In R. Steele and T. Threadgold (eds.) Language Topics: Essays in Honour of Michael Halliday. Amsterdam: John Benjamins. 319-331.

Sinclair, J. McH. 1996. The search for units of meaning. Textus 9/1: 75-106.

Sinclair, J. McH. 1998. The lexical item. In. E. Weigand (ed.) Contrastive Lexical Semantics. Amsterdam: John Benjamins. 1-24.

Sinclair, J. McH. 2006. Aboutness 2. (manuscript), Tuscan Word Centre, Italy.

Tognini-Bonelli, E. 2001. Corpus Linguistics at Work. Amsterdam: John Benjamins.


   
Dawn Knight, Svenja Adolphs, Ronald Carter and Paul Tennent "A multi-modal approach to the construction and analysis of spoken corpora

This workshop outlines a novel approach for the construction and analysis of multi-modal corpora. It draws upon development made as part of two projects based at the University of Nottingham: Digital Records for eSocial Science 1 and 2 (DReSS I and II).

During the workshop we will provide a real-time demonstration of key features of DRS (Digital Replay System), and will discuss how this free software can provide an ideal platform for constructing bespoke multi-modal corpus datasets.

Following this demonstration, participants will have the opportunity to test-drive DRS for themselves (guidelines for use will be provided), and to ask any technical questions that might arise as a result of this. A range of practical, methodological and ethical challenges and considerations will also be discussed here. Participants are also encouraged to provide feedback (and any related questions) on the system and to fuel discussions on the potential long-term applications of such a tool in the future of multi-modal Corpus Linguistic research (encouraging participants to draw on their own research experiences to contextualise their ideas/ feedback).

As an extension to the work on multi-modal representation of spoken discourse, we will also briefly discuss how DRS is currently being adapted to support the collection and collation of more heterogeneous datasets, including SMS messages, MMS messages, interaction in virtual environments, GPS data and face-to-face situated discourse, as part of the DReSS II project (preliminary datasets collected as part of this new project will be showcased, within the DRS environment). The focus of this work is on enabling a more detailed investigation of the interface between a variety of different communicative modes from an individual’s perspective, tracking a specific person’s (inter)actions over time (i.e. across an hour, day or even week).


   
Oliver Čulo, Silvia Hansen-Schirra, Stella Neumann "Beyond corpus construction: exploitation and maintenance of parallel corpora"

Parallel corpora, i.e. collections of originals and their translations, can be used in various ways for the benefit of translation studies, machine translation, linguistics, computational linguistics or simply the human translator. In computational linguistics, parallel data collections serve as training material for machine translation, a data basis for multilingual grammar induction, automatic lexicography etc. Translation scholars use parallel corpora in order to investigate specific properties of translations. For professional translators, parallel corpora serve as reference works, which enable quick and interactive access and information processing.

With issues of the creation of large data collections including multiple annotation and alignment largely solved, exploitation of these collections remains a bottleneck. In order to effectively use annotated and aligned parallel corpora, there are certain considerations to be made:

• How can query tools be enhanced with regard to usability and effectiveness?
• How can we ensure high quality of information extraction tools working on corpora?
• How can corpus quality, especially issues like data format validity and robust processing, be enforced?
• How can we keep corpora maintainable for future expansion and data sustainability?

The abstracts of contributors to the colloquium are organized as follows. We start with two topics that concentrate on corpus formats – the first being about the balance between academia and economy, the other with a view on several distinct languages – and how they influence query and maintenance. The third presentation introduces a tool for parallel treebanks of interest particularly with a view to querying. Presentation four is an example of enriching and extending an existing corpus resource thus addressing aspects of corpus sustainability and maintenance. The last two presentations focus on extracting information for more theoretical topics, querying parallel corpora for studies on translation effects.

Each presentation is scheduled with 35 minutes, with at least 5 minutes for questions and discussion. A general discussion during the last 30 minutes will be guided by the questions raised above.


   
Dagmar Divjak, Stefan Th. Gries "Converging and diverging evidence: corpora and other (cognitive) phenomena?"

In this colloquium, we want to focus on how well corpora predict cognitive phenomena. Despite the importance attributed to frequency in contemporary linguistics, the relationship between frequencies of occurrence in texts on the one hand, and status or structure in cognition as reflected in experiments on the other hand has not been studied in great detail, and hence remains poorly understood. This colloquium explores the relationship between certain aspects of language and their representation in cognition as mediated by frequency counts in both text and experiment. Do certain types of experimental data fit certain types of corpus data better than others? Which corpus-derived statistics correlate best with experimental results? Or do corpus data have to be understood and analyzed radically differently to obtain the wealth of cognitive information they (might) contain?

For this colloquium we have selected submissions that report on converging as well as diverging evidence between corpus and experimental data and interpret the implications of this from a cognitive-linguistic or psycholinguistic perspective. The six contributions deal with phenomena from morphology and morphosyntax (A. Caines “You talking to me? Testing corpus data with a shadowing experiment”, J. Svanlund “Frequency, familiarity, and conventionalization”), lexico-grammar (M. Mos, P. Berck & A. van den Bosch “The predictive value of word-level perplexity in human sentence processing”, L. Teddiman “Conversion & the Lexicon: Comparing evidence from corpora and experimentation”) and lexical semantics (D. Dobrovol’skij “The Lexical Co-Occurrence of the Russian Degree Modifier črezvyčajno. Corpus Evidence vs. Survey Data”, J. Littlemore & F. MacArthur “Figurative extensions of word meaning: How do corpus data and intuition match up?”). They provide both converging (M. Mos, P. Berck & A. van den Bosch, L. Teddiman, D. Dobrovol’skij) and diverging (J. Svanlund, J. Littlemore & F. MacArthur) evidence.


   
Bettina Fischer-Starcke, Martin Wynne "Corpus Linguistics and Literature"

Corpus stylistics combines the analytic techniques of corpus linguistics with the goals of stylistics, examining the language of literary texts by using computational means of analysis. This colloquium will continue the ongoing process of the presentation and discussion of approaches to corpus stylistic analysis. It aims to further the discussion of what is possible, to discuss current approaches, and to provide a forum for practitioners of the field to meet and to discuss future directions. This colloquium will present and consider the latest methods of analysis in corpus linguistics that are helping to shed light on various aspects of the language of literature, and consider whether corpus stylistics has now come of age.

A series of short presentations and discussions will demonstrate that is becoming increasingly possible to test empirically claims about the language of literature, to search for and provide evidence from texts, and to establish the norms of literary and non-literary style.

Doug Biber, Using corpus-based analysis to study fictional style: A multi-dimensional analysis of variation among and within novels

Carmen Aguilera Carnerero & Encarnación Hidalgo Tenorio, Shakespeare’s linguistic subversion: a corpus-based study of modality and transitivity in Hamlet

Bettina Fischer-Starcke, The Phraseology of Jane Austen’s Pride and Prejudice

Jane Johnson, A corpus-assisted analysis of salient stylistic features in nineteenth-century Italian novels and their English translations

Andrew Kehoe & Matt Gee, A wiki tool for the collaborative study of literary texts

Lesley Moss, Henry James Beyond the Numbers: Applying corpus analysis to the text

Kieran O'Halloran, Reading Group Discourse: a Corpus-based Analysis of Argumentation and Collaboration

Martin Wynne, Corpus stylistics: a survey of methods and resources


   
Alan Scott Partington, Alison Duguid, Anna Marchi, Charlotte Taylor, Caroline Clark
"Using a newspaper corpus for Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS)"

This colloquium presents a collection of papers within the discipline of Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS), which employs relatively large corpora of a parallel structure and content from different moments of contemporary time. The SiBol group, who are presenting papers in this colloquium, are currently using two large corpora of broadsheet newspaper discourse from 1993 and 2005 (approximately 100 and 150 million words respectively). The size of the corpora allows for the diachronic study of not only grammatical developments but also variations in lexical and phrasal preferences, enabling observation of changes in newspaper style (which reflects shifting relationships between newspapers and their readerships as well as perhaps overall changes in language), and analysis of socio-cultural change; comparing attitudes to social cultural and political phenomena. The colloquium aims to give an overview of MD-CADS and the rationale behind the corpus compilation and interrogation (Partington) and then to present a series of case studies illustrating both the methodology and findings. Duguid scrutinises lists of key-items from the 1993 and 2005 corpora and considers some lexical evidence of increasing conversationalisation or informalisation, particularly in the way in which evaluation has become salient in the discourse. Marchi examines and compares the use of the items moral* and ethic* to see what the papers construed as moral issues in 1993 and 2005 and how they are evaluated. Taylor conducts an empirical investigation of the changing rhetorical function of science in the news. Following the individual papers there will be opportunity for more general discussion on the methodology of MD CADS, the organisers will be particularly interested to hear from researchers participating in similar projects.