Title: Pedagogic uses of a corpus of student writing
1Pedagogic uses of a corpus of student writing
- and their implications for sampling and
annotation - Alois Heuboeck
- University of Reading, UK
2The British Academic Written English (BAWE)
corpus of student writing
- Project in progress at the universities of
Reading, Warwick and Oxford Brookes - Funded by the Economic and Social Research
Council - (project nr. RES-000-23-0800)
3Outline
- Corpora in LT uses and purposes
- Accessing corpus information interfaces
- Building corpora requirements and decisions -
the BAWE corpus
4Using corpora in language pedagogy
classroom
pedagogic uses
materials
description
motivational
purposes
linguistic
5Interfaces (1) the concordance
typical query options
- wildcards (e.g. investigat)
6Information interfaces (2)
statistics
- e.g. word list, key words
- macrostructural properties and choices
corpus items
- generic types, e.g. CARS model (Swales 1990)
7Requirements a good corpus for language
pedagogy
- Representative target variety
- Relevant information, annotation
- Usable e.g. interface, size
8Representativeness
The corpus as a representative sample should
reflect
Conflicting principles
- distribution and quantitative relations
quantitative representativeness
qualitative representativeness
9Representativeness (2)the BAWE corpus
A trade-off stratified sampling
AH
PS
Frame 2 4 disciplinary groups à 768 ass.
English
History
Linguistics
Classics
Archaeology
History of Art
Physics
Chemistry
Meteorology
Mathematics
Computer Science
Engineering
Frame 1 the university corpus S3,072 ass.
Frame 3 4x6 disciplines à 128 ass.
Frame 4 4 levels per discipline à 32 ass.
SS
LS
Biological Sciences
Health Social Care
Sociology
Law
Business
Politics
Anthropology
Publishing
Medicine
Biochemistry
Agriculture
Food Sciences
10Relevance
Relevant information in corpus
Significant query
Corpus annotation
Features lexicogrammatical, structural etc.
11Relevance (2) features annotated in the BAWE
corpus
- textual structure of running text
- other interesting features
12Corpus size
For the pedagogical analysis of many common
grammatical phenomena a full-size research corpus
is much too large. (Osborne 2000)
Modularity subcorpora
Specialised corpora
13Conclusion 3 views
- Qualitative vs. quantitative representation
corpus as representation of a (set of) target
variety/varieties
- Corpus annotation and interfaces query
instances of lexicogrammatical (etc.) features
and phenomena
balanced samples of target variety/varieties
14Pedagogic uses of a corpus of student writing
and their implications for sampling and annotation
Alois Heuboeck University of Reading,
UK a.heuboeck_at_reading.ac.uk The British
Academic Written English corpus http//www.warwick
.ac.uk/go/BAWE/overview