Title: Main Mono and Bilingual Tasks: Track Organisation and Results Analysis
1Main Mono and Bilingual Tasks Track
Organisation and Results Analysis
2Outline
3CLEF Infrastructure DIRECT
4Information Hierarchy
- experimental collections and the experiments are
data, since they are the raw, basic elements
needed for any further investigation - performance measurements are information, since
they are the result of computations and
processing on the data, - descriptive statistics and the hypothesis tests
are knowledge, since they are a further
elaboration of the information carried by the
performance measurements - theories, models, algorithms, and techniques are
wisdom, since they provide interpretation,
explanation, and formalization of the content of
the previous levels.
5Approach to the Evaluation (1/2)
- Introduce a conceptual model
- it makes clear what are the entities entailed by
the information space of an evaluation campaign,
their features, and their relationships - logical models can be derived from it to manage
and preserve the experimental data - commonly agreed data formats for exchanging
information can be derived from it - Develop common metadata formats
- they provide meaning to the data, and thereby
enable their sharing and re-use - they allow to keep track of the lineage of the
managed information - Adopt a unique identification mechanism
- it allows for explicit citation and easy access
to the scientific data and it supports the
enrichement of the scientific data
6Approach to the Evaluation (2/2)
- Provide common tools for statistical analyses
- they allow for judging whether measured
differences between retrieval methods can be
considered statistically significant - a uniform way of performing statistical analyses
on experiments make the analysis and assessment
of the experiments comparable too - Design and develop a Digital Library System (DLS)
for IR scientific data - it is well suited for managing and making
accessible the scientific data and the
experiments produced during the course of an
evaluation campaign - it also provides tools for analyzing, comparing,
and citing the scientific data of an evaluation
campaign, as well as curating, preserving,
annotating, enriching, and promoting the re-use
of them - Give to organizations responsible for evaluation
initiatives an active role in this process - they should take a leadership role in developing
a comprehensive strategy for long-lived digital
data collections and drive the research community
through this process in order to improve the way
of doing research - they should take care also of defining guiding
principles, policies, best practices for making
use of the scientific data produced during the
evaluation campaign itself
7Internationalization of the User Interface
Bulgarian Petya Osenova, Kiril Simov
Czech Pavel Pecina
English Marco Dussin
French Jacques Savoy
German Thomas Mandl
Indonesian Mirna Adriani
Italian Marco Dussin
Portuguese Paulo Rocha, Diana Santos
Spanish Julio Villena Román
8Identification Digital Object Identifiers (DOI)
10.2415/AH-BILI-X2BG-CLEF2007.JHU-APL.APLBIENBGTD4
- DOIs
- allow us to uniquely identify a digital object
- are persistent and actionable
- aim especially at the intellectual property
- We assign DOIs to
- collections - prefix 10.2453
- topics - prefix 10.2452
- experiments - prefix 10.2415
- pools - prefix 10.2454
- statistical tests - prefix 10.2455
http//www.medra.org
9DOI Resolution
http//dx.doi.org
10Experiment Metrics
11Experiment Statistics
12Experiment Plots
13Task Statistics
14Task Plots
15Appendices (1/2)
16Appendices (2/2)
17Track Overview
18Participation
22 participants 12 countries
19Participation by Country
20Tasks and Collections
- Monolingual and bilingual tasks have principally
offered for Central European languages
Bulgarian, Czech and Hungarian - Topics in 16 languages
- European languages Bulgarian, Czech, English,
French, Hungarian, Italian and Spanish - non-European languages (for X2EN) Amharic,
Chinese, Indonesian, Oromo - Indian sub-task Bengali, Hindi, Marathi, Tamil
and Telugu
Language Task Collection
Bulgarian Monolingual BG, Bilingual X2BG Sega 2002, Standart 2002, Novinar 2002
Cezch Monolingual CS, Bilingual X2CS Mlada fronta DNES 2002, Lidové Noviny 2002
Hungarian Monolingual HU, Bilingual X2HU Magyar Hirlap 2002
English Bilingual X2EN (Indian sub-task) LA Times 2002
21Participation by Task
172 submitted runs
Disappointing participation
22Runs by Source Language
23Monolingual Tasks
24Monolingual Bulgarian
25Monolingual Czech
26Monolingual Hungarian
27Monolingual English
28Approaches to Monolingual Retrieval
- Main emphasis
- stemming
- morphological analysis
- relevance feed-back
Morphological Lemmatizer
29Bilingual Tasks
30Bilingual X ? English
31Approaches to Bilingual X2EN
- Main emphasis
- bilingual dictionaries
- machine translation
- coverage of lexicons
- use of pivot languages
Best Bilingual English system is about88 of the
best monolingual system
- bilingual dictionaries and pivot languages
- query expansion with RF
- parallel corpora
- translation ambiguity resolution with a graph
based approach - lexicon coverage with a pattern-based approach
- Afaan Oromo stemmer
- stop list creation
- bilingual Oromo-English dictionary creation
- Bilingual Hungarian to English
- bilingual dictionary
- exploiting Wikipedia to remove improbable
translations
32Bilingual X2EN Indian Subtask