__
.
--

.
ASLIB CONFERENCE

Translating and the Computer 27 Conference

Day One: 24 November 2005 (09.00-17.40)
(Click here for Day Two: 25 November 2005)

09.00 Registration

09.45 Introduction by Chair: Chris Pyne
During the past 20 years Chris Pyne has worked as a technical translator, agency manager, consultant and managing director within the translation industry. He has always taken a keen interest in translation technology and is the proud owner of a Multiterm 1.0 version on a 5¼" floppy disk. He is currently Global Partner Manager for translation partners at SAP AG and together with his team is developing the SAP Translation Ecosystem with over 60 agencies.

09.50 Keynote Presentation: TM, TRADOS & SDL. What's Next?
Jochen Hummel is General Manager EMEA (technology division) with a focus on technology architecture and ensuring SDL continues to serve the needs of SDL's major customers within the localization supply chain. SDL recently acquired TRADOS.
He started his career as freelance translator and developer for IBM. In 1984 he co-founded TRADOS in Stuttgart, Germany and grew it to a successful provider of translation and programming services. Around 1990 he converted the business into a software company and designed and developed TRADOS Translator's Workbench, with over 70% market share the most used translation memory application in the world. In 1997 Microsoft bought 20% of TRADOS, Microsoft's first direct investment in a German company. In 2000 he relocated the HQ to the US, assembled a world-class executive management team and raised money to develop and market enterprise applications for handling multilingual content. Even during the IT slowdown he grew TRADOS' enterprise business at a 35% year-on-year growth rate, while holding the chairman and CTO position. He has been finalist in the 2001 Greater Washington E&Y Entrepreneur of the Year Award.

10.35 Discussion

10.40 Bringing Intelligence to Translation Memory Technology
Mickel Grönroos and Ari Becks, Master's Innovations Ltd, Finland

This paper argues that Translation Intelligence - the patented translation technology invented and developed by the Finnish translation technology company Master's Innovations Ltd - is the next generation of translation memory technology that supersedes current state-of-the-art TM systems in usability. This is because it is based on real self-learning and intelligent re-use of human translation knowledge, which adapts it to the user's way of translating ten times faster than a traditional TM tool that mainly recycles strings of characters. In contrast to traditional TM systems, Translation Intelligence operates on flexibly sized segments on the sub-sentence level, where there is in all text types much more repetition than on the sentence level. Furthermore, on the grammatical and semantic levels, Translation Intelligence reuses whenever possible human translation knowledge as translation patterns. Consequently, it is able to generate a translation of a previously unseen segment by making us of pre-translated, similarly structured examples.

11.20 Discussion

11.25 Coffee

11.55 SIMILIS Second-Generation Translation Memory Software
Emmanuel Planas, Lingua et Machina, France

First generation memory works at the first level: information. It sees text as code. SIMILIS second-generation memory offers the possibility of combining two types of machines to work at the morphological-lexical-syntactic level. The software contains a monolingual lexicon for each language processed and algorithms that allow it to analyse and identify grammatical categories of individual words (because the software now works with words rather than a code) and then group these categories of words into phrases. This new capability offers a more efficient translation support tool. To take just one example, SIMILIS can recognise that one group of words (for example, a nominal group) is separate from another group (for example, a verbal group) and then offer the translation for one of the groups as a possible translation later in the text.

12.30 Discussion

12.35 Finding the System that Suits You Best: Towards the Normalization of MT Evaluation
Paula Estrella, Andrei Popescu-Belis and Nancy Underwood, University of Geneva, Switzerland

The Framework for MT Evaluation, FEMTI, brings together the many disparate metrics and methods which have been devised for MT and helps evaluators to design an evaluation plan based on their particular context of use, aiming to generate more reusable plans. The term evaluators does not only mean developers and programmers but also end-users, managers and anyone else with a stake in the acquisition or deployment of a system, thus the use of FEMTI is not limited to experts in the field of MT. This paper will describe FEMTI and the enhancements being made to it, in particular the interfaces which not only allow evaluators to create their own tailor-made evaluation plans, but also contribute their experience and expertise in improving the resource for the community at large.

13.10 Discussion

13.15 Lunch in the Atrium

14.15 Panel Discussion: The Future if TM Technology - with Jochen Hummel and others

15.15 Tea

15.45 Reverse Localisation
Reinhard Schäler, University of Limerick, Ireland

This paper will look at the general perception that localisation is about
linguistic and cultural adaptation of digital content to the requirements of
foreign markets and that localisation is successful if the origin of the
material can no longer be detected. In fact, what is
happening in a more and more globalised society (not just economy) is that
publishers, and especially publishers of advertisements, play with
'strangeness' and stereotypes. So for example, there are advertisements
running completely in French on Irish television. Radio advertisements in
English speaking countries are completely in German (or in English with
heavy German accents). Rather than adapting to the culture of the target
country, rather than avoiding differences, publishers highlight the
differences, focus on 'strangeness', introduce (rather than avoid)
accents -- to increase sales. As a by-product, the entertainment value for
the consumer increases significantly.

16.20 Discussion

16.25 Translation of Magazines and Brochures Using a Tagged RTF File Workflow
David Calvert, TransForm GmbH, Germany

The translation of corporate magazines and brochures poses a particular set of challenges due to the need for specific expertise in the areas of DTP software, typography, reprographics and hardware platforms. Integrating translation memory and terminology databases while making widespread use of freelance translators further complicate the process. This paper looks at how
these issues have been tackled in one specific company.

17.00 Discussion

17.05 TROFFI - comprehensive management system tailored for a translation agency
Andrew and Jurek Nedoma, Lido-Lang Technical Translations

This presentation describes the main features and advantages of a completely new software package called TROFFI (for TRanslation OFFIce), tailored for global managing a translation agency of any size. The presentation is addressed to Language Service Providers interested in solutions for their companies and its project management, as well as for companies dealing with preparation, implementation and maintenance of Quality Management Systems.
The idea and features of TROFFI are inspired and based on almost 15-year experience in managing a Translation Agency as well as experience in preparation for and successful certification to ISO 9001:2000 standard.
The functions of TROFFI are structured along three axes:
1) modern technology enabling accessibility of TROFFI via the Internet;
2) advantages of TROFFI as a global translation office management system,
3) tool prepared for ensuring conformity with ISO 9001:2000 standard requirements.

17.40 Discussion and close of Day One


Day Two: 25 November 2005 (09.00-17.00)

09.00 Coffee and Registration (for Day Two delegates only)

09.25 Introduction by Chair: Professor Ruslan Mitkov
Prof. Dr. Ruslan Mitkov has been working in Computational Linguistics, Machine Translation and related areas since the early 1980s. His extensively cited research output and interests cover areas such as anaphora resolution, machine translation, translation memory and translation aids, automatic abstracting, centering, term extraction, question answering and computer-aided learning/testing. He is author of the monograph Anaphora resolution (Longman) and sole editor of the Oxford Handbook of Computational Linguistics (Oxford University Press). Current projects include his role as Editor-in-Chief of the Natural Language Processing book series of John Benjamins publishers and Editor-Consultant of Oxford University Press' publications in Computational Linguistics. He was recently Guest Editor of the journals Machine Translation and Computational Linguistics. He has been invited to give keynote speeches at a number of international conferences and has acted as Programme Chair for several important conferences on Natural Language Processing and Machine Translation. At the University of Wolverhampton he developed a successful group in Natural Language Processing which is already known for its innovative research in various areas of the field and for its NLP tools and resources.

09.25 Automatic Detection of Translation Errors: The TransCheck System
Graham Russell, Université de Motréal, Canada

This paper will discuss the application of the state of the art language technology to the practical problem of automatic detection of translation errors. Specifically, it describes the current version of TransCheck, a translation tool under development. The work is motivated with reference to features of the contemporary translation market, including the acceptance of computer implemented support for different parts of the translation-process. It is situated with respect to other types of textual error detection and to the overall translation problem; the nature of translation errors is discussed and the difficulty of general case translation error detection is shown.

10.05 Discussion

10.10 EXTER: A Breakthrough Solution for Efficient Terminology Extraction
Cyril Chantrier, TEMIS SA, France

Temis decided to collaborate with EDF and build a new generation of terminology extraction tools, built on the Lexter prototype and the Temis extraction technology-namely Insight Discovery Extractor (IDE). The name of the extraction solution is EXTER. Having worked on a project for a car manufacturer on a corpus of 3 million words in French, the EXTER solution provided an extremely good level of quality and relevancy of proposed terms, according to the users of output with 50 000 terms candidates instead of the expected 300 000. As a result, not only the quality of the proposed term is higher, but the validation and the cleaning has been divided by a factor of 6, increasing the ROI and the TTM of such a project. The current version is supporting French and English, with extensions in German, Spanish and Italian.

10.50 Discussion

10.55 Coffee

11.15 MyCaTEx, A Language Independent Term Extractor
José Vega, my-xML, Luxembourg

This paper presents my-xML Candidate Term Extractor (MyCaTEx), that works without any language specific resources. It is currently being developed by my-xML, a language engineering company specialising in multilingual content management. MyCaTEx term extraction algorithm is based on the current research of Jacques Vergne, University of Caen, France. MyCaTEx can be used for term extraction, semi-automatic and automatic generation of multilingual thesauri and document tagging and classification.

11.50 Discussion

11.55 Research Meets Practice: t-survey 2005: An Online Survey on Terminology Extraction and Terminology Management
Daniel Zielinski, Saarland University, Germany

This paper reports the results of an ongoing online survey on terminology management and terminology extraction conducted by the Linguistic Data Processing section of the Applied Linguistics and Translating/Interpreting Department at the University. The survey has been available on the Internet in English, French, German and Spanish since mid-May 2005 and continues to be accessible to the public. It has been promoted in many major CAT mailing lists and by translator and interpreter associations. To date, almost 400 professional translators, terminologists and interpreters all over the world have responded to the questionnaire. With this survey, we want to investigate how research and practice are related in the area of terminology, extraction and to evaluate if there is any need to reconcile both. Aimed at translators, terminologists, interpreters and project managers, the main goals of the survey are to investigate the dissemination and application of terminology management tools (with a focus on terminology extraction tools) and to assess the demands on today's terminology extraction tools.

12.30 Discussion

12.35 Embedding free online machine translation into monolingual websites for multilingual dissemination: a case study of implementation
Federico Gaspari, University of Manchester, UK

A growing number of websites that are only available in one language rely
on free online machine translation (MT) services to disseminate their
contents in a variety of other languages, in order to make themselves
accessible to Internet users with different linguistic backgrounds. This
approach to the management and delivery of digital information that
bypasses professional localisation and translation raises a number of
thorny issues, but clearly shows that free online machine translation
services are regarded as valuable tools to overcome language barriers in
the online environment. However, the vast majority of websites that adopt this strategy fail to take full advantage of the potential offered by free web-based MT, mainly due to poor consideration of crucial issues in human-computer interaction and web usability that are vital to ensure that Internet users have a
positive and successful online experience. This paper presents the key
stages and challenges involved in implementing this approach to the
multilingual dissemination of online content, whereby free online machine
translation is embedded into the architecture of a monolingual website.
The main technical and practical issues are illustrated by means of an
implementation case study based on the website supporting London's
successful bid to host the 2012 Olympic Games.

13.10 Discussion

13.15 Lunch

14.15 Panel Discussion: The Current State of Localisation with Michael Anobile (LISA) and Reinhard Schaler.

15.15 Tea

15.40 Does Using Controlled Language Improve Machine Translation Results?
Nathalie de Preux, University of Geneva, Switzerland

A major concern for those companies that use MT is to improve the quality of
the system produced raw translation as much as possible. One promising
approach seems to be to influence the input text by constraining its lexical
items and grammatical constructions - in other words, restricting input text to
a controlled language (CL). The extent of the improvement of applying CL rules
can be evaluated by comparing the machine translations of CL and non CL texts.
In the quantitative part of our evaluation, the results are relatively
satisfying. It proves to be the case that once the number of errors are
counted in the translation of each version (taking into account the gravity of
the errors), there is an improvement of about 25% in the translation of texts
to which CL has been applied. However, the results of the qualitative
evaluation are not so positive, with improvement being assessed at around 8%.
Of course, the subjective nature of qualitative criteria undermines the
reliability of these results. Thus, as a whole, our results show that texts
produced with the aid of a CL lead to better translations (by MT Systems) than
do free texts.

16.15 Discussion

16.20 Controlled Language and the Implementation of Machine Translation for Technical Documentation
Laura Ramirez Polo, Saarland University, Germany

This paper will present a study examining whether texts written in controlled language are more translatable than texts which are not compliant with the CL rule set. For this study, the system CLAT, a sophisticated language checker developed by IAI, will be used. The study is divided into two phases. The goal of both phases are different and can be summarised as follows:
1) selection of resources and 2) evaluation. Using the FEMTI-Framework as a base, this study works to establish a standardised methodology and to explore new metrics of evaluation for contexts where MT comes into question as a technology and making the evaluation design re-usable for future potential evaluations.

16.55 Discussion and Close of Conference

 

 

--
-