__
.
--

.
ASLIB CONFERENCE

Translating and the Computer 29 Conference

Day One: 29 November 2007
Go to Day Two

08.30 Registration

09.15 Introduction by Chair: Olaf-Michael Stefanov, United Nations, Vienna

09.20 Translation Wikified: How will Massive Online Collaboration Impact the World of Translation?
Alain Désilets
, National Research Council of Canada

Massively collaborative sites like wikipedia, YouTube and SecondLife are revolutionizing the way in which content is produced and consumed world wide. They are part of a broader wave of open collaboration that includes things like open-source software, blogs, Creative Commons licensing and free culture. These fundamentally collaborative technologies and paradigms are bound to have a profound impact on the way that content is not only produced, but also translated. In this talk, we will sample a number of translation related questions that naturally arise in this new frontier. For example, what does it mean to translate content that is constantly being edited collaboratively by a large community of anonymous authors? How might translators benefit from open, wiki-like translation resources? In a world where anyone can write and publish original content in their native language, will we need to cover more language pairs, and if so, how might Machine Translation technology help? Could massively collaborative technologies save freelance translators from extinction by allowing them to bid on parts of large contracts without giving up their autonomy and by allowing them to share expertise within the context of a world-wide community of practice? Will we see the emergence of a new breed of "amateur" volunteer translators and will this result in a de-skilling of the translation profession? How can organisations best leverage the collaborative energy of this new breed of translators? How do we ensure the quality of translations and translation resources in such a seemingly chaotic collaborative environment? Can massively collaborative technologies help save minority languages from extinction? Can teachers of translation take advantage of such technologies to provide students with real-life translation experience early on in their training? 

We will illustrate questions like those with real-life examples of projects and trends that are currently happening. Using those examples, we will show how mass collaboration technology sometimes introduces new problems, while in other cases it helps solve them or even creates exciting new opportunities and niches that did not exist before.


10.00 Making a Sow's Ear out of a Silk Purse: (Mis)Using Online MT Services as Bilingual Dictionaries
Federico Gaspari, University of Manchester, UK
This paper reports (part of) a survey carried out among 104 university students in the United Kingdom investigating their usage of free online machine translation (MT) services. The study focuses in particular on the widespread use of these MT tools for a purpose that they were not designed for, i.e. the translation of single lexical items, as if they were bilingual dictionaries. The 104 (from an original survey of 280) had used web-based MT services in the past, and 65 of these (62.5%) reported using them for single-word lookup. This finding suggests that designers and developers of online MT services should seriously consider taking a proactive approach towards raising the awareness of users with regard to the most (in)appropriate ways of using web-based MT software. The paper argues that it would be in the interests of those who have a stake in offering and promoting MT in the online environment (e.g. system designers, developers, and ultimately the MT vendors themselves) to manage the expectations of naïve users.

10.40 Discussion

10.50 Coffee

11.20 Removing the Distinction Between a Translation Memory, a Dictionary and a Parallel Corpus
Vincent Vandeghinste, KULeuven, Belgium
This paper presents a prototype MT system which does not make the distinction between a dictionary, a sub-sentential aligned parallel corpus, and post-edited information (translators output) like a translation memory. The system is based on the METIS-approach (Vandeghinste et al, 2006), and uses an XML-based dictionary format in which not only simple word-to-word translations can be included, but which also contains complex dictionary entries, including discontinuous entries, like idioms and proverbs. The presented prototype is a system that automatically adapts its dictionary and target language corpus depending on the post-edited output as made by the users of the system, and will therefore have a learning curve in its performance.

12.00 Discussion

12.05 A Dynamic Dictionary for Discovering Indirect Translation Equivalents
Professor Anthony Hartley, University of Leeds, UK
This paper
presents a free on-line service to help translators find solutions to the translation of difficult expressions from the general lexicon. It is often the case that novel non-technical expressions prove more problematic than novel terms. The most original feature of the tool is that it uses large monolingual corpora in the source and target languages -- rather than parallel, translated corpora -- to propose and rank non-literal translation solutions. In this way, the method extends the range of existing, static bilingual dictionaries in dealing with an open set of multiword expressions. To do this, it exploits automatically created thesauri and tools for building concordances. We present an evaluation of the coverage of our tool and its usability by translators. Currently the system works between English and Russian, but it can be quickly extended to other language pairs for which machine-readable dictionaries are available. The system is the result of the 2-year research project ASSIST funded by EPSRC. The interface to the system is available at: http://corpus.leeds.ac.uk/assist/v05/.

12.45 Discussion

12.50 Lunch and Exhibition (please note: coffee will be served in the exhibition area)

14.10 Introduction by Chair: Daniel Grasmick, SAP AG, Germany

14.10 A Free Terminology Extraction Suite
Antoni Oliver and Mercè Vàzquez, Universitat Oberta de Catalunya, Spain
In this paper we will present a set of terminology extraction tools that are distributed under a Free Software License, so that users can freely download, use, distribute and modify them to meet their needs. The tools are mainly programmed in Perl and they will work under different platforms, such as Windows or Linux. These terminology extraction tools will help freelance translators, translation agencies and companies to find the best translation of a term or to build monolingual or multilingual terminological glossaries. Moreover, translators, correctors and terminologists can use The Free Terminology Extraction Suite to create a terminological database for a specialist domain so as to automatically obtain a list of domain-specific lexical units (potential terms) with their equivalent translations from bilingual corpora of domain-specific documents.

14.50 Discussion

14.55 In Other Words: Using Paraphrases in Translation
Pernilla Danielsson, University of Birmingham, UK
The presentation involves three parts; Firstly, a corpus consisting of multiple English translations of the same source texts is presented as the language resource used in this study (The Transfer Corpus). Secondly, a study focusing on the identification of paraphrases in this corpus is presented and the findings categorised, and thirdly, the findings are included in a computational tool to be used as translation aid. This paper suggests that a useful starting-point for the study of paraphrase is a set of texts that can be identified as paraphrases of each other on external rather than internal criteria. Here, texts that have more than one translation into English are compiled into a corpus, the Transfer Corpus. In the extension to this project, we envisage a tool that will offer alternative words, phrases or even grammar to that of the translator's first choice.

15.35 Discussion

15.40 Break

16.10 Lost in Specialised Translation: an Inexpensive and Under-Exploited Aid for Language Service Providers
Gloria Corpas Pastor, University of Malaga, Spain
In the paper and in the presentation Gloria will discuss a translation assignment of a specialised nature. After performing source text analysis, she will demonstrate how a step-by-step approach to building an adequate and/or representative corpus from resources in the Internet works in practice. Corpus design criteria and qualitative issues will be taken into account. Real examples will be presented that show how to mine a corpus and how to use it in order to meet translators' needs as far as terminology, documentation, target text conventions and other constraints are concerned. She will illustrate how language service providers can use freeware concordances (Antconc, ParaConc, etc.) to search for translation equivalents in comparable and parallel corpora. This demonstration will also show the clear benefits of using such corpora over any type of dictionary as they provide examples of how words or expressions are used and translated in context. All in all, she will demonstrate how corpora (a) provide instant access to real usage, (b) depict syntagmatic patterns and translation equivalents unavailable in existing lexicographic resources, and (c) facilitate guidance to style and text conventions in both SL and TL.

16.50 Discussion

16.55 End of Day One



Day Two: 30 November 2007

08.45 Coffee and Registration (for Day Two delegates only)

09.15 Introduction by Chair: Chris Pyne, SAP AG, Germany

09.20 Medical Spoken Language Translation Technology: What Do the Users Really Need?
Professor Harold Somers, University of Manchester, UK
This paper focuses on the particular use of emerging spoken language translation (SLT) technology in the medical domain, in particular to assist communication between patients with limited English and healthcare providers. The paper will first consider the "pathway to healthcare" for such patients noting how it extends beyond the focal point of a doctor-patient dialogue with a GP as it is conventionally portrayed, to include interaction at various stages with a range of medical specialists and non-specialists. The translation needs (both spoken and text) vary accordingly. The paper will then critically review work done so far on SLT in the medical domain, in particular for its bias towards the doctor as the principle user of proposed software, its often condescending view of the patient's role, and the fact that it is led by the technology (what can we do?) rather than the users' needs (what should we do?). Finally the paper will propose a range of language technologies suitable for different points along the pathway to healthcare, and will discuss some of the barriers to implementation, especially considering the fact that in many cases the languages spoken by the kinds of patients we are targeting happen also to be languages of least interest to commercial developers of language technologies.

10.00 Translation Quality Assurance Tools: Current State and Future Approaches
Julia Makoushina, Palex Languages and Software, Russia
The purpose of this paper is to review translation quality assurance tools, to define where they currently are, their advantages and disadvantages and to visualise their future capabilities and role in translation process. The review is done from the viewpoint of a practitioner with a software development background and is highly focused on what real features need to be implemented in such tools in the near future. In order to make the review more valuable we carried out a survey among translation professionals to identify the most popular QA tools, the overall acceptance and common usage of such tools, and their desired features and capabilities. During this presentation we will briefly examine how translation quality assurance tools developed, consider what they have in common and point out their unique features, benchmark their performance and discuss how the translation community accepts and uses them, what translators expect of them and what kind of future awaits them.

10.40 Discussion

10.50 Coffee

11.20 Rapid Development of RBMT Systems for Related Languages
Jernej Vicic, University of Primorska, Slovenia
This paper describes novel way of constructing rule-based machine translation systems (RBMT). RBMT systems are currently among the best performing machine translation systems. Most of the "big named" machine translation systems (Systran, 2007)(Promt, 2007) belong to this category, but these systems have a big drawback; construction of such systems demands a great amount of time and resources, and thus finance. The paper describes methods that automate parts of the construction process. The methods were evaluated in a case study: construction of a fully functional machine translation system of the closely related language pair Slovene - Serb. Evaluation was conducted on the functional machine translation system and the results, presenting coverage using referential corpus and selected evaluation metrics, are shown. Objective and subjective evaluation methods were used as only a correct mixture of methods minimizes evaluation bias. Translation quality evaluation was conducted using subjective evaluation methods where a set of native speakers scored translations. The conclusions present strong and weak points of this approach and explore grounds for further work.

12.00 Discussion

12.05 Building a bilingual dictionary from movie subtitles based on inter-lingual triggers
Kamel Smaili, David Langlois, Loria, France
Training statistical translation models requires a huge quantity of bilingual aligned corpora. Commonly, Statistical Machine Translation Systems use Canadian Hansard Corpus or corpora extracted from the proceedings of European Parliament. Then given this parallel corpora, translation probabilities are learned using Giza++ which is an implementation of the IBM's translation models. In this paper, a method is presented, based on DTW (Dynamic Time Warping), which automatically aligns subtitles. With this method, 40 parallel corpora of French and English subtitles are generated. Using this corpus to test this approach, based on inter-lingual triggers for learning translation probabilities, it was possible to automatically build a dictionary. Experiments showed that the resultant dictionary is well constructed and is suitable for machine translation. With the purpose of evaluating the actual contribution of this method, a real translation system was developed using the Pharaoh decoder. Evaluation has been achieved by using Bleu score.

12.45 Discussion

12.50 Lunch (please note: coffee will be served in the exhibition area)
An opportunity to visit the exhibition stands

14.10 Introduction by Chair: Professor Ruslan Mitkov, University of Wolverhampton, UK

14.10 E-learning and Employability in Translator Training: Introducing
E-Portfolio and Personal Development Planning at the University of Vienna

Notburga Rotheneder, University of Vienna, Austria
The aim of this project is to introduce the e-portfolio method to promote skills development reflection and personal development planning in the specific organisational and didactic context of the Centre for Translation Studies.
E-learning is regarded as a means to promote skills that are becoming increasingly relevant for translators, namely translation technology and transferable skills. The e-portfolio is an excellent tool for training reflection, both about what relevant skills are and how they are acquired, and also serves to make skills profiles and evidence of achievements visible to potential employers. These are appropriate measures to improve the employability of translation graduates. The focus is on technology skills - such as translation memory usage and management, software and website localisation, terminology extraction and management, corpus creation and exploitation, on the one hand and on transferable skills - such as organisational, communication and team skills (project management) and quality assurance - on the other hand.

14.40 Innovation and E-learning in Translator Education: A Singapore Experience
Susan Xu, SIM University, Singapore
The last decades have seen the education of translators shift to several new dimensions, thanks to rapidly-developing technology and globalization. There seems to be an increasing perception that conventional classroom teaching alone will no longer equip translators-in-training with the wide range of professional skills, knowledge and competencies that are required by today's fast evolving language service market. This paper presents an overview of the innovative development in devising integrated and detailed e-Learning plans and activities to enhance the design and delivery of the course content in the BA in Translation and Interpretation (BATI) programme. It examines how the blended approach in intergrating e-learning environment with on-site classroom changes the way translators and interpreters are being trained and resolves the problems in the training. Implication and issues encountered in the process of implementing
e-learning plans such as lack of technical experiences, challenge of incorporating pedagogy into the e-learning object, workload for teachers and students, and conceptual restriction of translation will be discussed. It is concluded that the solutions to these problems lie in developing greater control over technologies and this poses challenges and opportunities to software developers in the relevant field.

15.10 Panel Discussion on the future of Training Translators

15.30 Tea

15.50 The Long Way to a Corporate Platform for Multilingual Contents and Translation Processes: Corporate Translation Management in practice
Michael Leifeld, Polysius AG and Christian Weih, across Systems GmbH, Germany
As Polysius AG, one of the world's leading engineering companies in the field of equipment for the cement and minerals industry, were about to replace their existing translation memory, it soon became clear to them that they were renewing much more than just a software application. The decision to implement the concept of Corporate Translation Management was made with a view to increasing efficiency on various levels and this is now being carried out step by step. Because of the company's global activities, emails, reports and above all plant-related information such as manuals, clerical, technical and legal texts are translated into a current total of 10 languages. The majority of the workload of approx. 150 translation jobs per month, i.e. around 80%, need to be translated into English and Spanish and are centrally processed by the company's documentation and translation department. With the new system the documentation and translation department now has a tool for effectively managing projects that continually supports individually defined workflows. The translation memory and the terminology system form the basis for both effective translation support and the management of standardized corporate terminology. The terminology system is available by browser via the company's Intranet to ensure simple access to both the company's extensive terminology base and to the concordance search for all staff members who need these features. This paper will describe the decision process the company undertook to develop the new system.

16.25 Discussion

16.30 LTC Presentation on Workflow and Business Information Management in the Language Industry
Adriane Rinsche, LTC, UK
LTC has almost a decade of experience in helping corporate clients and language companies implement business information and workflow control tools. In this presentation we will share some of our insights by explaining various workflows and their objectives. Attendees will be able to improve awareness of how workflows are used and how they differ depending on the desired outcome. They will be given tips on how to describe and review their own processes and workflows.

17.05 Discussion

17.10 Close of Conference

Please note: Aslib reserves the right to make changes to the publicised programme without prior notice.

 

--
-