More Questions Answered on How MT Helps Improve Translation Productivity (Part II)

Part II

Welcome to Part II of the Q&A blog on How Machine Translation Helps Improve Translation Productivity. In case you missed the first part of our post, here’s a link to quickly have a look at what was covered.

Tony O’Dowd, Chief Architect of KantanMT.com and Louise Faherty, Technical Project Manager presented a webinar where they showed how LSPs (as well as enterprises) can improve the translation productivity of the language team, manage post-editing effort estimations and easily schedule projects with powerful MT engines. For this section, we are accompanied by Brian Coyle, Chief Commercial Officer at KantanMT, who joined the team on October, 2015 to strengthen KantanMT’s strategic vision.

We have provided a link to the slides used during the webinar below, along with a transcript of the Q&A session.

Please note that the answers below are not recorded verbatim and minor edits have been made to make the text more accessible.

Question: We are a mid-sized LSP and we would like to know what benefits would we enjoy if we choose to work with KantanMT, over building our own systems from scratch? The latter would be cheaper, wouldn’t it?

Answer (Brian): Tony and Louise have mentioned a lot of features available in KantanMT – indeed, the platform is very feature-rich and provides a great user experience. But on top of that, what’s really underneath KantanMT is the fact that it has access to a massive computing power, which is what Statistical Machine Translation requires in order to perform efficiently and quickly. KantanMT has the unique architecture to help provide instant on-demand access at scale.

As Louise Faherty  mentioned, we are currently translating half a billion words per month and we have 760 servers deployed currently. So if you were trying to develop something yourself, it would be hard to reach this level of proficiency in your MT. Whilst no single LSP would probably need this total number of servers, to give you an idea of the cost involved, that kind of server deployment in a self-build environment would cost in the region of €25m.

We also offer 99.99% up time with triple data-centre disaster recovery. It would be very difficult and costly to build this kind of performance yourself.  Also, with this kind of performance at your client’s disposal, you can offer Customised MT for mission critical web-based applications such as eCommerce sites.

Finally, a lot of planning, thought, development hours and research has gone into creating what we believe is the best user interface and the platform for MT, which also has the best functionality set with extreme ease of integration in the market place. So, it would be difficult for you to start on your own and build your own system that would be as robust and high quality as KantanMT.com.

Question: Could you also establish KantanNER rules to convert prices on an eCommerce websites?

Answer (Louise Faherty ): Yes, absolutely! With KantanNER, you can also establish rules, convert prices and so on. The only limitation with that being is that the exchange range will of course fluctuate. But there could be options as well of calculating that information dynamically – otherwise you would be looking at a fixed equation to convert those prices.

KantanMT_ProductivityQuestion: My client does not want us to use MT because they have had bad experience in the past with Bing Translate – what would convince them to use KantanMT? How will the output be different?

Answer (Tony O’Dowd): One of things that you have to recognise in terms of using the KantanMT platform is that you are using MT to build customised machine translation engines. So you are not going to create generic engines (Bing Translate and Google Translate are generic engines). You would be building customised engines that are trained on the previous translations, glossaries that you clients have provided. You will also be using some of our stock engines that are relevant to your client’s domain.

So when you combine that, you get an engine that will mimic the translation style of your client. Indeed, instead of generic translation engines, you are using an engine that is designed to mirror the terminology and stylistic requirements of your client. If you can achieve this through Machine Translation, you will see that there is a lot less requirement for Post-Editing, and this is one of the most important things that drives away translators from using generic systems or broad-based systems and that’s why they choose customised systems. Clients and LSPs have tested the generic systems as well as customisable engines and found that cloud-based customisable MT add a value, which is not available on free, non-customisable MT platforms.

End of Q/A session

The KantanMT Professional Services Team would once again like to thank you for all your questions during the webinar and for sending in your questions by email.

Have more burning questions? Or maybe you would like to see the brilliant platform translate in a live environment? No problem! Just send an email to demo@kantanmt.com and we will take care of the rest.

Want to stay informed about our new webinars? You can bookmark this page, or even better – sign up for our newsletter and ensure that you never miss a post!

Let's Get Started_KantanMT

All your Burning Questions Answered! How Machine Translation Helps Improve Translation Productivity (Part I)

Part I

We had so many questions during the Q&A in our last webinar session ‘How to Improve Translation Productivity‘ by the KantanMT Professional services team, that we decided to split the answers into two blog posts. So, if you don’t find your questions answered here, check out our blog next week for the remaining answers. 

KantanMT_ComputersInternet today is experiencing what is generally referred to as a ‘content explosion!’ In this fast-paced world, businesses have to strive harder and do more to stay ahead of the game – especially if they are a global business or if they have globalization aspirations. One fool-proof way in which a business can successfully go global is through effective localization. Yet, the huge amount of content available online makes human translation for everything almost impossible. The only viable option then in today’s competitive online environment is through the use of Machine Translation (MT).

On Wednesday 21st October, Tony O’Dowd, Chief Architect of KantanMT.com and Louise Faherty, Technical Project Manager at KantanMT presented a webinar where they showed how Language Service Providers (LSPs)  (as well as enterprises) can improve the translation productivity of the team, manage post-editing effort and easily schedule projects with powerful MT engines. Here is a link to the recording of the webinar on YouTube along with a transcript of the Q&A session.

The answers below are not recorded verbatim and minor edits have been made to make the text more readable.

Question: Do you have clients doing Japanese to English MT? What are the results, and how did you get them? (i.e., do you pre-process the Japanese?)

Answer (Tony O’Dowd): English to Japanese Machine Translation (MT) has indeed always posed a challenge in the MT industry. So is it possible to build a high quality, high fidelity MT system for this language combination? Well, there have been quite a few developments recently to improve the prospect of building effective engines in this language combination. For example, one of the latest changes we made on the KantanMT platform for improving the quality of MT is by using new and improved reordering models to make the translation from English to Japanese and Japanese to English much smoother, so we deliver a higher quality output. In addition to that, higher quality training data sets are now available for this language pair, compared to a couple of years ago, when I had started building English to Japanese engines. Back then it was really challenging. It is still requires some effort to build English to Japanese MT engines, but the fact that there’s more content available in these languages makes it slightly easier for us to build high-quality engines.

We are also developing example-based MT for these engines and it so far this is showing encouraging signs of improving quality for this language pair. However, we have not started deploying this development on the platform yet.

KantanMT note: For more insights into how you can prepare high-quality training data, read these tips shared by Tony O’Dowd, and Selçuk Özcan, co-founder of Transistent Language Automation Services during the webinar ‘Tips for Preparing Training Data for High Quality MT.’

Question: Have you got a webinar recorded or scheduled, where we could see how the system works hands-on?

Answer (Tony O’Dowd): If you go on to the KantanMT website, we have video links on the product features pages. So you can actually watch an explanation video while you are looking at the component.

We work in a very visual environment, and we think videos are a great way of explaining how the platform works. And, if you go on to the website, on the bottom left corner of the page, you will find our YouTube channel, which contains videos on all sorts of topics, including how to build your first enginehow to translate your first document and  how to improve the output of your engines.

If you click on the Resources menu on our site, you can access a number of tutorials that will talk you through the basics of Statistical Machine Translation Systems. In other words, explore the website and you should find what you need.

KantanMT note: Some other useful links for resources are listed below:

Question: Do you provide any Post-Editing recommendations or standards for standardising the PE process? You said translation productivity rose to 8k words per day – this is only PE, correct?

Answer (Tony O’Dowd): I will take the second question first! The 8,000 words per day is the Post-Editing (PE) rate, yes. It is not the raw translation rate. In Machine Translation, everything comes out pretranslated. So this number refers to the Post-Editing effort – like insertions, deletions, substitution of words, and so on that you need to do to get the content to publishable quality.

Louise Faherty: What we recommend to our clients is that when it comes to PE, they should try to use MT. A lot of translators who are new to using MT will try and translate manually, which is a natural tendency, of course. But what we advise our clients is to copy and paste the translation (MT) in the engine and use the MT. The more you use MT and the more you Post-Edit, the better your engine will become.

Tony O’Dowd: I will add something to Louise Faherty ’s comments there. The best example of PE recommendations that I have come across is provided by a group called TAUS. They are at the pivot of educating the industry on how to develop a proficiency in PE.

Subscribe to TAUS YouTube channel here.

Question: What do ‘PPX’ and ‘PEX’ stand for (as abbreviations)?

Answer (Louise Faherty  and Tony O’Dowd): PEX stands for Post-Editing Automation. PEX allows you to take the output of an MT engine and dynamically alter that. When would you need to use PEX? Suppose there is a situation where your engine is repeating the same error over and over again. What you can do in such cases is write a PEX file (developed in the GENTRY programming language). This allows the engine to look for patterns in the output of the engine and to dynamically change that in the output.

For example, one of our French clients did not want to have a space preceding a colon mark in the output of their MT (because this was one of their typographical standards and repeated throughout the content). So we wrote a PEX rule that forced a stylistic change in the output of the engine. This enabled the client to reduce the number of Post-Edits substantially.

PPX stands for Preprocessor automation. You can use PPX files for to normalise or improve the training data. It is based on our GENTRY programming language which is available to all our clients for free.

In short then, PPX is for your training data, while PEX is for the actual raw output of your engine.

For more questions and answers, stay tuned for the next part of this post!

Intelligent Content Distribution is Important – But How Intelligently are you Producing Your Content?

The future of content production, distribution and consumption is here. With the number of websites at 949,891,800 as of the time of publication of this blog, and increasing every second, the importance of developing structured content and its distribution has become more important than eArtificial Intelver. It is no coincidence that in the 2015 tcworld Conference held in Stuttgart, Germany, one of the main themes of discussion is Intelligent Information management.

The speakers will present on the “megatrend of our time” where content needs to cater to “smart” users through “smart” services and not “just” products. As a result of this new trend, companies need to step up to the challenge and provide users with individualised information at the right time, in the right place and in the medium of their choice. Tekom calls this the “Intelligent Information Initiative – in3.” Before talking a bit more about what Intelligent Content is all about, it is important to remember that while the structure of the content is incredibly important, it is also equally important to have content that’s relevant, reusable and above all, targeted to the “smart” users that companies aim to attract.

To know more about the speakers for this theme and the topics being presented, have a look at the tcworld conference schedule.

What’s Intelligent Content?

So what is intelligent content, and how can businesses effectively manage their content to get their products and services to market faster, without having to create new content for each new platform or medium.

Intelligent content adopts digital texts and multimedia with coding. This allows the coded content to be automatically processed for being accessed across various devises and interface.

The Intelligent bit is created by removing the formatting and adding metadata, which summarises the information related to the data. This makes finding and reusing the data/ content much easier. The metadata adds information to segments of the content, which in turn makes the content easy to be disseminated, discovered and reused by businesses.

Ann Rockley of the Rockley group fame has spoken in depth about Intelligent Content development in her work Managing Enterprise Content and she describes the importance of adopting the structure of intelligent content as follows:

Intelligent content enables automated multi-channel delivery, adaptive content, improved content discovery, and personalized content delivery in an agile world. But the power of your content to respond to tight timelines, new customer requirements, and increasing costs is based on the quality of your intelligent content strategy

It is this “quality” of content strategy that will either make you a leader in your industry or slow you down from being able to be the first to bring your product to market. If you are taking your “smart” services and products to users across the globe, or even starting your content strategy from scratch, it is important that you structure your content. This will not only make it easy to reuse and tag, but will also be extremely helpful for you when you have to get your content localized and translated into the languages of the markets you want to penetrate.

Structured content is incredibly suited to Machine Translation. And a quality (intelligent) content strategy should always plan ahead for the need of localization in the future. Even if your business is just starting off, you should begin planning your content strategy intelligently because an unstructured website can escalate into an unmanageable mess very quickly.

The real success of a company’s content strategy is knowing how and where it is consumed. Luckily the ‘always on’ availability of content means anyone can access information regardless of nationality and geographical location. If a company is serious about its content strategy, then it will put a process in place to create multilingual content that can be distributed to a multilingual audience.

Including Machine Translation into your Translation and localization Workflow will make it easier to translate more content, faster than traditional human only workflows. When your MT engine is integrated into your content management workflow, translations for your structured content can be sent to your global websites seamlessly, with minimal manual intervention.

Thanks to high success rates KantanMT has working with Structured Content, we believe that integrating Machine Translation into the workflow and planning an “quality” Intelligent Content Strategy go hand in hand.

Meet us at booth 2/A09 during the tekom trade fair and tcworld conference between 10th aTc Worldbd 12th November to learn more about Custom Machine Translation and how it can fit in your intelligent content production workflow.

We have a few  FREE Tekom Fair tickets to give away, so to be in with a chance to win, send an email to marketing@kantanmt.comwith ‘FREE Ticket’ in the subject line and we will add you to a draw. Winners will be notified by email.

Applications Of Machine Translation And Translation Memory Tools In Audiovisual Translation: A New Era?

Rafaella Athanasiadi, UCL KantanMTMaster’s student, Rafaella Athanasiadi of the University College London submitted her thesis as part of the MSc degree in Scientific, Technical and Medical Translation with Translation Technology. Rafaella was supervised by Teaching Fellow and Lecturer Dr. Emmanouela Patiniotaki and she used KantanMT.com for her research. This guest blog post looks at some of her conclusions on Machine Translation and the Localization Industry.

As Hutchins & Somers (c1992:1) argue, “the mechanization of translation has been one of humanity’s oldest dreams.” During the 20th century, the translation process changed radically. From spending endless hours in libraries to find the translation of a word, the translator has been placed in the centre of dozens of assistive tools. To name just a few, today, there are many translation software, terminology extraction tools, project management components, and machine translation systems, which translators have the opportunity to choose from while translating.

However, shifting the focus to audiovisual translation, it can be observed that not so many radical changes took place in that area, at least not until the introduction of machine translation systems in various projects (such as, the MUSA and the SUMAT project) that developed machine translation engines to optimise the subtitling process. Still, the results of such projects do not seem to be satisfactory enough to inspire confidence for the implementation of these engines in the subtitling process both by subtitling software developers and subtitlers.

Rafaella Athanasiadi, KantanMT UCLBased on my personal research that focused primarily on the European setting, in the subtitling industry it seems that only freeware SRT Translator incorporates machine translation while also offering the features that subtitling software usually incorporate (i.e. uploading multimedia files and timecoding subtitles) at the moment. Nonetheless, SRT Translator, which is not very famous among subtitlers, uses solely Google Translator by default, which is a general-domain machine translation engine and not suitable for the purposes of audiovisual translation, one could argue. The quality of the output of Google Translator was tested by translating 35 subtitles of a comedy series[1]. The output was incomprehensible and misleading in many cases.

Even though no further records of traditional subtitling software that incorporate machine translation could be found, there are many online translation platforms that allow users to upload and translate subtitles. Taking into consideration the European market, these can be either translation software like MemoQ, SDL Trados Studio and Wordfast that offer thability to load subtitle files and in some cases link them to the audiovisual content they are connected to, open source tools for translators like Google Translator Toolkit (GTT) or professional and private platforms like Transifex and XTM International that are used by companies and offered to their dedicated network of translators. Nonetheless, in order to enable machine translation in all the above applications, API keys must be purchased. GTT is an exception since it can be used for free anytime and only requires a Gmail account.

The fact that subscription fees have to be paid along with the costs of API keys for each machine translation engine provider puts their usability in question since costs may overweight subtitlers’ profits. Furthermore, these platforms cannot accommodate subtitlers’ needs; for instance, the option to upload and play multimedia files while translating the subtitles is not always possible nor any synchronization features for timecoding the subtitles to the audio track are offered. Transifex, however, is an exception since this localization platform offers users the option to upload multimedia files in the translation editor while translating the subtitles.

KantanMT Machine TranslationAccording to Macklovitch (2000:1) a translation memory is considered to be “a particular type of translation support tool that maintains a database of source and target language sentence pairs, and automatically retrieves the translation of those sentences in a new text which occur in the database.” Even though machine translation engines were developed through different projects to reduce subtitling time to the least possible degree, no attempts had been traced during this research to integrate a translation memory tool in a subtitling software for optimizing subtitling; at least in a European, Asian and Australian setting. As Smith (2013) argues, “traditionally subtitling has fallen outside the scope of translation memory packages, perhaps as it was thought to be too creative a process to benefit from the features such software offers.” However, as Diaz-Cintas (2015:638) discusses “DVD bonus material, scientific and technical documentaries, edutainment programmes, and corporate videos tend to contain the high level of lexical repetition that makes it worthwhile for translation companies to employ assisted translation and memory tools in the subtitling process.”

Even if such tools have not been integrated in subtitling software, translation memory components are used for subtitling purposes in cloud-based platforms such as GTT, Transifex and XTM International as well as in translation software, MemoQ, SDL Trados Studio, Wordfast Pro and Transit NXT by simply creating a translation memory before or while translating. It should be noted that Transit NXT is the only translation software that can accommodate the needs of subtitlers to a high level among the tools discussed in this research. Apart from the addition of specialized filters to load subtitles (that also exist in MemoQ, SDL Trados Studio and Wordfast Pro), subtitlers can upload multimedia files, translate subtitles while a translation memory component is active and also synchronise their subtitles with the Transit translation editor (Smith, 2013).

The translation editor of Transit NXT by Smith
The translation editor of Transit NXT by Smith (2013)

Figure 1: The translation editor of Transit NXT by Smith (2013)

The newly-founded company (2012) OOONA has taken a very interesting approach to subtitling by developing a unique cloud-based toolkit that is built exclusively for accommodating the needs of subtitlers. When asked the following question within the context of the MSc thesis,

Considering that other cloud-based translation platforms like GTT, Transifex and XTM International offer the option of uploading a TM or a terminology management component, do you think that it is important to offer it on a subtitling platform as well?

the representative of OOONA (Alex Yoffe) replied that not only will the company implement translation memory and terminology management components in the next phase of enhancing their platform but that they also consider these components to be very important for the subtitling process. In addition, Yoffe (2015) argued that OOONA intends to “add the option of using MT engines. Translators will be able to choose between Microsoft’s, Google’s, or customisable MT engines.” Therefore, it seems that OOONA will become a very powerful tool in the near future with features that will optimise the subtitling process to the maximum and shape the way that subtitling is carried out until now. The fact that Screen Systems, Cavena and EZTitles have partnered with OOONA is an indicator of how much potential there is in this toolkit.

As it can been argued based on the above, there is lack of subtitling software with incorporated translation memory tools. Therefore, this issue was further researched through the form of an online questionnaire that was disseminated to subtitling companies and freelance subtitlers. In addition, two companies that develop subtitling software, Screen Subtitling Systems and EZTitles, were asked to present their views on this topic. In both cases, their willingness to optimise the subtitling process in a semi-automated or a fully-automated way was apparent through their answers. The former company was in favour of a combination of machine translation tools with translation memory tools whereas the latter leaned towards a subtitling system with integrated translation memory and terminology management tools.

Nonetheless, the optimisation of the subtitling process has to coincide with the needs and preferences of subtitlers. Based on the respondents’ answers, it is clear that translation memory tools in subtitling software are desirable by subtitlers. In question,

Which tool would you prefer to have in a subtitling software? An integrated translation memory (TM) or machine translation (MT)?

more than half of the respondents (56.8%) chose TM. Interestingly, the answer Both received the second highest percentage (20.5%) which indicated that subtitlers demand as many assistive tools as possible.

One of the main conclusions that were drawn from this research was that machine translation engines need to be customised to produce good quality output and this can be achieved through customisable engines like KantanMT and Milengo. Moreover, translation memory tools are sought by subtitlers in subtitling software, while cloud-based platforms seem to occupy the translation industry today. Following this trend, subtitling software providers partner with online services/tools like the OOONA toolkit.

Based on the outcomes of this research, it could be said that we are certainly experiencing a new era in subtitling since the traditional PC-based subtitling software are now transforming into flexible and accessible platforms to enhance the subtitling experience as much as possible. It is a matter of time which tool and platform will rule the subtitling industry but one thing is for sure; the technologies of the future will bring a lot of changes in the traditional way of subtitling.

Works Cited

Diaz-Cintas, J., 2015. Technological Strides in Subtitling. In: S. Chan, ed. Routledge Encyclopedia of Translation Technology. London: Routledge, pp. 632-643.

Hutchins, J. W. & Somers, H. L. (c1992). An introduction to machine translation. London: Academic Press.

Macklovitch, E. (2000). Two Types of Translation Memory. In Proceedings of the ASLIB Conference on Translating and the Computer (Vol. 22).

Smith, Steve (2013). New Subtitling Feature in Transit NXT. November 11 2013. [Online]. Available from: http://www.star-uk.co.uk/blog/subtitling/working-with-subtitles-in-transit-nxt/. [Accessed 01 Sept. 2015].

Yoffe, A (2015). MT and TM tools in subtitling. [Interview]. 13 August 2015.

[1] Relevant data are available in Appendix 1 of the MSc thesis.

 

University Speak: The Future of Translation Technology with Swansea University

A few months back we were pleased to announce our Academic Partnership with Swansea University’s Department of Languages, Translation and Communication. The postgraduate students of the department were able to use the KantanMT platform to update or gain new skills in Translation Technology. With help of the KantanMT platform, the students learnt how to build and customise their own Statistical Machine Translation (SMT) systems in a real world scenario.

The MLTM11 Translation Technologies Module aimed to broaden and deepen students’ familiarity and expertise with current translation technologies. A number of students in the programme were interested in careers in the localization industry. Interests and aspirations ranged from working as an in-house IT specialist and/or localizer, a developer and tester with a major translation tools company, or working as an academic researcher (e.g. a PhD) in the field of language technology.

Now that the course is complete and the students are ready to contribute to the world of Machine Translation, we asked them a few questions about their experience of using the platform. We also asked the lecturer from the course to tell us a little bit about her experience of working with us. Here’s a few bites from each of our brilliant KantanMT users.

Dr. Maria Fernandez-Parra, Swansea University

LecturerDr. Maria Fernandez-Parra, Languages, Translation and Communication

About Dr Fernandez-Parra:

Dr Fernandez-Parra is interested in all aspects of translation and interpreting, particularly translation technologies, translation theory, technical and specialised translation and computer-assisted translation. She is also interested in many aspects of linguistics (especially formulaic language) as well as in the teaching of Spanish in Higher Education. One of her more recent interests is the introduction of technology into teaching in Higher Education.

What does Dr Fernandez-Parra think about KantanMT?

KantanMT is very useful for teaching purposes for various reasons but one reason that stands out is that it allows lecturers to show students an example of how to train data for MT output. This is not possible with other MT tools that I know of. Students can experiment with the creation of training data for their specific source texts, as opposed to general training data that might not be too relevant to some source texts. This allows them to get the best results possible when translating with MT and it gets students familiar with the concept of training MT engines on specific datasets.

It is good that students can start learning about this concept and have some practice and experience in it, as we know there are increasingly more career opportunities in this field of work. Another advantage of KantantMT is that it is very convenient to access it online for teaching purposes, as we do not have to worry about installing anything in our university server and there are no licensing problems involved. KantanMT staff were very helpful and quick to respond to queries. The use of KantanMT online is easy and intuitive and it does not take long for students to be working with their own training data in KantanMT. In addition, there is a wide range of useful information on the web site and a variety of useful resources too.

Student: Ms Min Luo

I think CAT Tools will become more and more helpful in China.

KantanMT: What made you choose this course, and how to do you think it will help your career?

Ms Luo: I think this course is useful. It will help the translator when they translate texts containing repetition.

KantanMT: What are your thoughts on using the KantanMT platform, what did you find good and do you have any suggestions for improvement?

Ms Luo: It is good. It is convenient. I have no suggestion.

KantanMT: What is your impression of the translation industry, and in your opinion, what do you think the industry will look like in the future?

Ms Luo: Technology changes quickly. I think CAT Tools will become more and more helpful in China.

Student: Ms Gwladys Petitfrère

KantanMT.com is very different from other MT I have used before. It is very easy to use and gives great results.

KantanMT: What made you choose this course, and how to do you think it will help your career?

Ms Petitfrère: I wanted to learn how to use various CAT tools for my career as a translator, if I needed it in the future.

KantanMT: Did you have any previous experience of Machine Translation, if so how does KantanMT.com compare?

Ms Petitfrère: KantanMT.com is very different than other MT I have used before. It is very easy to use and gives great results.

KantanMT: What are your thoughts on using the KantanMT platform, what did you find good and do you have any suggestions for improvement?

Ms Petitfrère: There is a lot of information available about every possible specificity of this tool.

KantanMT: What is your impression of the translation industry, and in your opinion, what do you think the industry will look like in the future?

Ms Petitfrère: I think CAT tools are very helpful for the translator, but hopefully we won’t act as proof-readers of the text produced by a machine.

Student: Ms So Mui Cheung

I see contradictory directions. On the one hand, MT is inevitable, welcoming and the way forward.

KantanMT: What made you choose this course, and how to do you think it will help your career?

Ms Cheung: I chose this course because firstly, it offers a PG certificate option and as I already have a Masters degree connected with the language I work with, so I didn’t want to complete another Masters degree.

Secondly, from the point of view of content, I needed a course that focuses on the technology for translation work, and lots of practical hands-on work using the tools. This course meets these criteria. The course sets a direction for a new way of approaching translation. It reflects the rapid changes happening in the field.

Question: Have you had you any previous experience of Machine Translation, if yes, how did you find using the KantanMT system?

Ms Cheung: Most people who use the internet now will have some experience of using MT if only to get a gist of internet content instantly. My comment here is based on very little interaction with the KantanMT platform. It is a tool aimed not at individuals/freelance translators but corporations/language services providers. As such, unless one is engaged to do quite a lot of work or on a regular basis, then the KantanMT platform is not one where one can get the most out of in a short time, as indeed an individual translator is unlikely to have access to it.

KantanMT: What are your thoughts on using the KantanMT platform, what did you find good and do you have any suggestions for improvement?

Ms Cheung: See my comment above – as an individual translator, I can only suggest improvements after having tested the different features thoroughly. I would welcome another chance to answer this question.

KantanMT: What is your impression of the translation industry, and in your opinion, what do you think the industry will look like in the future?

Ms Cheung: Firstly, in the translation industry, at least in the language pair of Chinese>English that I work with, good quality and relevant TMs for training the engine might be a problem. However, in the future – again, in the language pair I work with, I see contradictory directions. On the one hand, MT is inevitable, welcoming and the way forward. On the other hand, depending on the nature of the work, corporations subscribing to cloud-based MT platforms such as Kantan will have issues with confidentiality, and for this reason, I expect some adaption before a cloud-based platform is a widely or readily available tool on an LSP-freelancer basis in the foreseeable future. In the meantime, I expect to see an increase in MT text for post-editing whether online or offline.

Note from the KantanMT team:

Read more about courses offered by Swansea University, or for more information on the KantanMT University Partner Program, please contact us, (info@kantanmt.com).

About Swansea University

Swansea is a top-30 UK research university (Research Excellence Framework 2014), with a record of teaching translation tools and technologies going back to 1999. Its MA in Professional Translation (formerly MA in Translation with Language Technology) has been a member of the European Master’s in Translation Network since its inception in 2009. It also offers a vocational MA in Translation and Interpreting, and a thriving PhD programme in Translation (including recent successful projects on translation tools).

Create, Test and Deploy Post-Editing Automation Rules with KantanMT PEX Rule Editor

The KantanPEX Rule Editor enables members of KantanMT reduce the amount of manual post-editing required for a particular translation by creating, testing and deploying post-editing automation rules on their Machine Translation engines (client profiles).

The editor allows users to evaluate the output of a PEX (Post-Editing Automation) rule on a sample of translated content without needing to upload it to a client profile and run translation jobs. Users can enter up to three pairs of search and replace rules, which will be run in descending order on your content.

How to use the KantanMT PEX Rule Editor

Login into your KantanMT account using your email and your password.

You will be directed to the ‘Client Profiles’ tab in the ‘My Client Profiles’ page.  The last profile you were working on will be ‘Active’ and marked in bold.

Active Profile, KantanMT, Client Profile

To use the ‘PEX-Rule Editor’ with a profile other than the ‘Active’ profile, click on the  new profile name to select that profile for use with the ‘Kantan PEX-Rule editor’.

Then click the ‘KantanMT’ tab and select ‘PEX Editor’ from the drop-down menu.

Client Profile, KantanMT, PEX Editor

You will be directed to the ‘PEX Editor’ page.

Type the content you wish to test on, in the ‘Test Content’ box.

Test Content, PEX Rule Editor, KantanMT

Type the content you wish to search for in the ‘PEX Search Rules’ box.

PEX Search Rules, KantanMT, PEX Editor

Type what you want the replacement to be in the ‘PEX Replacement Rules’ box and click on the ‘Test PEX Rules’ button to test the PEX-Rules.

PEX Replacement Rules, Pex Editor , KantanMt , Products

The results of your PEX-Rules will now appear in the ‘Output’ box.

Output Content , PEX Rule Editor

Give the rules you have created a name by typing in the ‘Rule Name’ box.

Rule Name, PEX Rule Editor , KantanMT

Select the profile you wish to apply this rule(s) to and then click on the ‘Upload Rule’ button.

Profile and Button, KantanMT , PEX

Additional Information

KantanMT PEX editor helps reduce the amount of manual post-editing required for a particular translation, hence, reducing project turn-around times and costs. For additional information on PEX-RULES and the Kantan PEX-Rule editor please click on the links below. For more details about  KantanMT localization products  and ways of improving work productivity and efficiency please contact us at info@kantanmt.com.

 

Translation Machines in Sci-fi

Richard Brooks, CEO, K International KantanMT
Richard Brooks, CEO, K International

This blog post was written by Richard Brooks. He’s a firm believer that life imitates art, CEO of the UK-based LSP K International, a company specialising in translation services for the legal industry and director of the Association of Language Companies.

Translation Machines in Sci-fi

In science fiction, translation of the potentially infinite number of languages spoken by alien species presents a dilemma. How to deal with communication between interplanetary species without resorting to contrivance, or spending the first twenty minutes of each episode’s dialogue clumsily showing characters learning one another’s diphthongs?

The notion of a ‘universal translator’ emanated from Murray Leinster’s novella First Contact, published in 1945 (and clearly that isn’t the only debt Gene Roddenberry owes to Leinster). It’s a greatly helpful – borderline miraculous, in fact – convention of sci-fi: a technological solution to the language barrier, leaving more time for the actual narrative to unfold in one language, typically English.

With the incredible advancements in technology we’re witnessing at the moment such as Microsoft’s pilots of a Skype Translator and the industry leading work KantanMT is achieving in this area, are we seeing the beginnings of live translation – well ahead of Star Trek’s 22nd century deadline? In the meantime, let’s take a look at five of sci-fi’s finest translation machines, which beat anything real-life technology can offer – for now.

KantanMT Blog, Universal Translator

1. Star Trek: Universal Translator

An important part of Star Trek’s near-utopian vision of the future is the Universal Translator. Translating any language into another even while a person is speaking, this exceptionally handy tool means Starfleet craft in any quadrant of the galaxy can speak to new life and new civilizations without confusion.

Voiced by Star Trek creator Roddenberry’s widow Majel Barrett until her death in 2008, the development of a universal translator was, in the Trek universe, a portent of Earth’s cultures achieving universal peace. It’s difficult to imagine Google Translate having the same impact.

This convenient concept has been often copied, and occasionally parodied: in Futurama, everyone in the universe speaks English, rendering Professor Farnworth’s one successful invention – a translation device – useless, as it merely translates English into the dead language, French!

2. The Hitchhikers’ Guide to the Galaxy: the Babel Fish

Some sci-fi plays with the concept in less serious ways. In Douglas Adams’ H2G2, to help Arthur Dent deal in some small way with anything that goes on around him, inserted into his ear is a Babel Fish, memorably described by the Guide as “small, yellow, leechlike and probably the oddest thing in the universe.”

The science (such as it is) behind the Babel Fish is that it can absorb the frequencies of outside speakers, and a translation is secreted by the fish into the hearer’s brain via his or her ear canal. In a witty reversal of Star Trek’s idealistic Federation, Adams reveals that, by allowing everyone to understand one another, the Babel Fish has actually caused more war than anything else in the universe.

3. Farscape: Translator microbes

In science fiction, as in reality, it is the individual idiosyncrasies of languages which are trickiest to master. When people in the UK from a hundred miles apart may speak different languages, not to mention a range of different dialects and accents, can auditory translation really be so smooth?

One series to acknowledge this is Farscape, where astronaut John Crichton is injected with bacteria-sized ‘translator microbes’, which are injected into – and colonise – his brain. The microbes work to make their host understand any spoken information in any language – except idioms are translated literally. This leads to a great deal of confusion for John, and opportunities for humour for the audience (all jokes are language, after all) – and also perhaps renders these microbes a more realistically-limited translator technology.

4. Doctor Who: The TARDIS’ Translation Circuit

As well as being telepathically linked with the Doctor, and granting the ability to travel to any time or place in history and the future, the TARDIS’ telepathic field is used to automatically translate what the Doctor and any companions hear or read into a language which they can understand.

While wonderfully convenient, the mind-meld involved does mean that the translation circuits won’t actually work when the Doctor is unconscious – not an outright impossibility. Also, because translations are time specific, ancient civilization won’t understand neologisms – and, neatly, the Romans have never heard the word ‘volcano’ – because they’ve not lived to see an eruption.

5. Star Wars: C-3PO

Luke Skywalker is the ultimate sci-fi everyman: he is every bit as much in need of a guide to the universe he finds himself in as the viewing audience are. Reinforcing this are his guides, C-3PO and R2D2, who Luke needs with him – despite their obvious drawbacks as travelling companions – because C-3PO is programmed with millions of languages, everything from Ewok to R2’s bleeps and whistles.

When the franchise returns with The Force Awakens later this year (which most fans will rightly consider the fourth, rather than seventh, Star Wars movie), C-3PO’s translation abilities are sure to make him at least partially useful to have around.

The KantanMT team say a big Thank You to Richard for a very savvy post on translation machines in science fiction.

Richard (@RichardMBrooks) will join Tony O’Dowd, (@TonyODowd1) KantanMT Founder and Chief Architect alongside other Language industry heavyweights at the ATC Annual Conference in the Old Trafford Stadium on 24th and 25th September 2015. Register here to attend the conference. 

KantanMT at ATC Conference

If you want to learn more about Machine Translation, send us email (info@kantanmt.com) with your questions and we will be happy to answer them!