KantanOfficeMT™ Explained: Interview with Seosamh, Software Developer

We are very excited to bring to our readers another new interview with one of our KantanMT Feature Developers. This time we interviewed Seosamh Ó Cinnéide, following the launch of KantanOfficeMT™. Seosamh is an Associate Software Development Engineer at KantanMT, and we asked him a few questions to find out more about the features and benefits of using KantanOfficeMT.

A Big Thank You to Everyone Involved in the Coastal Flag Challenge for Translators without Borders

Last weekend one of the most important things we discovered is that folks working in the language industry are some of the coolest, smartest, most fun-loving yet hard-working people. They are also extremely generous. After attending the LavaCon and LocWorld31 Conferences, teams and members from various companies all around the world took up the Coastal Flag Challenge to hike along the Howth trail to raise money for Translators without Borders (TWB), a non-profit organisation that works to close critical language gaps that hinder humanitarian efforts worldwide. They support the work of hundreds of organisations in the areas of crisis relief, health and education.

New Feature Release: Interview with Louise Faherty, Project Manager, Professional Services Team on features and benefits of KantanLQR™

Following KantanMT’s announcement of the roll-out of the much-anticipated KantanLQR™ platform to all its Partners worldwide, Louise Irwin from the Digital Marketing Team caught up with Louise Faherty, Project Manager, Professional Services, KantanMT, to talk about the features, benefits and the impetus behind creating the tool.

Machine Translation Trend: Translation Cycles Instead of One-Off Projects

KantanMT recently published a white paper on what global companies can expect to see in 2016 for Machine Translation (MT). The MT industry is rapidly changing and moulding itself to the technical needs and globalization requirements of the present day. Our white paper puts forward six major MT trends that all businesses need to heed in order to stay relevant and ahead of their competitors.


Machine Translation Trend in 2016: The Age of Automatic Workflows and More Collaboration

KantanMT recently published a brand-new white paper on what global companies can expect to see in 2016 for Machine Translation (MT). The MT industry is rapidly changing and moulding itself to the technical needs and globalization requirements of the present day. Our white paper puts forward six major MT trends that all businesses need to know in order to stay relevant and ahead of their competitors.


A Trip down Memory Lane: KantanMT in 2015

While chatting over a mouthful of mince pies, some tourtière and a few classy glasses of mulled wine this week, we at KantanMT were suddenly struck by the realisation that 2015 was perhaps one of the most sensational, successful and eventful years for us in the company! And the fact is, we can’t wait to start working on everything that we have planned for 2016 – we are certain that the new year is going to be even more exciting for us.


All your Burning Questions Answered! How Machine Translation Helps Improve Translation Productivity (Part I)

Part I

We had so many questions during the Q&A in our last webinar session, ‘How to Improve Translation Productivity’, presented by the KantanMT Professional Services team, that we decided to split the answers into two blog posts. So, if you don’t find your questions answered here, check out our blog next week for the remaining answers.

The Internet today is experiencing what is generally referred to as a ‘content explosion!’ In this fast-paced world, businesses have to strive harder and do more to stay ahead of the game – especially if they are a global business or if they have globalization aspirations. One fool-proof way in which a business can successfully go global is through effective localization. Yet, the huge amount of content available online makes human translation for everything almost impossible. The only viable option then in today’s competitive online environment is the use of Machine Translation (MT).

On Wednesday 21st October, Tony O’Dowd, Chief Architect of KantanMT.com, and Louise Faherty, Technical Project Manager at KantanMT, presented a webinar where they showed how Language Service Providers (LSPs), as well as enterprises, can improve the translation productivity of their teams, manage post-editing effort and easily schedule projects with powerful MT engines. Here is a link to the recording of the webinar on YouTube along with a transcript of the Q&A session.

The answers below are not recorded verbatim and minor edits have been made to make the text more readable.

Question: Do you have clients doing Japanese to English MT? What are the results, and how did you get them? (i.e., do you pre-process the Japanese?)

Answer (Tony O’Dowd): English to Japanese Machine Translation (MT) has indeed always posed a challenge in the MT industry. So is it possible to build a high quality, high fidelity MT system for this language combination? Well, there have been quite a few developments recently to improve the prospect of building effective engines in this language combination. For example, one of the latest changes we made on the KantanMT platform to improve MT quality is the use of new and improved reordering models, which make translation from English to Japanese and Japanese to English much smoother, so we deliver higher quality output. In addition, higher quality training data sets are now available for this language pair than a couple of years ago, when I started building English to Japanese engines. Back then it was really challenging. It still requires some effort to build English to Japanese MT engines, but the fact that there’s more content available in these languages makes it slightly easier for us to build high-quality engines.

We are also developing example-based MT for these engines, and so far this is showing encouraging signs of improving quality for this language pair. However, we have not started deploying this development on the platform yet.

KantanMT note: For more insights into how you can prepare high-quality training data, read these tips shared by Tony O’Dowd, and Selçuk Özcan, co-founder of Transistent Language Automation Services during the webinar ‘Tips for Preparing Training Data for High Quality MT.’

Question: Have you got a webinar recorded or scheduled, where we could see how the system works hands-on?

Answer (Tony O’Dowd): If you go on to the KantanMT website, we have video links on the product features pages. So you can actually watch an explanation video while you are looking at the component.

We work in a very visual environment, and we think videos are a great way of explaining how the platform works. And, if you go on to the website, on the bottom left corner of the page, you will find our YouTube channel, which contains videos on all sorts of topics, including how to build your first engine, how to translate your first document and how to improve the output of your engines.

If you click on the Resources menu on our site, you can access a number of tutorials that will talk you through the basics of Statistical Machine Translation Systems. In other words, explore the website and you should find what you need.

KantanMT note: Some other useful links for resources are listed below:

Question: Do you provide any Post-Editing recommendations or standards for standardising the PE process? You said translation productivity rose to 8k words per day – this is only PE, correct?

Answer (Tony O’Dowd): I will take the second question first! The 8,000 words per day is the Post-Editing (PE) rate, yes. It is not the raw translation rate. In Machine Translation, everything comes out pretranslated. So this number refers to the Post-Editing effort – like insertions, deletions, substitution of words, and so on that you need to do to get the content to publishable quality.
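To make the idea of Post-Editing effort concrete, here is a minimal Python sketch that counts the word-level insertions, deletions and substitutions needed to turn raw MT output into the post-edited text. It is a plain word-level edit distance, shown only as an illustration; it is an assumption for this example and not necessarily how KantanMT measures PE effort, and the function name `count_word_edits` is hypothetical.

```python
def count_word_edits(mt_words, pe_words):
    """Return the minimum number of word insertions, deletions and
    substitutions needed to turn mt_words into pe_words.

    Illustrative word-level edit distance; not KantanMT's own metric.
    """
    rows, cols = len(mt_words) + 1, len(pe_words) + 1
    dist = [[0] * cols for _ in range(rows)]

    # Turning a prefix into an empty text needs only deletions,
    # and building a prefix from nothing needs only insertions.
    for i in range(rows):
        dist[i][0] = i
    for j in range(cols):
        dist[0][j] = j

    for i in range(1, rows):
        for j in range(1, cols):
            same = mt_words[i - 1] == pe_words[j - 1]
            dist[i][j] = min(
                dist[i - 1][j] + 1,                       # delete an MT word
                dist[i][j - 1] + 1,                       # insert a new word
                dist[i - 1][j - 1] + (0 if same else 1),  # keep or substitute
            )
    return dist[-1][-1]


if __name__ == "__main__":
    raw = "the engine translate this sentence".split()
    edited = "the engine translates this sentence".split()
    print(count_word_edits(raw, edited))  # 1 (one substitution)
```

The fewer edits a segment needs, the closer the raw output already is to publishable quality.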

Louise Faherty: What we recommend to our clients is that when it comes to PE, they should try to use MT. A lot of translators who are new to using MT will try and translate manually, which is a natural tendency, of course. But what we advise our clients is to copy and paste the translation (MT) in the engine and use the MT. The more you use MT and the more you Post-Edit, the better your engine will become.

Tony O’Dowd: I will add something to Louise Faherty’s comments there. The best example of PE recommendations that I have come across is provided by a group called TAUS. They are at the forefront of educating the industry on how to develop proficiency in PE.

Subscribe to the TAUS YouTube channel here.

Question: What do ‘PPX’ and ‘PEX’ stand for (as abbreviations)?

Answer (Louise Faherty and Tony O’Dowd): PEX stands for Post-Editing Automation. PEX allows you to take the output of an MT engine and dynamically alter it. When would you need to use PEX? Suppose your engine is repeating the same error over and over again. What you can do in such cases is write a PEX file (developed in the GENTRY programming language). The PEX file looks for patterns in the output of the engine and dynamically changes them.

For example, one of our French clients did not want to have a space preceding a colon mark in the output of their MT (because this was one of their typographical standards and repeated throughout the content). So we wrote a PEX rule that forced a stylistic change in the output of the engine. This enabled the client to reduce the number of Post-Edits substantially.
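The kind of rule described above can be sketched in a few lines of Python. This is only an illustration of pattern-based correction of MT output: real PEX rules are written in GENTRY, whose syntax is not shown in this post, so the regular expression and the function name `remove_space_before_colon` are assumptions made for this example.

```python
import re

# One or more spaces, tabs, non-breaking spaces (U+00A0) or narrow
# non-breaking spaces (U+202F) immediately followed by a colon.
SPACE_BEFORE_COLON = re.compile(r"[ \t\u00a0\u202f]+:")


def remove_space_before_colon(mt_output):
    """Drop any horizontal whitespace that directly precedes a colon."""
    return SPACE_BEFORE_COLON.sub(":", mt_output)


if __name__ == "__main__":
    print(remove_space_before_colon("Remarque : voir la page 3"))
```

Because the rule runs on every segment the engine produces, the post-editor no longer has to make the same correction by hand each time.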

PPX stands for Preprocessor Automation. You can use PPX files to normalise or improve the training data. It is based on our GENTRY programming language, which is available to all our clients for free.

In short then, PPX is for your training data, while PEX is for the actual raw output of your engine.
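As a rough illustration of what normalising training data can involve, the Python sketch below cleans source and target segment pairs before they would be used for training. The specific steps (Unicode NFC normalisation, collapsing whitespace, dropping pairs with an empty side) are assumptions chosen for this example, not a description of what PPX files actually do, and the function names are hypothetical.

```python
import unicodedata


def normalise_segment(text):
    """Apply Unicode NFC normalisation and collapse runs of whitespace."""
    text = unicodedata.normalize("NFC", text)
    return " ".join(text.split())


def normalise_pairs(pairs):
    """Clean (source, target) pairs and drop any pair with an empty side."""
    cleaned = []
    for source, target in pairs:
        source, target = normalise_segment(source), normalise_segment(target)
        if source and target:
            cleaned.append((source, target))
    return cleaned


if __name__ == "__main__":
    corpus = [("  Hello   world \n", "Bonjour  le monde"), ("", "orphelin")]
    print(normalise_pairs(corpus))
```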

For more questions and answers, stay tuned for the next part of this post!