Moses Use Case: KantanMT.com

Moses Core MT use case KantanMTJanuary 2015 marks the last month of the Moses Core project. The project started three years ago in 2012, as a collaborative effort by its members to improve translation processes and to create a competitive translation environment. Over those three years, the translation and MT landscape has changed significantly. This change and the project’s success is in no small part due to the hard work and diligence of the Moses Core project coordinator; TAUS  and with TAUS’s kind permission, KantanMT is republishing the MT use case for the KantanMT Community.

COMPANY NAME

KantanMT.com is a registered trademark of Xcelerator Machine Translations Ltd.

TIME IN MT BUSINESS

The platform was launched commercially in Q4 2013, however, we have been rigorously testing KantanMT.com in academic and commercial settings since 2012. In the beginning, the product was offered as a free trial to the KantanMT Community, and their feedback was instrumental in shaping and improving the platform to what it is today.

MOSES EXPERIENCE

The Moses technology has improved immensely over the past 12-18 months. Developer documentation and support materials, while initially very basic, have matured into a more structured, comprehensive and helpful resource. Additionally, the management of software distributions has made it easier to work with, understand and deploy. These are key elements in maintaining and supporting any open-source technology and have made Moses a key technology for the localization industry.

MosesCore

WHY MOSES?

The rise of the global economy and the driving demand for multilingual translation created a gap in the market for a sustainable translation method that could automatically scale to accommodate fluctuating translation needs. The KantanMT Development team was able to utilize the open source Moses decoder to develop a cloud-based Statistical Machine Translation (SMT) platform, where clients could build and manage their own customized MT engines without compromizing on the ownership of their data. The flexibility, scalability and security of the Moses toolkit made this possible.

The Moses toolkit offers the most flexibility in implementing an SMT solution for commercial purposes, as it allows the system’s training and decoding process to be modified. This has enabled the KantanMT team to create a high-value product that is dynamic and commercially relevant.

To ensure the product could scale and adapt to user needs the KantanMT team needed a decoder that could be built and managed on the cloud. The Moses system enabled this functionality.

Parallel language data is required to train an SMT engine. This data is an important resource for companies, and current generic SMT engines do not guarantee the security or safeguard the ownership of these assets. In using the Moses decoder, the KantanMT team created a product that could ensure its clients’ data was kept private, and not repurposed or reused in anyway.

Many global companies have large repositories of bilingual data, however, they often do not wish to deploy and maintain their own version of the Moses decoder. The KantanMT Development team was able to develop the sophisticated Moses SMT technology into a package that could be easily accessible to companies wishing to translate their content, and over time achieve localization cost savings.

MT STAFF

The current machine translation development team consists of four people, who maintain the platform and build machine translation engines for clients. Due to significant growth in the company over the past year, KantanMT.com will be hiring more staff over the course of the next few months to build engines for clients.

MT SYSTEM INFRASTRUCTURE

Insource or Outsource Moses/Implementation

Based on research, the demands of the language services industry and enterprise machine translation buyers, KantanMT has implemented and customized the Moses decoder in house to create a robust and commercially viable machine translation product that can scale and adapt to our clients’ needs. The original/base KantanAnalytics™ technology was co-developed with the CNGL Centre for Global Intelligent Content, an academic-industry research Centre based in Dublin City University, Ireland. However, all other KantanMT.com technologies have been developed in house by an in house expert development team.

Number of Engines

As of January 2015, the total number of MT engines built on KantanMT.com by the KantanMT community is 6,777 engines.

Volumes

As of January 2015, the total number of training words uploaded to the platform by the KantanMT Community has surpassed 50 billion, and the number of translated words on the platform is now more than 600 million.

USE SCENARIO

KantanMT preferred MT supplier bmmt
KantanMT.com Preferred MT Supplier

bmmt GmbH is a German language service provider with a strong focus on machine translation. It needed a Machine Translation provider, which would give the bmmt team full control of their Machine Translation training data and MT engine customization process at a low investment point. They also required a system which could correctly handle format-specific tagging and transparent transfer of mark-up information.

In early 2013, bmmt joined the KantanMT Community and began testing different customization processes using client specific training data. The team initially experienced minor problems with their SDLXLIFF files. However, the KantanMT development team were able to quickly solve this problem by restructuring some of its tokenizers.

The company began deploying production engines in mid-2013. These were showing particularly high Quality Evaluation (QE) scores due to the quality of their training data and resulted in a considerable increase in translation productivity. bmmt MT technicians found that domain specificity is a better basis for predictable output than sheer input size.

bmmt is currently using approximately 20 KantanMT engines in production across technical and automotive domains. These production ready engines are experiencing high quality metric scores for each language combination.

MARKET POSITIONING

KantanMT.com is one of the market leaders of cloud-based machine translation services. It provides cloud-based SMT services to major global enterprises and software companies wishing to translate large volumes of data. It works directly with companies to develop and implement a long term machine translation strategy, or it works with a select number of language service providers (preferred MT supplier partner program) to supply MT services to large enterprises.

VIEWS ON CURRENT STATE OF MT

Machine translation is now much more widely accepted in the industry, than it was just a few years ago. Since KantanMT.com entered the market in its testing phase in 2012, we have seen an enormous change in the attitudes and perception of MT in the language community. Access to technology such as smart-phones and tablets in non-English speaking nations has driven the global marketplace, and this in turn has increased the need for on-demand translation services – driving demand for MT services. The MosesCore Project has facilitated this demand with an open source solution that made it possible for smaller companies, and startups like us to compete against bigger MT providers, to solve the problem of language.

“The KantanMT platform sets a new industry benchmark in terms of analytics and development tools used to build and measure the quality of Statistical MT Engines. The KantanMT expert development team has introduced some of the industry’s most exciting and valuable technologies built on the Moses decoder, which are helping language and enterprise clients to translate more efficiently and reduce costs.” KantanMT.com founder and Chief Architect, Tony O’Dowd.

For more information on the Moses Core project or to access the original article, please contact TAUS (moses@taus.net) or to find out more about KantanMT.com contact Louise (info@kantanmt.com).

 

 

KantanMT @tekom Trade Fair and tcworld 2014

The KantanMT team are excited to be exhibiting for the first time at the tekom Trade Fair and tcword conference. This year, the event has found a new home at the International Congress Centre (ICS) Messe Stuttgart. This event, which is the largest of its kind is the biggest market place for technical communication in the world.

Not only will the KantanMT flag be flying high at the largest global TC event. But, KantanMT will also be taking part in sessions, tool presentations and offering personalized demos throughout the conference week. KantanMT are also offering its members a complimentary ticket to the tekom Fair with their registration.

KantanMT tekom trade fair, tcworld
Email us if you still need to get your free ticket (louisei@kantanmt.com)

Session: How does your machine Translation system measure up?

Tony O’Dowd, Founder and Chief Architect will be giving a presentation on evaluating machine translation. The presentation; ‘How does your machine Translation system measure up?’ is for localization professionals and will cover some of the most common yet critical issues for users of machine translation:

  • Measuring performance of Statistical MT
  • Recent advances in MT and data visualization techniques
  • Tracking MT efficiency in the translation process

Find it

What: Session

Where: Room C7.1OG

When: Wednesday 12th November @16:00 – 16:45

 

Joint Tool Presentation – Machine Translation for Translation Buyers: What is available and what is expected!

On the following day, KantanMT will be taking part in a joint tool presentation with German Language Service Provider (LSP) bmmt. Tony O’Dowd and Maxim Khalilov from bmmt will discussmachine translation for translation buyers: what is available and what is expected’. In this presentation, Tony and Maxim will give an overview of the current post-edited MT landscape and discuss with examples the formula for successful MT adoption, as well as what tools are available for global translation buyers. The full tool presentation program is available online on the tekom website.

Find it

What: Tool Presentation

Where: Room C10.1

When: Thursday 13th November @ 11:15 – 12:00

KantanMT_tcw-conference2014_Exhibitor

Personalized Platform Demonstrations

At the KantanMT exhibition booth, the KantanMT team will be giving personalized platform demonstrations that provide an ‘under the bonnet’ look at the cloud-based platform. The booth will be located in Hall C2 at booth A10, right next to bmmt; German LSP and KantanMT preferred partner.

Find it:

What: Personalized Platform Demonstrations

Where: KantanMT exhibition booth Hall C2, booth A10

When: Tuesday 11th – Thursday 13th November

 

Get the most out of the tekom/tcword conference – meet the teams

Large conferences and events can often be overwhelming and it’s easy to lose track of time and get wrapped up in the buzz and excitement of the event. To make sure you get the most from the conference, keep organized and make an appointment to speak with a member of the KantanMT or bmmt team.

KantanMT team – contact Louise Irwin (louisei@kantanmt.com)

Bmmt team – contact Peggy Lindner (peggy.lindner@bmmt.eu)

See you in Stuttgart!