Controlled English Rules for Translation

EC Translation publishes a set of Controlled English Rules. These simple rules ease the translation process and make translation memories more usable. Once you start applying these rules in your projects, your translation memories will provide better hits. The rules standardize writing style and word choice, so as you author source texts or write translations with them, the standardized phrases propagate through your translation memories.

The most pertinent rules for a Translation-Oriented Controlled Language have been selected from the eight rule sets under analysis and are presented here:

This practice will surely increase the leverage you get from your translation memory database.



OmegaT Version 2.6.3 Released

OmegaT is a free and open source multiplatform Computer Assisted Translation tool for language service providers and translators. It features fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.

On March 9, 2013, the latest version of OmegaT, 2.6.3, was released.

OmegaT 2.6.3 contains 27 enhancements and bug fixes. It comes with an updated English manual.

The biggest improvement is the handling of team projects.

Using a standard Subversion or Git server, an unlimited number of translators can collaborate on the same project, sharing translations and glossary entries.

The projects can be used both online and offline. When you go online again, changes made offline are synchronised.

Read-only access to a project is also possible.

Microsoft Translator (2 million characters per month are free of charge) is now supported. It requires setting identifiers, which can be obtained from the Windows Azure Marketplace.

The display of the match pane is now configurable, and a “diff” display is available to show new and deleted text in matches. The cursor is now available in all panes, so text can be selected and copied using the keyboard.

There are now markers for whitespace (space, tab, newline) and bidi characters in the View menu. Tag colouring now works for RTL languages/orientation. When a Word (.docx) document is translated and the target language is RTL, a number of changes are made to the target document:

– Paragraphs, sections and tables are set to bidi.
– Runs (text elements) are set to RTL.

In addition, all spaces are now preserved in Word documents.

Tag validation now supports partial verification, where only “severe” tag issues are reported.

It is possible to apply penalties to external TMXs, and to load a TMX as an “alternative source”, the source segment displaying both languages. Gzipped (.tmx.gz) TMXs are now supported.

In the list of files of the project, the encoding and the filter used are now displayed.

“Remove tags” is now a project property instead of a global option.

The PO filter now supports plural forms, and it is possible to ignore the headers in the Editor.

All the changes and bug fixes are detailed in changes.txt.

Compared with the latest version 2.6.2, there are 2 enhancements: OmegaT can now import the result of external scripts, and the parsing of external command arguments is more flexible.

For Git team projects, an issue with line endings on non-Windows machines was fixed.

You can download the standard version and latest version here:

And you can find more recent news about the CAT software and translation tools here:

News source: Didier Briel at OmegaT Yahoo Group, on March 9, 2013

The Lowest Prices for Turkish-English Translation has just announced its new automatic discount system… This new translation workflow automation system automatically detects regular customers and similarities between the texts of their orders, so that further discounts can be offered.

You can read the underlying innovations and a case study here in Turkish:

If you order more than one translation, you immediately get the lowest translation prices.

Announcement on Free memoQ Webinars for October 2012

Kilgray memoQ webinar dates for October are available on the Kilgray website. Check them out!

Kilgray’s webinars are always free. Make sure that you do not miss any of these presentations.

If you missed previous presentations or just want to watch them again, check out the recorded webinars page on the Kilgray website.

Details are at the Sip Sak Ceviri LSP blog:

OmegaT 2.5.5_03 and 2.6.1 released

OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.

On September 26, OmegaT standard version 2.5.5 update 3 and latest version 2.6.1 were released.

OmegaT 2.5.5 update 3 has a new file filter for Visio (*.vxd) files and a small improvement in the Linux Kaptain script.

In the TXML filter (the native OmegaT one) the content of revisions could appear. This has been corrected.

The German localisation has been updated to 2.5.5.

In addition to the elements above, OmegaT 2.6.1 has a number of improvements.

In the project file list, the encoding and the filter used are now listed.

The PO filter was improved. It is now possible to hide headers, and OmegaT supports plural forms.

In the HTML filter, there is now an option to compress whitespace and remove comments in the target document.

To allow using OmegaT on small screens, the Project Properties dialog can now be resized.

It is possible to load a TMX as an “alternative source”, which allows more than one source language to be displayed in the Editor for the current segment.

You can download the new version following the directions from
for standard and latest versions.

News source: Didier Briel at OmegaT Yahoo Group, on September 26, 2012

OmegaT 2.5.5_02 and 2.6.0_04 Released

OmegaT standard version 2.5.5 update 2 and latest version 2.6.0 update 4 were released.

OmegaT is a free Computer Aided Translation (CAT) application. It runs on multiple operating systems, including Windows, Mac OS X and Linux.

OmegaT is intended for professional translators. Its features include customisable segmentation using regular expressions, translation memory with fuzzy matching and match propagation, glossary matching, dictionary matching, translation memory and reference material searching, and inline spell-checking.

OmegaT shares many features with mainstream CAT tools. These include creating, importing and exporting translation memories, fuzzy matching from translation memories, glossary look-up, and reference and concordance searching.

OmegaT also has additional features that are not always available in other CAT tools. These include:

OmegaT can translate multiple files in multiple file formats simultaneously, and consult multiple translation memories, glossaries and dictionaries (limited only by available computer memory).

With regard to supported file types, OmegaT allows the user to customise file extensions and file encodings. For a number of document types, the user can choose selectively which elements must be translated (e.g. in Writer files, choose whether to include bookmarks; in Microsoft Office 2007/2010 files, choose whether to translate footnotes; or in HTML, choose whether to translate ALT text for images). The user can also choose how non-standard elements in third-party translation memories should be handled.

OmegaT’s segmentation rules are based on regular expressions. Segmentation can be configured based on language or based on file format, and successive segmentation rules inherit values from each other.

In the edit window, the user can jump directly to the next untranslated segment, or go forward or backwards in history. Users can use undo and redo, copy and paste, and switch between uppercase and lowercase in the same way as one would in an advanced text editor. The user can choose to see the source text of segments that have already been translated. The edit pane also has inline spell-checking using Hunspell dictionaries, and interactive spell-checking is done using the mouse.

Users can insert fuzzy matches using a keyboard shortcut or using the mouse. OmegaT shows the degree of similarity in fuzzy matches using colours. OmegaT can also display the date, time and the name of the user who translated any given segment. Glossary matches can be inserted using the mouse. The user can choose to have the source text copied into the target text field, or to have the highest fuzzy match automatically inserted.

In the search window, the user can choose to search the current files’ source text, target text, other translation memories, and reference files. Searches can be case sensitive, and regular expressions can also be used. Double-clicking a search result takes the user directly to that segment in the edit window.

After translation, OmegaT can perform tag validation to ensure that there are no accidental tag errors.

OmegaT can calculate statistics for the project files and translation memories before the project starts, or during the translation to show the progress of the translation job.

OmegaT can get machine translations from Apertium, Belazar and Google Translate, and display them in a separate window.

The various windows in OmegaT’s user interface can be moved around, maximised, tiled, tabbed and minimised. When OmegaT starts, a short tutorial called “Instant Start” is displayed.

You can download the new version following the directions from
for standard and latest versions.

Sources of Freelance Translation Work

If you choose freelance translation, you should consider yourself a one-person translation company. Your main concern will be where to find work. The need for freelance translation is greater than anyone can estimate, and is clearly growing at a rapid rate. Worldwide, translation is a multibillion-dollar industry. But finding translation work on your own is easier said than done. The main problem is that translation is hardly ever a steady, ongoing function of any particular work source, such as an embassy, a company, a government agency, or even a publisher. None of those needs translation every day of the year. Each of them may need a great deal of translation all at once (more than any one person can handle within the given timeframe), and then none for a long time. And, if any one of them needs translation on an ongoing basis, chances are a decision will be made to hire an in-house translator rather than farm out the work.

The fact remains, however, that a well-rounded freelancer can earn well over $50,000 a year, and, in the case of highly-specialized technical translators in major languages like Spanish, German, Japanese or Russian, even $100,000 or more. The secret to all of this is establishing for yourself a good clientele. There are two ways of doing this. The first, and by far the hardest, is finding your own clients and working with them directly. You may want to contact embassies, law firms, publishers, government agencies and so on, and solicit work directly from them. If you are fortunate enough to find some good steady clients on your own, you will be doing quite well. But the problem often lies in the word “steady.” What seems to be a steady client today may not be so steady tomorrow.

This brings us to the second, and by far the safer option, which is a translation agency. There are hundreds of them in the United States, and they handle huge amounts of translation business every year. Below, we will discuss translation agencies, as well as direct sources of translation available to the freelancer.

Translation Companies

Translation companies, also known as translation agencies, or translation bureaus, are for the most part privately owned commercial establishments ranging in size from one or two employees to ten or more, but hardly ever more than ten. Some are divisions of larger companies, offering translation as a secondary function. Some specialize in one language only, such as Spanish, German, or Japanese. Most offer several languages, and quite a few bill themselves as offering “all languages”. This last type is somewhat pretentious, since there are more languages in the world than any one person can identify. But what they really mean is that they will make the effort to find a translator in almost any language they may be called upon to translate.

As a general rule, translation agencies employ relatively few in-house translators, since the flow of work in any given language is usually uneven. Instead, they rely on the services of a network of hundreds of freelancers who can handle a great variety of subjects. Those freelancers are located all over the United States and even abroad. The ones who are most reliable and professional get the major share of the work, and some of them earn the above-quoted figures.

As a freelancer, you need to cultivate at least one such agency, preferably two or three. The problem in working with only one is that, with few exceptions, there may not be a steady flow of work coming out of any given agency in any given language, in subjects you are equipped to handle. Two or three will give you better coverage, and assure a better flow. On the other hand, you may find yourself in a situation where all three ask you to do something at the same time, and you may not be able to do it. You need to establish an understanding with your agencies that would make an allowance for such a scenario, so that you don’t spoil your relationship with any one of them.

The worst thing you can do as a freelancer working with translation agencies is to overcommit yourself. Your most important personal asset is your reliability. Once you fail to meet deadlines (keep in mind – the agency stands to lose a client if deadlines are not met), your reliability becomes questionable, and if you do it once too often, you may soon find out that those phone calls from the agency offering you work assignments stop coming.


Reference: “The Translator’s Handbook”, Morry Sofer, Chapter 11, pages 123-124

Turkish Segmentation Rules for Better TM Matching in OmegaT

OmegaT’de Daha İyi ÇB Eşleşmesi İçin Türkçe Dilimleme Kuralları

Kesme/İstisna seçili olanlar (rules with the Break/Exception box checked):

Örüntü Öncesi sütunu (‘pattern before’ column):
Bu nedenle,
Ek olarak,
Bu çalışmada,
Bu makalede,

Örüntü Sonrası sütunu (‘pattern after’ column):

Kesme/İstisna kutusu boş olanlar (rules with the Break/Exception box unchecked):

Örüntü Öncesi sütunu (‘pattern before’ column):

Örüntü Sonrası sütunu (‘pattern after’ column):

Here, [0-9]:[0-9] keeps time values such as 12:21 PM from being split. The [1-9]\. rule is for numbered items. The [\.\?\!][‘”] rule breaks at the quotation mark that closes a sentence. The “Bu nedenle,” rule breaks at common conjunctions at the beginning of sentences to increase the chance of a TM match. You should add similar frequent conjunctions (“However,”, “In addition to this,”, “In this study,”, etc.) for English as well.
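To make the interplay of the two columns more concrete, here is a minimal Python sketch of how such rules could be evaluated. It is a simplified model and not OmegaT's actual implementation: the `Rule` type, the `segment` function, the rule order, and the assumption that the [1-9]\. rule acts as an exception while “Bu nedenle,” acts as a break rule are all illustrative choices. At every position the first rule whose ‘pattern before’ and ‘pattern after’ both match decides whether the text is split there.

```python
import re
from typing import List, NamedTuple


class Rule(NamedTuple):
    is_break: bool  # True: break rule; False: exception rule (assumed meaning)
    before: str     # regex that must match the text ending at the position
    after: str      # regex that must match the text starting at the position


# Illustrative rule set; the exception is listed first so that it wins.
RULES = [
    Rule(False, r"[1-9]\.", r"\s"),     # exception: keep "2. adımda" together
    Rule(True, r"Bu nedenle,", r"\s"),  # break after a leading conjunction
    Rule(True, r"[.?!]", r"\s"),        # break at ordinary sentence ends
]


def segment(text: str, rules: List[Rule] = RULES) -> List[str]:
    """Split text into segments; the first matching rule decides each position."""
    segments = []
    start = 0
    for pos in range(1, len(text)):
        for rule in rules:
            before_ok = re.search(f"(?:{rule.before})\\Z", text[:pos])
            after_ok = re.match(rule.after, text[pos:])
            if before_ok and after_ok:
                if rule.is_break:
                    segments.append(text[start:pos].strip())
                    start = pos
                break  # an exception rule stops here without splitting
    segments.append(text[start:].strip())
    return [s for s in segments if s]


print(segment("Bu nedenle, test 2. adımda biter. Saat 12:21 oldu."))
```

With these assumed rules, the conjunction becomes its own segment, the numbered item is not split after “2.”, and the time value stays intact because no break rule fires inside it.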

Translation Memory of Dates for the Years 2010-2020

Every language has many de facto formats for writing dates, and Turkish is no exception. But translators in this era need standards to keep up with the breathless pace. Any non-standard term or phrase causes a TM miss, and hence lost time.

The following translation memory in TMX format contains English translations of approximately 14,000 dates written in various Turkish date formats. Importing it into your CAT software ensures a correct, standard translation of dates. For example, 21.12.2012, 21/12/2012, and 21 Aralık 2012 all have the single corresponding translation December 21, 2012 in this TMX.
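The many-to-one mapping described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the script used to build the TMX: the function name `to_english_date` and the three supported formats are assumptions based on the example dates in the text.

```python
import re

TURKISH_MONTHS = ["Ocak", "Şubat", "Mart", "Nisan", "Mayıs", "Haziran",
                  "Temmuz", "Ağustos", "Eylül", "Ekim", "Kasım", "Aralık"]
ENGLISH_MONTHS = ["January", "February", "March", "April", "May", "June",
                  "July", "August", "September", "October", "November",
                  "December"]

# 21.12.2012 or 21/12/2012
NUMERIC = re.compile(r"(\d{1,2})[./](\d{1,2})[./](\d{4})\Z")
# 21 Aralık 2012
WORDED = re.compile(r"(\d{1,2}) (\w+) (\d{4})\Z")


def to_english_date(turkish_date: str) -> str:
    """Map one Turkish date string to the single English form 'Month D, YYYY'."""
    match = NUMERIC.match(turkish_date)
    if match:
        day, month, year = (int(g) for g in match.groups())
    else:
        match = WORDED.match(turkish_date)
        if not match:
            raise ValueError(f"unsupported date format: {turkish_date!r}")
        day = int(match.group(1))
        # list.index raises ValueError for an unknown month name
        month = TURKISH_MONTHS.index(match.group(2)) + 1
        year = int(match.group(3))
    if not 1 <= month <= 12:
        raise ValueError(f"invalid month in {turkish_date!r}")
    return f"{ENGLISH_MONTHS[month - 1]} {day}, {year}"


for source in ("21.12.2012", "21/12/2012", "21 Aralık 2012"):
    print(source, "->", to_english_date(source))
```

All three source strings yield December 21, 2012, which is why each English date in the TMX has several Turkish source segments.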

You can download this Turkish-English translation memory here: tren-tarihler-2010-2019.rar

PS: This TMX is a many-to-one mapping, so it is not suitable for use in the other direction. For the English-to-Turkish version, please check the website.
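Whether a TMX can safely be used in the reverse direction can be checked before importing it. The sketch below uses only Python's standard library; the helper names `read_pairs` and `ambiguous_targets`, the inline `SAMPLE` document, and the assumption that language codes are exactly "tr" and "en" are illustrative and not taken from the actual file.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# ElementTree exposes the xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Tiny hypothetical TMX document used only for demonstration.
SAMPLE = """<tmx version="1.4"><header srclang="tr"/><body>
<tu><tuv xml:lang="tr"><seg>21.12.2012</seg></tuv>
<tuv xml:lang="en"><seg>December 21, 2012</seg></tuv></tu>
<tu><tuv xml:lang="tr"><seg>21/12/2012</seg></tuv>
<tuv xml:lang="en"><seg>December 21, 2012</seg></tuv></tu>
<tu><tuv xml:lang="tr"><seg>21 Aralık 2012</seg></tuv>
<tuv xml:lang="en"><seg>December 21, 2012</seg></tuv></tu>
</body></tmx>"""


def read_pairs(tmx_text, src_lang, tgt_lang):
    """Return (source, target) segment pairs from a TMX document string."""
    pairs = []
    for tu in ET.fromstring(tmx_text).iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                segs[lang] = "".join(seg.itertext())
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


def ambiguous_targets(pairs):
    """Map each target that has more than one distinct source to its sources."""
    sources_by_target = defaultdict(set)
    for source, target in pairs:
        sources_by_target[target].add(source)
    return {t: sorted(s) for t, s in sources_by_target.items() if len(s) > 1}


print(ambiguous_targets(read_pairs(SAMPLE, "tr", "en")))
```

A non-empty result means that at least one target segment has several source segments, so reversing the memory would leave the CAT tool with competing translations for the same text.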

Informatics Terminology Database (English-Turkish)

Turkish language service provider EC Tercume has just released a new terminology database for the informatics domain. This database in TMX format can be downloaded here:

It is stated that the database can be used bidirectionally after you import it into your own Computer Aided Translation (CAT) software (e.g. Deja Vu X2, Trados, memoQ, or OmegaT).

These computer terms are from the informatics glossary prepared by the Turkish Informatics Association.