That information technology has revolutionized the translator’s
working environment is a fact so obvious that it no longer
even bears mentioning. For the vast majority of translators
and their clients, computers have long replaced typewriters
and reams of paper. Modems and e-mail have replaced manila
envelopes, mass storage devices have replaced drawers full
of folders, CD-ROMs supplement dictionaries and encyclopedias,
and more recently the Internet and its resources more and
more often save us a trip to the library, while various online
communities have brought many translators out of their isolation.
But information technology not only offers
us tools. It is itself a field in which more and more translation
work is actually performed. This is certainly true of other
fields like marketing materials, packaging materials, advertising
copy, and manuals. But in the case of information technology
products, it is frequently the products themselves that need
to be translated. Whenever a program or process displays a
word or a phrase on the screen, this means potential work
for a translator or many translators.
In an article on a recent market study by Ovum Reports,
The Expanding World of Globalization (Language International
9.2 1997), Liza Loughman reports that the
total worldwide revenues for globalization-related services
will increase from USD 2.8 billion in 1997 to USD 6.2 billion
in 2000. Ovum forecasts that Japanese will account for 31%
of all localization service revenues in 2000, up from 18%
in 1994, with German falling from 18% in 1994 to 12% in 2000
— which in absolute figures still represents an increase
from USD 352 million in 1995 to USD 639 million in 2000 and
continues to put German in second place. Relatively smaller
slices, but a larger pie. A verylarge pie.
International
users
International users of computer software have come to expect
their software to “talk” to them in their own
language. This is not only a matter of convenience or of national
pride, but a matter of productivity. Users who understand
a product fully will be more skilled in handling it and avoid
mistakes. So they will prefer applications in their language
and adapted to their cultural environment.
The international market has become very important
for software manufacturers. Many U.S. manufacturers derive
a large percentage of their revenues and profits from international
sales. Competition is fierce, and those companies who are
best at anticipating the needs and preferences of their international
users will benefit most.
With the advent of the Web, it has become vital for
many manufacturers and distributors to maintain a world-wide
presence. For this presence to be effective, it is not enough
to simply be there. It is equally important to make sure the
message gets across, and once again, that means: translation.
Localization
and internationalization
Internationalization , also referred to as globalization,
is the process of designing or redesigning a product so that
it can be localized with minimal changes.
Localization is the process of adapting
a product, in our context a software program, to a specific
locale, i.e., to its language, standards and cultural norms
as well as to the needs and expectations of a specific target
market. A properly localized product also meets all the legal
requirements in force in the user’s region.
Users can interact with a successfully localized product
in their own language and in a setting that feels natural
to them. This means, for instance, that all messages are in
their own language, that they can input names, addresses,
dates, and other data in the same way they would write them
down on paper, that they can freely use their standard keyboard
characters wherever an entry can be made, and that any error
messages are comprehensible to them rather than representing
American geekspeak. Localization comprises the program itself
and any online or printed documentation.
Typically, localization is undertaken from English
into other languages, as most software continues to be developed
in the U.S., and even some of the software developed elsewhere
is originally developed in English.
Localization is performed by taking the source code
for a product developed for one country and modifying the
source code and product to satisfy the needs of other countries.
Often teams of developers in different countries are needed
to adapt products. If the original product is not built with
a view toward being localized, this can be a very expensive
and time-consuming process. There is the direct cost of multiple
development teams modifying the source code of the original
product. This process also produces multiple code bases, which
makes future development and maintenance more complex.
Plenty of books on localization and internationalization
have been written that are addressed to software engineers,
web site designers, and others. Unfortunately, the role of
the translator in the localization process has, to my knowledge,
not been given any specific attention.
Introductions
to localization and internationalization
International Consulting has an instructive
overview over localization and internationalization, including
many technical and business aspects.
ILE in Boulder, Colorado, has compiled an interesting
document
in Adobe Acrobat format that lists a lot of interesting
facts about everything that should be taken care of before
we translators see the text to be translated. “This
informative guide to product internationalization explores
the ways you can develop your software, on-line help systems,
documentation and audio that will make localization as straight-forward
as possible. This overview can serve ... as a primer for anyone
involved with localization.” (Note: This document is
protected and cannot be printed, only viewed.)
How localization is
usually organized
Translation companies are often chosen to
provide localization. They eliminate the need for companies
to maintain expensive in-house personnel. In addition, they
offer a broader range of services, technical expertise, and
flexibility than freelance translators and also eliminate
the need for the company to manage a large pool of freelancers.
Translators who work on localization projects are often
part of a large and distributed team. Its complex, multi-layer
organization poses new challenges to those independent translators
who usually work on neatly delimited projects which they have
sole responsibility for. Working on localization projects
is more akin to working in the translation department of a
larger company, only that the translator does not enjoy many
of the benefits of on-the-job training and rapid information
flow which characterizes an in-house setting. In actual reality,
translators are expected to have complete command of the tools
required for software localization, know the market and a lot of products, know
their own position in the process and workflow, and understand
the constraints involved in ever shorter production cycles
— all on their own.
It is for this same reason that most translators working
in localization do so through intermediary translation bureaus,
which are often specialized localization or “language
engineering” companies. The majority of software manufacturers
now outsource their localization activities and maintain only
a skeleton team for interfacing with their supplier. These
companies, which can be responsible for as many as 10 to 20
languages or even more, will pass the documents on to subsidiary
partners in the target countries or directly to freelance
translators, either in the U.S. or overseas. These will typically
be some in-house staff, but most of the actual translation
work will probably be done by freelance translators. It is
not even uncommon for corporations to outsource the localization
of their documentation to multiple translation companies.
Project scheduling
Ideally — marketing managers say —
the localized versions will be completed at the same time
the original version goes to market. Ideally — translators
say — localization begins when the software and documentation
are finalized. It is immediately apparent that these are irreconcilable
demands. In actual practice, therefore, it is attempted to
follow a stringent project schedule where localization is
only one step behind development.
Unfortunately, development does not follow a linear
schedule; features are developed concurrently with the documentation,
and the latter is written, rewritten, and re-rewritten all
the time. What translators get to translate is not infrequently
a not-quite-final version of the ultimate product. It would
seem that programs should be developed from specifications,
and documentation should be developed from, and document,
the completed program. We have to know how we translate “Press
Enter to continue” in the software itself before we
can tell the presumptive user what the program says when we
describe the various steps in the online help.
Furthermore, translation follows a cycle of comprehension.
As translators are virtually never given more than the briefest
of summaries of the product specifications (if that!), when
it is time to translate the software, we must do a lot of
guessing as to what function actually does what and consequently
what to call it. Often enough, though, the purpose of a function,
dialog box, or command will become apparent to the translator
only when he or she finally gets to the help file that explains
it. In this case the translator may have to go back and change
the term that was used in the first version of the software
translation; and it may not even be the same person doing
the software and the help, which complicates matters. Too
often the software is already “frozen”, that is,
ready for production with no additional changes possible by
the time enlightenment comes.
One of the attributes that characterize successful
and sought-after software translators is precisely the ability
to guess correctly about what a given software string or dialog
box or function actually does, so as to avoid having to loop
back wherever possible. It is here that experience plays an
important role. All software projects should include at least
some “old hands.” Software translations done only
by inexperienced translators usually give away the fact that
the translators did not have a clue. Their experienced colleagues
may not have had a clue either, but if they have a feel for
their work, often no one will notice.
It may irk those of us for whom quality exclusively
means the linguistic quality of the final product, but: there
are other aspects to quality, and they do tend to come to
the fore in software localization.To the client, sometimes a linguistically mediocre translation
(craftsperson translation) delivered on time is much more
valuable than a perfect one (artist translation) that is three
days late. Sometimes errors and inelegancies can be caught
further down the line, in the various editing steps that (should)
follow translation, but a file that is delayed three days
may hold up an entire project and cost the client thousands
of dollars in lost time-to-market.
Components of
a software localization project
When localizing software, we are dealing
with a number of different file types. The following subsections
will present a few selected aspects of frequently encountered
file types for the various Windows platforms. I apologize
to all prospective Mac localizers, but I have no idea how
these things work on a Mac. I am therefore a good example
of someone who should not undertake localization for the Macintosh
platform. If you do not know either and cannot find out, neither
should you...
Resource
files
In localization, the visible part of the software, the user
interface, is usually simply called software per
se. In properly developed software, the texts the user sees
are included in separate files, the so-called resource
files. They contain everything the user is likely to
see and that is not created when the program actually runs
(at runtime): menus, dialog boxes, error messages,
bitmaps, cursor shapes, and so on. Help files are resources,
too, but they are usually treated differently.
The following is an example of a very simple Windows
dialog box and its representation in a resource file (in a
Windows environment this is called an .RC file, where
.RC stands for resource compiler).
IDD_SELECT DIALOG DISCARDABLE 0, 0, 167,
106
STYLE DS_MODALFRAME | WS_POPUP | WS_VISIBLE | WS_CAPTION |
WS_SYSMENU
CAPTION " Select an object "
FONT 8, "MS Sans Serif"
BEGIN
DEFPUSHBUTTON " OK ",IDOK,108,8,50,14
PUSHBUTTON " Cancel ",IDCANCEL,108,24,50,14
LISTBOX IDC_TOOLBAR_NAMES,8,8,92,88,LBS_SORT |
LBS_NOINTEGRALHEIGHT | WS_VSCROLL | WS_TABSTOP
PUSHBUTTON " &Help... ",IDHELP,108,40,50,14
PUSHBUTTON " &Rename... ",IDD_RENAME,108,64,50,14
PUSHBUTTON " &Delete ",IDD_DELETE,108,80,50,14
END
You will recognize the various elements.
The texts marked in red here (in actual files they are not
marked in color) are the texts to translate. You must not
translate anything else, including the “MS Sans Serif”
which deceptively enough is also enclosed in quotes but is
the name of the font used in the dialog box. The “&”
precedes a letter (shortcut) that is underlined in the actual
box. The “...” must be preserved as it tells the
user that another dialog box follows when that button is clicked.
— Everything else is information that tells the compiler
what to put where in the box. Chances are that the boxes need
to be resized because the translation comes out longer than
the original. This is not something that is usually included
in the translation part of localization, although some translators
possess the requisite tools to do so and offer this as a value-added
service.
Error messages are more straightforward:
STRINGTABLE DISCARDABLE
BEGIN
IDS_WINEXEC_ERROR0 " The following
error occurred:\n\nSystem was out of memory, executable file
was corrupt, or relocations were invalid. "
IDS_WINEXEC_ERROR2 " The following
error occurred:\n\nFile was not found. "
IDS_WINEXEC_ERROR3 " The following
error occurred:\n\nPath was not found. "
IDS_WINEXEC_ERROR5 " The following
error occurred:\n\nAttempt was made to dynamically link to
a task, or there was a sharing or network-protection error.
"
IDS_WINEXEC_ERROR6 " The following
error occurred:\n\nLibrary required separate data segments
for each task. "
IDS_WINEXEC_ERROR8 " The following
error occurred:\n\nThere was insufficient memory to start
the application. "
IDS_WINEXEC_ERROR10 " The following
error occurred:\n\nWindows version was incorrect. "
END
The “\n" stands for a newline
character. i.e. this is where a new line will begin in the
actual text.
Help files The
source files for Windows help files are usually .RTF files.
Windows help compilers take this file and convert it to the
familiar hypertext formats you see when pressing F1 in a Windows
program.
The Hypertext organization (hypertext is
text in which you can click on certain hot spots that will
take you somewhere else, usually to a subtopic relevant to
what you want to know) is represented in .RTF files in the
form of single- and double-underlined texts, hidden text,
and footnotes.
The title must be translated. All
the normal text must be translated as well.
The double-underlined parts represent
hyperlinks. In the .RTF file they are followed by hidden
text that tells the compiler what page to go to when
this link is clicked. The double-underlined part must be translated.
The hidden text must remain untouched.
The single-underlined parts represent
links to a help file glossary. They, too, are followed by
hidden text that tells the compiler what
entry in the glossary to show in a small window when this
link is clicked. The single-underlined part must be translated.
The hidden text must remain untouched.
The colors guide users. They should
be retained for the translated text.
The help page starts with several footnotes
before the title. These footnotes must never be removed.
As can be seen at the bottom of the example, there are different
types of footnotes.
"#"
footnotesare never translated.
They represent hyperlink targets. If a different page wants
to link to this page, it would do so by placing “Example_Application_Welcome_Menu"
in hidden text behind a hyperlink. Any changes here destroy
the help file.
"$"
footnotesare titles as they
appear in the Help content list. These footnotes must be translated.
It is recommended that this text be the same as the help page
title.
"k"
footnotes are entries in the Help index list.
They must be translated, but with the same caveats that hold
for index entries in documents.
"+"
footnotesare internal compiler
information and must not be translated.
Some help compilers support additional footnote classes.
If you now remember that graphic elements are treated
like letters by the program with which you work on the .RTF
file (probably MS Word), i.e. they can be underlined and used
as hyperlinks, you know most of what you have to know to confidently
start translating help files without destroying them.
Readme
files
Readme files usually contain late-breaking news that did not
make it into the documentation, additional setup information,
or corrections and additions to the manual. Usually these
are plain text files. If you use your word processor to work
on these, you must remember not to save them in the word processor’s
format but as plain text, using the correct character set
(e.g. Text only or MS-DOS Text). Otherwise
you should try to break lines so the result looks pleasing.
Readme files are pretty much the only place where hard returns
at the end of a line and formatting text with spaces are ever
allowed!
Screen
shots and bitmaps
Some of the elements of a user interface may be present in
the form of image files in which the information is present
as pixels and cannot be edited in the word processor or editor.
These bitmaps will be recreated with an image-processing program,
and the translator’s job is to furnish the translations
to be included. In simpler cases the translator may be asked
to overwrite graphics files. This should be billed as a value-added
service.
Examples of bitmaps are the splash screens, the
colorful rectangular messages that often greet users when
a program is started.
Word processing
and DTP files
Documentation (manual) files are often presented to translators
in heavily formatted form, with many hallmarks of document
automation, with the expectation that translators overwrite
them; this of course presupposes that translators know how
to use their tools. You will find more details in the section
Writing tools and their limits.
Client-prepared
file formats
A number of mostly larger localization companies take the
resources and files to be translated and produce text files
from them. These text files are the files that the translators
get to work on.
These companies will send you, along with the text
to be translated, instructions about how to use their formats,
where you should write your translations, what elements you
must not change, and the meaning of various tags and hints
you may encounter in the files you are working on.
One advantage of this procedure is that translators
can be provided with one set of instructions for a variety
of different source file formats. A second advantage is that
you can translate files for operating system environments
other than your own. If the target system is a system that
few people have installed, this may be the only way to work
on the files to be translated.
Incidental
files
There are a variety of other file types: for packaging, warranty
cards, and other purposes; these are too diverse to be listed
here. They do not usually present any specific problems.
Basic references
Bilingual dictionaries
In many other fields, one of the most important and most frequently
consulted references are bilingual dictionaries, often in
paper form, sometimes on CD-ROM. Bilingual dictionaries always
have their limitations, but working in a field that does not
have these references makes you appreciate them more. Bilingual
dictionaries in this field are often partly obsolete as soon
as they appear. Not only do new words come up and old words
fall into disuse, but the way existing words are used will
also change rapidly. Remember what a Winchester drive is?
Web
sitesand computer magazines
The vast majority of original published material, digital
or otherwise, is written in English today. This is why localization
is so often a one-way street from English to other languages.
At the same time, we are often faced with the necessity of
coining the words we need ourselves, or making an executive
decision between several alternatives, none of which has yet
become established.
Carefully written and edited computer magazines in
the target language are a very useful source of computer terminology.
If you don’t live in the country of your target language,
you should still be able to read them on the Web. Supposedly,
computer magazines are a little less ephemeral than web sites,
but a web site may be more on top of late-breaking news and
is also easier to search.
However, the approach of jumping on the bandwagon of
preliminary journalese “translations” as they
are usually found in the first excited account of a new program,
process, or device and of sanctioning them by copying is,
in my opinion, a serious mistake. It is precisely in the computer
field that we can do something for our readers by choosing
our words conscientiously and prudently.
Those of us who are confronted with English and with
computers and computer-related texts in our own languages
every day might tend to think that our languages are just
inundated with English loan words. On the whole, however,
this impression is misleading. Of course there are differences
in degree between the various European languages (I cannot
speak about any others) but it is probably true everywhere
that modern sports, some technical and scientific fields,
computers, and computer-related topics, and certain aspects
of cultural — particularly those that address a more
youthful audience — tend to be full of “cool”
loan words. But if you read newspaper and magazine articles
that do not happen to concentrate on these topics (and a good
number of those that do), chances are that the proclaimed
“inundation” of our language with Anglicisms is
fortunately still more or less limited to what is reasonable.
So with these caveats, computer magazines and carefully
edited web sites (too many are, unfortunately, hastily thrown
together by people with not much respect for their readers)
in the target language are probably the best sources of computer
terminology.
General monolingual
computer-related glossaries
Fortunately, it is not usually a problem finding out what
a technical term or abbreviation means. There are too many
glossaries to list here individually, but here are some good
starting points for finding them:
The University of Wasa, Finland, has an impressive
collection of computer glossary links for Chinese, Croatian,
Dutch, English, Finnish, French, German, Greek, Hungarian,
Italian, Norwegian, Portuguese, Spanish, Swedish, Russian,
Vietnamese plus some multilingual ones:
Term-Online, Online Special Language Glossaries — Computing,
Internet.
The
Translator’s Home Companion also lists a number
of interesting starting points for terminology searches.
A by now famous source for finding out the meanings
of obscure computer-related terms is the
Jargon File, the electronic Version of The New Hacker’s
Dictionary.
A well-known and well-kept listing is BABEL:
Glossary of Computer Abbreviations and Acronyms.
Most of the time when we translate software and related
documentation, we are not concerned with the fine points of
computer terminology, as the software is a tool that is supposed
to help the user get some specific work done. So the translator’s
requisite expertise must be in the field in which the software
is supposed to do its job. All the computer expertise in the
world still leaves the translator clueless when a help file
contains pages upon pages of dense accounting jargon. Translators
must not fall into the trap of accepting any software-related
assignment just because they know a lot about computers. The
results can be disastrous.
Your computer
Your most important tool is also your most important reference:
your computer itself. I am going on the assumption that you
are working on the same platform that the software product
is for — which is the recommended procedure. In that
case it makes eminent sense to use the localized version of
your operating system in the target language. One of the most
important aspects of a good user interface is that it has
to be consistent. So it is always a good idea to have everything
that your translation is supposed to be consistent with
at your fingertips.
Industry-standard
glossaries
For the same reason, user interface consistency, it is important
that all references to the operating system and other industry
standard programs are correct and use the same terminology.
If a user, for instance, is directed to click on an icon in
the Control Panel of the operating system employed, but the
name of that icon is translated by a term that is different
from what the actual icon is called, the instruction becomes
useless, and the user is led astray.
For some manufacturers there are glossaries that contain
every menu item, every message, every dialog box entry that
appears in their products.
Microsoft has made the
glossaries for its products (operating systems and applications)
available. They can be downloaded with any Internet browser.
Novell also offers its
glossaries.
There is no similar resource for Unix, and it is also
unlikely that there will ever be a general one, since there
are too many different flavors of Unix, a fact that is compounded
by localization.
To the best of my knowledge, and judging from translators’
repeated desperate pleas, Apple has not made its glossaries
available.
Client glossaries
Client glossaries, if they exist, are usually straight term
lists, often culled from earlier projects. Contextual information
is rarely given.
Like so many things in a translator’s life, the
client glossary situation is often best described by the expression
“feast or famine”. Sometimes you get nothing at
all. But some clients will try to give you a printed glossary,
sometimes hundreds of pages long. Working with these is hopeless.
You cannot find what you need in a reasonable amount of time.
Insist that the glossary be made available in electronic format.
But even electronic glossaries are sometimes too large to
be useful or fractured into too many files.
The most useful client glossaries come in electronic
format, as tables that can be read, searched, sorted and otherwise
operated on with a standard word processor or spreadsheet
program. There should be columns for source and target terms,
domains, syntactic information where needed, and — very
importantly — where that term occurs in the project.
Tabular output of terminology management systems would seem
to be ideally suited for this purpose.
If clients have a problem maintaining proper glossaries,
why not convince them that they can save a lot of time in
the long run, and offer to do it for them? Although terminology
management tools are frequently discussed among translators,
most of us do not yet use any organized form of terminology
management. What better way of learning how to use them, or
not to use them, than in the context of a billable job?
Client style
guides
Most localization companies issue their own style guides for
the various languages they handle. Often these appear to be
written for non-native speakers, as they have a tendency to
belabor points that a native speaker knows without having
been told.
However, a well-designed style guide will help guide
the translator in making stylistic decisions, such as whether
to put commands in infinitive or imperative form, how to phrase
chapter headings so they are stylistically congruent, and
other things. Of course a style guide cannot substitute for
your translator’s common sense and writing skills, but
it may be a great help in tying the different parts of a project
together and smooth out the stylistic differences between
different translators.
Project glossaries
The minimum you will need to know to be able to translate
help files or manuals is what the software texts in the actual
product are. Normally they should be available to you in tabular
format; if you don’t have one, ask for one, maybe the
project manager forgot or did not know.
Sometimes even this minimum does not exist. In that
case you can (under Windows) at least put all target terms
into one file to search by concatenating them. (Hush —
this is best done in DOS: Open a DOS window, go to where the
.RC files are, type COPY *.RC ALL.DOC, and open ALL.DOC in
your word processor. Voilà.)
But seriously, how are you going to translate a manual
in which every other sentence literally quotes program messages
which you don’t know just how they are going to appear
in the target language? Surely your customer did not think
that you will arrive at the exact same wording as your colleague
who translated (or will translate) the user interface because
there is always only one correct translation? On second thought,
maybe...
Writing tools and
their limits
Many translators consider themselves wordsmiths
and definitely not layout tinkerers. In translating help files
and manuals these days, however, an understanding of word
processors that goes beyond mere survival level is usually
a must. The PC is not a typewriter! (There is even a book
by this name, by Robin Williams.) There should be, however,
limits to what should be expected of you “for free.”
Word processing
You will most probably be given files with already formatted
texts which you will be expected to overwrite, preserving
the original format to a tee. Normally there simply is no
time for someone to copy your foreign-language text-only translation
into these highly complex files, especially since this would
require target-language expertise. Make sure you understand
styles, hidden text, fields, tables, graphics, and other objects
that will crop up in your texts. If you are open to this way
of working, you will find that it has its rewards. You will
find that seeing near-final format all the time and having
the old and the new text in the same file and thus right in
front of you is a relief to the eyes. You can use search and
replace (if your target language is amenable to that, there
are a few tricks to doing this well.)
I do not believe that overwriting a localization help
or manual file in a word processor goes beyond what can be
expected from a translator knowing his tools today. Nor do
I, frankly, believe that this is worth a surcharge, but should
be figured into your general rate for localization.
You might, however, try to negotiate a special (fixed
or time-based) price for creating, generating, and checking
a foreign-language index. It is usually not enough to simply
translate the English entries (they are often hidden inside
the text and — caveat vendor! — usually not counted
by counting routines). You can check the translation of the
entries for conformity with general usage in the target language,
understandability, and consistency by generating the index
and going back and forth. This should not be left to the engineers
who don’t speak the target language, so you should offer
to do this. It is not that difficult to learn how to generate
index entries and indexes in most word processors.
DTP and engineering
tools
These are notthe translator’s responsibility.
Creating a layout, whether in a word processor or in
a DTP program
such as Framemaker or QuarkXPress, requires expert skills.
If you are an expert, go ahead and offer this value-added
service. But do charge extra for it.
Overwriting files in a DTP program is something that
we are asked to do more and more frequently. What our clients
often forget is that DTP programs are not made for writing,
they are made for layouting. (Re)creating text in a DTP program
can slow you down considerably, and these programs are also
expensive and take time to learn. If you want to invest and
offer this value-added service, do it by all means, but do
remember that the slower speed will reduce your earnings per
hour unless you consider this in your price.
Testing files is engineering work. I would go so far
as to suggest that checking existing document automation features
(index, table of contents) that you overwrote in your regular
word processor is something you should be able and willing
to do. (But remember that creating a proper index —
as opposed to simply translating an English one, which usually
yields disastrous results — is a lot of work and should
usually not be billed at a word rate.) When you overwrite
an HTML file, I think asking you to check the result in a
browser to see whether you inadvertently destroyed any HTML
tags is not asking too much. But running help compilers, resizing
dialog boxes, following links across files, or testing the
runtime behavior of files is not translation. If you offer
this, again, this is a special value-added service to be billed
for accordingly.
Computer-assisted translation
(CAT)
In software localization, translation companies
sometimes try to use engineering methods to cut costs, ensure
consistency, and assist the translator, in short: integrate
aspects of computer-assisted translation (CAT).
The most common strategies are the leveraging of target
language terms and phrases from existing translations and
the elimination of duplicates. How the result is presented
to translators depends on the tools used. In the best of cases,
the translator is given something like a table with the source
texts in one column, any material that might help understand
strings and keep them consistent in a second column, and the
translation is then entered in a third column.
Problems with
home-grown CAT
What we are faced with is, however, too often not all that
helpful. The tools used are often created by software engineers
who don’t seem to know or care enough about the mechanics
of language. I recently received a file from a TC that they
had been given by a well-known software manufacturer whose
name you would certainly be familiar with. This file incorporated
every conceivable sin in this department. It was a short file
and should not have taken longer than an hour to do. But the
way the file was prepared made it untranslatable.
For instance: The original sentences often consisted
of fixed and variable parts, as in the following example.
No file containing ", xxx, " will be
deleted.
The objects named ", xxx, " will be
deleted
The process had resulted in a file that contained the
following in alphabetical order
" will be deleted
No file containing "
The object named "
It is easy enough to see that this is a hopeless endeavor
in many languages. In German, for instance, will be deleted
comes out as wird nicht gelöscht and werden
gelöscht, respectively — these chunks of text
are not at all the same, and in fact one is negative and one
is positive! Imagine this happening with fifty or so phrases
where you can’t see which first (prefix) strings go
with what last (suffix) strings. Imagine further — this
was the case here — German words being stuck into the
German text almost arbitrarily from a list. In some cases
the suggestions were the opposite of what the original text
was trying to say.
What was meant as a time and money saver ended up delayed
and costing twice as much as if it had been done correctly
in the first place...
In cases like this one, it is important and in the
best interest of our clients to firmly reject these source
files. There is often no way to salvage them, and if there
is, often enough it takes so much time and guesswork that
we lose money on these files. The only professional solution
is to work with the client and to explain why this “preparation”
is counterproductive. Many of the problems can be eliminated
if translators know how to work in the original source files,
where (hopefully) strings and texts are given completely and
in context. To make your point convincingly, you should know
the underlying formats and be able to work in them.
CAT (including the professional tools) suffers from
one intrinsic shortcoming: it tends to assume that what is
the same in language A must be the same in language B, and
what was the same in one situation must be the same in another
situation. Not so, as you know. These tools, then depend a
lot on whether the project management knows its stuff.
Professional
CAT
I am always a bit afraid that techies use those tools to “translate”
a lot of text, throw everything the machine was unable to
find in a file, and send this residue to the translator, who
depending on the circumstances, may be stuck with a chunk
of text for which he does not know the precedent, or text
where he can’t go back and make necessary changes to
precedent (old text may be bad or contain errors or otherwise
need improvement).
The second problem is that no CAT system can output
better text than is input. All archiving systems — and
that is what CAT essentially is, it archives and classifies
preexisting translations — suffer from the weakness
that mediocrity and mistakes are perpetuated.
The danger is that control over what is translated
how passes from the translator to the client or translation
company in charge of managing projects, that is, people whose
expertise is not necessarily linguistic in nature.
I strongly feel that we as translators should be in
control of our tools; any actual savings may well be passed
on in the form of attractive prices, but we must be sure that
the existence of CAT tools is not simply used as an excuse
for putting pressure on our rates. The tools and the time
needed to learn how to use them are a substantial investment
that has to be recovered, and everything that you or your
clients will expect to get out of a CAT system will first
have to be put in.
CAT is here to stay, and most of us will have to deal
with it in an open and receptive manner to continue to operate
profitably. But the introduction of CAT will be a bone of
contention between clients, translation companies, and independent
translators in the next few years as its potential and limits
are not yet fully understood by all those involved in the
process. It would seem only fair for everyone to share in
the benefits. Certainly CAT can save translators from monotonous
repetition (software translation can be incredibly boring
at times. Press Return to continue!) but it should
be the translator who ultimately decides if what looks like
repetition is in fact repetition. Certainly CAT can help make
the terminology in software localization projects, especially
distributed projects, more consistent, but every now and then
someone has to cut through the wallowing dust and start from
scratch.
Some of these products are more and some are less friendly
to the translator who is actually supposed to feed them. This
is not the place to discuss the pros and cons of the various
CAT systems. I will restrict myself to presenting a few URLs
for some of the more popular ones.
A good translator’s ingenuity will
often be able to overcome the worst effects of poor preparation.
But there are situations — unfortunately too many of
them — in which we as translators simply have to stand
up and state that what is expected is simply not do-able.
We must remember that if we do not do this, then the result
may be disastrous for the manufacturer, who is, after all,
spending a large amount of money on having his product translated
and whose business plans may fail if overseas sales fall behind
expectations for any reason.
What can be particularly frustrating is to discover
an error (and there is hardly a project where you won’t)
only to find that you can’t convey it back up the development
chain where it would have to be fixed (and passed back down
to all the other languages and translators).
The golden rule here, as elsewhere, is: Communicate!
Just about anything can be fixed if everyone who should
know about it does know about it early enough. Of course no
stressed-out project manager will be able to answer a lot
of questions about things that are for you, the translator,
to know. But if we want to be experts in communication, we
need to communicate, and we should do so in a most professional
manner: briefly, to the point, and addressing the designated
contact or other person in charge. But we have to make it
very clear to our contacts that the problems we discover (and
who looks more closely at texts, formatting or even the program
itself than the translator?) are not simply going to go away
by waiting.
Conclusion
Software translation does not have to be
the low-paid drudgery where inexperienced translators fumble
in the dark. It is a field that has many rewards, not the
least of which is that you will come to understand your own
computer, its operating system and application programs much
better, enabling you to make better use of your tools. With
regard to the conditions under which you work, a certain degree
of perseverance is often needed to make translation managers,
software engineers, and others upstream aware of problems,
errors, and the translator’s specific needs. In localization
we are part of a team, maybe more so than in other fields,
and have to show team spirit. But we must make sure that our
voice is heard and that our personal and professional needs
are recognized by others. If we demonstrate the requisite
subject expertise, we will earn the respect of our team partners,
our customers, and our customers’ customers.
Finally, we should never forget our responsibility
to the users of the software we are localizing.