Issue with inflections in glossary and memory
Tópico cartaz: nivaca
nivaca
nivaca
Colômbia
Local time: 23:41
Mar 23, 2015

The glossary and translation memory doesn't work well with inflected languages as Latin. For instance, if in the source I have the word "dicendum" (the gerundive of "dico" ["I say"]), and if I add it to the glossary as it appears, then other inflections of the word will not be recognised by the glossary: "dicitur", "dicamus", etc.

I would be quite useful if OmegaT allowed for certain ways of dealing with inflections in the glossary. One simple way might be the following: use or rege
... See more
The glossary and translation memory doesn't work well with inflected languages as Latin. For instance, if in the source I have the word "dicendum" (the gerundive of "dico" ["I say"]), and if I add it to the glossary as it appears, then other inflections of the word will not be recognised by the glossary: "dicitur", "dicamus", etc.

I would be quite useful if OmegaT allowed for certain ways of dealing with inflections in the glossary. One simple way might be the following: use or regex in glossary entries. E.g.: "dic[endum, atum]".
Is this possible as of today?
Collapse


 
Susan Welsh
Susan Welsh  Identity Verified
Estados Unidos
Local time: 23:41
russo para inglês
+ ...
It does Mar 23, 2015

The tokenizer function in OmegaT does that. Check the users' manual. How well it works for Latin I can't say, but it works for Russian and German.

Susan


 
nivaca
nivaca
Colômbia
Local time: 23:41
CRIADOR(A) DO TÓPICO
Not for Latin. Mar 24, 2015

But there is no tokenizer for Latin, I'm afraid.

 
Didier Briel
Didier Briel  Identity Verified
França
Local time: 05:41
inglês para francês
+ ...
It relies on the Hunspell dictionary Mar 24, 2015

nivaca wrote:

But there is no tokenizer for Latin, I'm afraid.

For languages not covered by Lucene, the tokenizer is provided by Hunspell (you have to install the Hunspell dictionary corresponding to the source language).

I tried and I couldn't get it to work. That might be because the Hunspell dictionary I installed doesn't contain the necessary information, or because it does accept some stemming, but not the one I tried.

I tried these two dictionaries:
http://rpmfind.net/linux/rpm2html/search.php?query=hunspell-la
http://extensions.openoffice.org/en/project/latin-spelling-and-hyphenation-dictionaries

You can find information on Hunspell stemming information here:
http://manpages.ubuntu.com/manpages/dapper/man4/hunspell.4.html

Didier


 
nivaca
nivaca
Colômbia
Local time: 23:41
CRIADOR(A) DO TÓPICO
Worked in Linux Mar 24, 2015

Didier,

Your recommendation of using Hunspell plus Latin dictionary worked fine in Linux. (Xubuntu 14.10). The glossary seems to work correctly now with inflections.

However, it doesn't work for me on Mac OS X. (I installed Hunspell with Brew, and used the very same dictionary.) I supposed there's still some fiddling to do in order to make it work.

Thanks.

Nicolas


 


Não há um moderador designado especificamente para este fórum.
Para reportar violações às regras do site ou para obter ajuda, favor contatar a equipe do site »


Issue with inflections in glossary and memory






Trados Studio 2022 Freelance
The leading translation software used by over 270,000 translators.

Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop and cloud solution, empowering you to work in the most efficient and cost-effective way.

More info »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »