Transliterating Text in OpenOffice.org

Dmitri Popov

Productivity Sauce

Dec 19, 2008 GMT
Dmitri Popov

Here is a problem: when working with an OpenOffice.org document you need to insert a few lines of text in a language that uses a non-Latin alphabet, for example Cyrillic. Of course, you can enable the Cyrillic support on your system, but it's overkill if you only need to write just a few sentences every now and then. Instead, you can use the following OpenOffice.org Basic macro to convert a selected text fragment written in the Latin alphabet into its Cyrillic version; for example "martyshka" -> "мартышка", "ogurec" -> "oгурец", and so on (this process is known as transliteration):

Sub Translit()
Dim ThisDoc As Object
Dim SearchObj As Object, nTimes As Long
Dim Args1()

ThisDoc=ThisComponent

TextSnippet=ThisDoc.getCurrentController().getSelection().getByIndex(0).getString()
If TextSnippet="" then MsgBox ("Please select a text fragment to convert!", 48, "Attention!") : End

ThisDoc=StarDesktop.loadComponentFromURL("private:factory/swriter", "_blank", 0, Args1())
ThisDoc=ThisComponent
ThisText=ThisDoc.Text
TheCursor=ThisText.createTextCursor
TheCursor.String=TextSnippet
 
Latin=Array("YA", "YA", "YA", "EH", "CH", "ZH", "SH", "SJ",_
"Ya", "Yu", "Yo", "Eh", "Ch", "Zh", "Sh", "Sj", "A","B", "V", "G", "D", "E", "Z", "I", "J", "K", "L", "M",_
"N", "P", "R", "S", "T", "U", "F", "H", "C", "``", "Y", "''",_
"ya", "yu", "yo", "eh", "ch", "zh", "sh", "sj", "a","b", "v", "g", "d", "e", "z", "i", "j", "k", "l", "m",_
"n", "p", "r", "s", "t", "u", "f", "h", "c", "`", "y", "'", "|")
Cyrillic = Array("Я", "Ю", "Ё", "Э", "Ч", "Ж", "Ш", "Щ",_
"Я", "Ю", "Ё", "Э", "Ч", "Ж", "Ш", "Щ", "А", "Б", "В", "Г", "Д", "Е", "З", "И", "Й", "К", "Л", "М",_
"Н", "П", "Р", "С", "Т", "У", "Ф", "Х", "Ц", "Ъ", "Ы", "Ь",_
"я", "ю", "ё", "э", "ч", "ж", "ш",  "щ", "а", "б", "в", "г", "д", "е", "з", "и", "й", "к", "л", "м",_
"н", "п", "р", "с", "т", "у", "ф", "х", "ц", "ъ", "ы", "ь", "")

SearchObj=ThisDoc.createReplaceDescriptor
 For I = lBound(Latin()) to uBound(Latin())
   With SearchObj
    .SearchString=Latin(I)
    .ReplaceString=Cyrillic(I)
    .SearchWords=false
    .SearchCaseSensitive=true
   End With
   nTimes = ThisDoc.replaceAll(SearchObj)
  Next I
End Sub

The way the macro works is rather simple. First the macro obtains the currently selected text and creates a blank Writer document. It then goes through the selected text snippet and replaces Latin letters and letter combinations with their Cyrillic equivalents specified in the Latin and Cyrillic arrays. This macro converts text from Latin to Cyrillic, but you can easily modify it to work with other alphabets.

comments powered by Disqus
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

Learn More

News