LibreOffice plugin to pipe whole Writer documents through Google Translate, that ought to keep most of the page formatting.

βŒˆβŒ‹ βŽ‡ branch:  PageTranslate

Hurried fixes for --merriamwebster and --synonyms.com
mario authored 4 days ago last checkin b0666af4c
πŸ“‚ META-INF Introduce crude help file for config options.β€Ήβ€Ί 467 days ago
πŸ“‚ dingonyms added related/subproject: dingonymsβ€Ήβ€Ί 205 days ago
πŸ“‚ help Add errors, update help treeβ€Ήβ€Ί 224 days ago
πŸ“‚ icons resized, lighter blue tonesβ€Ήβ€Ί 500 days ago
πŸ“‚ off Add more languages ('le continental submenus)β€Ήβ€Ί 229 days ago
πŸ“‚ pkg-desc move license text to pkg-desc/β€Ήβ€Ί 485 days ago
πŸ“‚ pythonpath Disable ToggleDropdown, update dependenciesβ€Ήβ€Ί 110 days ago
πŸ“„ Addons.xcu Disable ToggleDropdown, update dependenciesβ€Ήβ€Ί 110 days ago
πŸ“„ Makefile Move MessageBox() to unocompat (not actually used anymore, doesn't woβ€Ήβ€Ί 102 days ago
πŸ“„ NEWS Notes about ArgosTranslate, CLI alternativeβ€Ήβ€Ί 102 days ago
πŸ“„ OptionsDialog.xcu AOO compat. OptionsDialog XCU requires leaf name to equal handler ID.β€Ήβ€Ί 479 days ago
πŸ“„ OptionsDialog.xdl Add ArgosTranslate (offline/python-based backend), albeit it probablyβ€Ήβ€Ί 106 days ago
πŸ“„ OptionsSchema.xcs Introduce 🏴 button configuration to change default language/behaviourβ€Ήβ€Ί 128 days ago
πŸ“„ description.xml Add Systran backend (untested).β€Ήβ€Ί 109 days ago
πŸ“„ pagetranslate.py Move MessageBox() to unocompat (not actually used anymore, doesn't woβ€Ήβ€Ί 102 days ago

PageTranslate

OpenOffice/LibreOffice extension that translates whole documents, whilst retaining some formatting.

  • Iterates over paragraphs / text segments / and tables (only linebreaks kept)
  • Utilizes Google translate (or DeepL, Microsoft Translate, MyMemory, PONS, or Yandex/Qcri/Linguee).
  • And is therefore rather slow.
    (Writer freezes during process for long documents!)
  • Still allows for text-selection mode translation (from original extension).

Install

Download the package file, and import it in LibreOffice via Tools β†’ Extension Manager β†’ Add.

Location Features Link
Source repository Newest package, Linux-only package, no python dependencies pagetranslate.oxt
LibreOffice Extensions Older/stable versions, bundles python requests library https://extensions.libreoffice.org/extensions/pagetranslate
OpenOffice Extensions Full releases, AOO/LO/Windows compatible (bundles deep-translator+requests) https://extensions.openoffice.org/en/project/pagetranslate

For using the unbundled/development package, you should install Python dependencies on your system:
pip3 install -U requests translate deep-translator six

Usage

  • Hit the Tβ†’πŸ‡¬πŸ‡§ toolbar button to get entire document translated to English. Have some trust and patience for longer documents.
  • The second button 🏴 translates to the local/system language.
  • Or select a paragraph, to get a partial translation.
  • Alternatively pick a menu entry from Tools > PageTranslate.
  • The β†’System lang entry refers to your OpenOffice/Desktop setting.
  • Whereas β†’ Paragraph locale respects the text language as set in the Writer status bar.
  • A manual selection dialog is brought up by the From ➜ To πŸ—Ί menu entry. (Best used in MyMemory mode)
  • Configure options in Toolsβ†’PageTranslateβ†’Options...

Caveats

  • Generally you want to download the extension package from AOO. Those are usually full releases (2 to 20MB). The smaller package only makes sense for Linux/LibreOffice installations, because then you can automatically utilize local Python modules (pip install requests deep-translator).
  • The plugin is also likely to crash (won't affect Writer) when encountering complex documents (embedded drawings, Calc sheets, OLE elements, etc). It's only been tested with flow text and text tables.
  • Generally there's a log file in /tmp.
  • OpenOffice support is somewhat preliminary still (albeit it's mostly Python 2.7 fallbacks and XCU variations).
  • For direct content.xml translations, there's an inconvenient tool in off/. (Each office document .odt/.odc is a zip file. Extract the content.xml, apply the script, then repackage the content.xml into the .odc)

ToDo

  • Unify internal API lang=/source= handling, extract .skip() check, and outsource some of the common processing to main.
  • Integrate deep-translator backends (Yandex, QCRI, Pons, Linguee) and language mapping. Perhaps offer as alternative to T-P and internal variants.
  • Finalize DeepL web translation mode (for testing really). The API mode is untested, but likely faster because longer paragraphs can be translated at once. And reportedly it yields smoother/more natural translations. (No XML block translation yet.)
  • Unfortunately not sure yet how to bundle requests.
  • Nor how to craft an options/settings page.
  • Support for TextFrames
  • Figure out if getText() provides for some XML serialization, or how to traverse Paragraphs piece by piece (super slow mode: retain more formatting).
  • Does not yet support anything but Writer documents. (Structure is entirely distinct for Draw/Impress).
  • Not sure if the toolbar icons work at all, or if that's a bug in the Linux version of LibreOffice.