WP4: Translation Memory

Overview

As patents cover a wide range of technological areas, it is important that the PLuTO translation memory (TM) system is domain adaptable. As patent files are already classified under the international patent classification system, a key objective of this work package is to adapt this taxonomy to our TMs.

Additionaly, this package will focus on management of the TM in terms of preparing it for use in the PLuTO system. This involves pre-processing of the data for the database, alignments of multilingual segments and quality control.

Milestones

  • 4.1 Initial English and German TM data available (M6)
  • 4.2 Make French TM data available (M12)
  • 4.3 Make Spanish TM data available (M18)
  • 4.4 Make Russian TM data available (M24)
  • 4.5 Make Swedish TM data available (M30)
  • 4.6 Make Dutch TM data available (M36)

Deliverables

  • 4.1 TM components for 3 languages (M12)
  • 4.2 TM components for 5 languages (M24)
  • 4.3 TM components for all languages (M36)
  • 4.4 Final report on TM data resources (M36) | report