Top Banner
With AlignFactory, you can save incredible amounts of time by automating the entire document alignment process to feed your computer-assisted translation tool. AlignFactory automatically scans your archives to locate source-target document pairs based on their file names. For the software to identify the languages in an alignment pair, the file names must contain a pre-set language marker. Once a valid file pair has been found, AlignFactory will align the files, creating either a LogiTerm XML, HTML or TMX bitext file, based on your preferences. AlignFactory can be synchro- nized with LogiTerm and configured to scan only folders that are linked with your LogiTerm modules. AlignFactory’s many configuration options let you help the software correctly identify file pairs for alignment, even if they have different file name structures. File matching criteria include: Language markers, Filtering by file extension, Exclusion strings – for removing files with names containing certain character strings, Ignore strings – for ignoring certain character strings in file names, Ignore characters after last marker, Match files in different folders, Allow one file name with no language marker. LogiTerm bitexts AlignFactory can create LogiTerm bitexts in XML or HTML format. If you use LogiTerm, we recommend creating XML bitexts, as they are more visually appealing and do not display segment language codes when viewed in a web browser. What’s more, they contain source document metadata and are slightly faster to index than HTML bitexts. HTML bitexts, however, can be opened with any application, with no display issues. They can also be easily indexed by any full text search engine. TMX files AlignFactory can also generate TMX files to import into any translation memory. TMX file creation options are as follows: n Create one TMX file for each pair, or merge into a single file, n Insert name of source document in each segment, n Automatically add attributes (project, client, domain, etc.) to segments. Alignment Editor AlignFactory features an alignment editor tool that lets you make changes to LogiTerm bitexts. It also lets you view and edit TMX files before importing them into a translation memory. AUTOMATED DOCUMENT ALIGNMENT SOFTWARE WITH WEB CRAWLER Main Interface AlignFactory – DOCUMENT ALIGNMENT ADVANTAGES n FULLY AUTOMATED ALIGNMENT PROCESS n AUTOMATIC WEBSITE DOWNLOADING AND ALIGNMENT n NO FILE PREPARATION NEEDED BEFORE ALIGNMENT n XML, HTML AND TMX ALIGNMENT FORMATS n ALIGNMENT PROJECT CREATION n SEGMENT FILTERING n COMPATIBLE WITH OVER 100 FILE FORMATS n ALIGNMENT EDITOR
2

AUTOMATED DOCUMENT ALIGNMENT SOFTWARE WITH …Web Crawler Web Crawler The Web Crawler tool can be used with the AlignFactory alignment engine to import an entire multilingual website

May 21, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: AUTOMATED DOCUMENT ALIGNMENT SOFTWARE WITH …Web Crawler Web Crawler The Web Crawler tool can be used with the AlignFactory alignment engine to import an entire multilingual website

With AlignFactory, you can save incredible amounts of time by automating the entire document alignment process to feed your computer-assisted translation tool.

AlignFactory automatically scans your archives to locate source-target document pairs based on their file names. For the software to identify the languages in an alignment pair, the file names must contain a pre-set language marker. Once a valid file pair has been found, AlignFactory will align the files, creating either a LogiTerm XML, HTML or TMX bitext file, based on your preferences. AlignFactory can be synchro-nized with LogiTerm and configured to scan only folders that are linked with your LogiTerm modules.

AlignFactory’s many configuration options let you help the software correctly identify file pairs for alignment, even if they have different file name structures. File matching criteria include:

■ Language markers, ■ Filtering by file extension, ■ Exclusion strings – for removing files with names containing certain character strings,

■ Ignore strings – for ignoring certain character strings in file names, ■ Ignore characters after last marker, ■ Match files in different folders, ■ Allow one file name with no language marker.

LogiTerm bitexts AlignFactory can create LogiTerm bitexts in XML or HTML format. If you use LogiTerm, we recommend creating XML bitexts, as they are more visually appealing and do not display segment language codes when viewed in a web browser. What’s more, they contain source document metadata and are slightly faster to index than HTML bitexts.

HTML bitexts, however, can be opened with any application, with no display issues. They can also be easily indexed by any full text search engine.

TMX files AlignFactory can also generate TMX files to import into any translation memory. TMX file creation options are as follows:n Create one TMX file for each pair, or merge

into a single file,n Insert name of source document in each

segment,n Automatically add attributes (project, client,

domain, etc.) to segments.

Alignment Editor AlignFactory features an alignment editor tool that lets you make changes to LogiTerm bitexts. It also lets you view and edit TMX files before importing them into a translation memory.

AUTOMATED DOCUMENT ALIGNMENT SOFTWARE WITH WEB CRAWLER

Main Interface

AlignFactory – DOCUMENT ALIGNMENT

ADVANTAGESn FULLY AUTOMATED ALIGNMENT PROCESSn AUTOMATIC WEBSITE DOWNLOADING

AND ALIGNMENTn NO FILE PREPARATION NEEDED BEFORE

ALIGNMENTn XML, HTML AND TMX ALIGNMENT FORMATSn ALIGNMENT PROJECT CREATIONn SEGMENT FILTERINGn COMPATIBLE WITH OVER 100 FILE FORMATSn ALIGNMENT EDITOR

Page 2: AUTOMATED DOCUMENT ALIGNMENT SOFTWARE WITH …Web Crawler Web Crawler The Web Crawler tool can be used with the AlignFactory alignment engine to import an entire multilingual website

A product from Terminotix Inc.2053 Jeanne-d’Arc Avenue, Suite 401, Montréal, Québec, Canada H1W 3Z4T. +1 514 989-9465 I [email protected] I terminotix.com

Follow us on

Terminotix also offers the following products

TECHNICAL REQUIREMENTS ■ 1 GHz processor ■ 512 MB RAM ■ 50 MB disk space ■ Microsoft Windows Vista / 2008 / 2008 R2 / 7 / 2012 / 2012 R2 / 8 / 10 (32-bit and 64-bit)

■ Microsoft .NET Framework version 4.0

Segment filtering AlignFactory features over 18 filtering options that let you get rid of unwanted segments in your alignments automatically. Filters include:

■ Reject if both sides are the same, ■ Reject if segment contains no letters, ■ Reject duplicate segments, ■ Reject if too few words in segment, ■ Reject if one side is significantly longer, ■ Reject if too many sentences in segment.

Alignment Editor

Web Crawler

Web Crawler The Web Crawler tool can be used with the AlignFactory alignment engine to import an entire multilingual website into your translation memory. The Web Crawler automatically downloads pages and files from your chosen website. Once the download is complete, simply create an alignment project to automatically align all the downloaded pages and files. Then, all that’s left to do is import the alignments into a computer-assisted translation tool.

The Web Crawler includes filters to help you select the types of pages and files to download. These include the following domain filters:

■ Ignore top-level domain (TLD), ■ Allow two-letter domain, ■ Do not truncate URL prefix.

You can also filter downloaded files by extension or file name character string. When the download is complete, AlignFactory will generate a project report containing details of the downloaded files and any downloading errors.