Top Banner
Bo Jin Sr. Solution Architect Fatih Karaoglu Sr. Solution Architect Kofax Transformation Modules Advanced Track & What’s new in KTM Technical Track
99

16.45-17.45 KTM Advanced Technical Track

Apr 14, 2015

Download

Documents

Marc Combes

Kofax Transformation Modules – Advanced Track & What’s new in KTM
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 16.45-17.45 KTM Advanced Technical Track

Bo Jin

Sr. Solution Architect

Fatih Karaoglu

Sr. Solution Architect

Kofax Transformation Modules –

Advanced Track & What’s new in

KTM

Technical Track

Page 2: 16.45-17.45 KTM Advanced Technical Track

Agenda

Clustering Utility

Benchmarking

Separation

Classification

Extraction

Project Merge Tool

Localisation

Thin Client Enhancements

Q&A

2

Technology Enhancements

Productivity

Enhancements

Design Time

Productivity

Enhancements

Users

Page 3: 16.45-17.45 KTM Advanced Technical Track

Clustering Utility

Technology Enhancements

Page 4: 16.45-17.45 KTM Advanced Technical Track

New Utility for Clustering Unknown Documents

What it does

Requirements

Step-by-step

Importing into KTM

4

Page 5: 16.45-17.45 KTM Advanced Technical Track

What does the Kofax Clustering Utility do?

When configuring KTM content classification, the customer needs

to provide samples for each class.

What KTM requires:

5

Page 6: 16.45-17.45 KTM Advanced Technical Track

What does the Kofax Clustering Utility do?

When configuring KTM content classification, the customer needs

to provide samples for each class.

What customers usually provide:

6

Page 7: 16.45-17.45 KTM Advanced Technical Track

What does the Kofax Clustering Utility do?

presorting a document set into clusters of similar documents

User labels some of these clusters

Utility learns from labeling and pre-sorts again

Several iterations of labeling and pre-sorting

Export of sorted documents as learn-set for KTM content

classification

7

Page 8: 16.45-17.45 KTM Advanced Technical Track

What does the Kofax Clustering Utility do?

new KTM project

Customer uses Utility to provide KPSG or partner with sorted

documents

KPSG or partner uses Utility to sort documents from customer

Understanding what are the biggest subsets of documents in a

customer’s monthly mailroom volume

enhancing a KTM project

Customer adds new classes to project and needs samples for

classification

8

Page 9: 16.45-17.45 KTM Advanced Technical Track

Requirements

Kofax Clustering Utility works with XDocuments

XDocuments must be created with KTM OCR Server tool

KTM (5.5 SP2) must be installed to use Clustering Utility.

9

Page 10: 16.45-17.45 KTM Advanced Technical Track

Using the KTM OCR Server reduces the KTM base volume count

Eval licenses supported

Hardware requirements same as for KC/KTM

Files to be clustered should be local for performance

Need write access to file location

Requirements

10

Page 11: 16.45-17.45 KTM Advanced Technical Track

Step by Step – KTM OCR Server

Configuring the KTM OCR Server:

Select path to unsorted images

Enable „Save XDoc files“ and

„Save text files“

Under OCR Settings, select

proper language

Leave rest at default

Running the KTM OCR Server:

Simply press the Start button

11

Page 12: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

1. Import

Point „Import directory“ to same directory of unsorted documents

For each document, an .xdc file and a .txt file must exist

Select „Start Discovery“.

Takes a while, ~0.5 sec per document

Converts XDocs into internal format

Identifies initial clusters

12

Page 13: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

2. Discovery

Label initial 3 clusters

You see the most representative document of each cluster

Provide a name for each cluster, will be used as class name in

KTM

13

Page 14: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

2. Discovery

14

Page 15: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

2. Discovery

You can stop discovery when 80-90% of the documents are

discovered or continue until all documents are discovered

At 80-90% the most common document types are often known,

remaining documents are likely in very small clusters

Click „Review“ to continue to next step

15

Page 16: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

3. Review

Sort by categories (labels)

Examine the categories for consistency

Confirm some documents if you want to cluster again

16

Page 17: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

3. Review

17

Page 18: 16.45-17.45 KTM Advanced Technical Track

Step by Step – Kofax Clustering Utility

4. Export

Select any directory for export

Sub directories will be created for each category/label

.txt files (and tif/xdoc for reference) will be exported, since only .txt

files are used to train KTM content classification later

18

Page 19: 16.45-17.45 KTM Advanced Technical Track

Importing into KTM

In Project Builder, point New Project dialog Content Classifier

settings to exported directory

Select „Discovered documents“ sub directory

19

Page 20: 16.45-17.45 KTM Advanced Technical Track

Importing into KTM

A class is created in Project Builder for each category

Training documents are imported

Select „Train“ in Project Builder main menu

Verify in Classification Benchmark (Result Matrix)

20

Page 21: 16.45-17.45 KTM Advanced Technical Track

Importing into KTM

21

Setting this up manually

and finding/organizing the

proper training documents

takes hours or days.

With the Kofax Clustering

Utility, this example took

20 minutes.

Page 22: 16.45-17.45 KTM Advanced Technical Track

Benchmarking

Productivity Enhancements – Design Time

Page 23: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Benchmarking

Separation Benchmarking

Classification Benchmarking

Extraction Benchmarking

23

Page 24: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Separation Benchmarking

Separation Benchmark

24

Document Separation Test

Page 25: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Separation Benchmarking

Separation Benchmark

25

Separation Benchmark

Golden Batch

Golden Files – Extraction Benchmarking

Page 26: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Separation Benchmarking

Separation Benchmark

How can a Golden Batch be created?

Kofax Capture (before Export Connector)

KTM Project Builder

26

Page 27: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Separation Benchmarking

Separation Benchmark

27

Page 28: 16.45-17.45 KTM Advanced Technical Track

Separation Benchmark – Quality?

KTM 5.5 – Separation Benchmarking

28

Correct Documents

Rejected Documents

Incorrect Documents

Incorrectly classified

Additional splits

Missed splits

But confidently

Document Review...?

The worst of all three categories

False Postive

Page 29: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Classification Benchmarking

Classification Benchmark

29

Page 30: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Classification Benchmarking

Classification Benchmark

30

Page 31: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Classification Benchmarking

Classification Benchmark

31

Page 32: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Classification Benchmarking

Classification Benchmark

32

Page 33: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Classification Benchmarking

Classification Benchmark

33

Page 34: 16.45-17.45 KTM Advanced Technical Track

Extraction Benchmark

KTM 5.5 – Extraction Benchmarking

Slide 34 34

Page 35: 16.45-17.45 KTM Advanced Technical Track

Extraction Benchmark

KTM 5.5- Extraction Benchmarking

EV = Extracted Value GFV = Golden File Value (perfect file)

EV = GFV Work

EV ≠ GFV Work

EV ≠ GFV False positives

EV = GFV Super

Project quality

Project design

Slide 35 35

Page 36: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Extraction Benchmarking

Extraction Benchmark - Comparison

36

Page 37: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Extraction Benchmarking

37

Extraction Benchmark - Enhancements

Selection List

Sorting

By Column Content

By Status

Open in Document Viewer

Re-arrange columns

Page 38: 16.45-17.45 KTM Advanced Technical Track

Project Merge Tool

Productivity Enhancements – Design Time

Page 39: 16.45-17.45 KTM Advanced Technical Track

Multiple Users – One Project

KTM 5.5 – Project Merge Tool

39

Page 40: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Project Master

40

Page 41: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Copy the Project Master for each aditional user

41

Page 42: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Project Master

42

Page 43: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Copy 1

43

Page 44: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Copy 2

44

Page 45: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Merge Copy 1

45

Page 46: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Source and Destination projects

46

Page 47: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Select Classes

47

Page 48: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Summary

48

Page 49: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Save changes to destination project (Project Master)

49

Page 50: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Merge Copy 2

50

Page 51: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Source and Destination projects

51

Page 52: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Select Classes

52

Page 53: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Summary

53

Page 54: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Save changes to destination project (Project Master)

54

Page 55: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Project Master after merging

55

Page 56: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Elements that can be merged...

56

Classes

Fields

Locators

Validation Rules

Script

Localization

Validation Forms

Page 57: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

57

Page 58: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Elements

58

Page 59: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

59

Page 60: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

60

Page 61: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

61

Page 62: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

62

Page 63: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

63

Page 64: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

64

Page 65: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Summary

65

Page 66: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

Save changes

66

Page 67: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Project Merge Tool

The merged project

67

Page 68: 16.45-17.45 KTM Advanced Technical Track

Localisation

Productivity Enhancements – Users

Page 69: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

KTM Languages

English

German

69

Page 70: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Additional KTM Languages

70

# Language Pack Language ID

1 Brazilian pt-BR

2 Chinese zh-CN

3 Czech cs

4 French fr

5 Italian it

6 Japanese ja

7 Polish pl

8 Russian ru

9 Spanish es

10 Swedish sv-SE

Page 71: 16.45-17.45 KTM Advanced Technical Track

Additional KTM Languages

Graphic User Interface

Project Builder and runtime modules

Component based messages

KTM Server

Documentation (runtime modules and Userguide.pdf)

1. Document Review 2. Correction 3. Validation 4. Verification

KTM 5.5 – Localisation

71

Page 72: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Project Settings - Localization

72

Page 73: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Project Settings - Localization

73

Page 74: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

.Net concept

Primary language

Secondary language

74

English en

English (United Kingdom) en-UK

English (United Stated) en-US

Page 75: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Fall back principle

75

Page 76: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Fall back principle

76

Localise

End

Primary – Secondary

language translation?

Yes

No

Primary language

translation?

Use default value for display name Use translation value for display name

Yes

No

Page 77: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

KTM GUI, Server and Active Language

77

Page 78: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

KTM GUI Language, Server and Active Language

78

The Project.ActiveLanguage overrides the Region and Language settings

Page 79: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Summary

KTM Graphic User Interface language

KTM Server language

Project language (Project.ActiveLanguage)

79

-

-

-

Page 80: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

What can be localised?

80

KTM Element Yes/No Note

Fields

Table Columns

Formatting Methods Component messages used

Validation Methods Regular Expression only

Component messages used

Validation Form Tab captions

Field label

Simple label

Button captions

DB button captions

Group captions

Script Resources

Page 81: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Fields

81

Page 82: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Tables

82

Page 83: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Project – Script Resources

83

Page 84: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Project – Script Resources

84

Project.Resources.GetString("Error_Example")

Page 85: 16.45-17.45 KTM Advanced Technical Track

KTM project folder structure

Default language in *.fpr file

Additional languages

KTM 5.5 – Localisation

85

Document Review

Default language

Localised languages

Page 86: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Localisation.xml

External editor

Language ID

Example: Field • Default value • Localised translation

86

Page 87: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

XML Update

87

Page 88: 16.45-17.45 KTM Advanced Technical Track

KTM 5.5 – Localisation

Project design language

88

Page 89: 16.45-17.45 KTM Advanced Technical Track

Thin Client Enhancements

Productivity Enhancements – Users

Page 90: 16.45-17.45 KTM Advanced Technical Track

New and Improved Functionality Inside KTM TC 5.5

Validation Form Layouts

Annotations

Additional Batch Editing Operations

User Settings

Advanced Login Capabilities

Combo-boxes With Descriptions

Combo-boxes Inside Tables

Other “Small” Things

KTM TC 5.5 Improvements

90

Page 91: 16.45-17.45 KTM Advanced Technical Track

Support Validation Form Layouts

Different font types and sizes

Mini-viewers

Custom buttons

Location of fields

Anchoring

Layout localization

KTM TC 5.5 Improvements

91

Page 92: 16.45-17.45 KTM Advanced Technical Track

Support Annotations

Display annotations created by KTM modules

Create new annotations inside Thin Client

Edit annotations

Delete annotations

Move annotations

Hide/Display annotations

KTM TC 5.5 Improvements

92

Page 93: 16.45-17.45 KTM Advanced Technical Track

Additional Batch Editing Operations

Delete pages

Move, merge, delete documents

Move, merge, delete, split, create folders

KTM TC 5.5 Improvements

93

Page 94: 16.45-17.45 KTM Advanced Technical Track

Preserve User Settings

User name at login screen

Batch Open dialog box: size, columns, sorting settings

Panels: size, expanded states

Zoom settings: fit width, fit height, custom zoom

Annotation settings: hide/display annotations

KTM TC 5.5 Improvements

94

Page 95: 16.45-17.45 KTM Advanced Technical Track

Advanced Login Capabilities

Domain login for linked users

Single sign-on support for Active Directory users

KTM TC 5.5 Improvements

95

Page 96: 16.45-17.45 KTM Advanced Technical Track

Combo-boxes Inside Tables, Items With Descriptions

Display descriptions, values or both

Support empty strings consistently for all combo-boxes

Paging control for over 100 items

Type-ahead filtering capabilities

New script events to initialize scripted combo-boxes

KTM TC 5.5 Improvements

96

Page 97: 16.45-17.45 KTM Advanced Technical Track

Other “Small” Things…

Batch loading performance improvements (project caching)

PDF support

Reject/Unreject documents – support scripting on the server

Allow to install Thin Client Server on top of previous version

Propagate user changes in config files to a new version

KTM TC 5.5 Improvements

97

Page 98: 16.45-17.45 KTM Advanced Technical Track

Q&A

Page 99: 16.45-17.45 KTM Advanced Technical Track

Fatih Karaoglu

Sr. Solution Architect

Phone: +41 41 799 82 36

Email: [email protected]

For further information, please contact:

Bo Jin

Sr. Solution Architect

Phone: +41 41 799 82 30

Email: [email protected]