Confidential and Proprietary to India Post and Infosys Limited 1
DEPARTMENT OF POSTS
MINISTRY OF COMMUNICATIONS & IT GOVERNMENT OF INDIA
Data Profiler Tool Manual
Submitted by
Infosys Limited
44 Electronics City, Hosur Road
Bangalore – 560100 India

Document Name: DATA PROFILER-Operating Procedure.doc
Version: Rev. 1.0
Document Owner: Swetha (Infosys Data Migration Team)
Date: 3-May-2013
Revision History:
Version | Author | Changes
1.0 | Swetha | Baseline document
2.0 | Selvakumar | Changes to the section “Instructions to install and to execute Data Profiler Tool” and the section “Others”
3.0 | Selvakumar | Added new sections “Link for JRE7 download” and “Instruction to find 32 or 64 bit operating system”
4.0 | Selvakumar / Meenakshi | Made changes to the section “Instructions to install”, added new section “How to execute Data Profiler Tool” and updated Support Contact for North East circle. [Meenakshi]: Added a new section named FAQ
5.0 | Selvakumar / Anindya | Made changes in the following sections: Instructions to Install, How to execute Data Profiler Tool, Detailed report, Support Contact
5.1 | Selvakumar | Made a change in the following section: Link for JRE7 download
Table of Contents
Data profiling
Prerequisites for using the tool
About Tool
Link for JRE7 download
Instruction to find 32 or 64 bit operating system
Instructions to Install
How to execute Data Profiler Tool
Support Contact
Data profiling:
Data profiling is the process of examining the data available in an existing data source (e.g. a database or a file) and collecting statistics and information about that data.
Prerequisites for using the tool:
1. JRE 1.7
2. MS SQL Server (the tool supports versions 2000, 2005 and 2008)
3. Adobe Reader to view the report
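The JRE prerequisite can be checked from a script before installing the tool. The following is a minimal sketch, not part of the tool itself. It assumes the `java` command is on the PATH and relies on the fact that Java 7 reports its version as "1.7.x" in the output of `java -version`. The function names are illustrative. The manual does not say whether newer JRE versions also work, so the sketch checks for 1.7 only.

```python
import re
import subprocess


def parse_java_version(version_output):
    """Return (major, minor) parsed from `java -version` output, or None."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major = int(match.group(1))
    minor = int(match.group(2)) if match.group(2) else 0
    return (major, minor)


def is_jre_1_7(version_output):
    """True when the version banner belongs to Java 7 (reported as 1.7.x)."""
    return parse_java_version(version_output) == (1, 7)


def installed_java_version_output():
    """Run `java -version`; the JVM writes its version banner to stderr."""
    try:
        result = subprocess.run(
            ["java", "-version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            universal_newlines=True,
        )
    except OSError:
        # The java command was not found or could not be started.
        return ""
    return result.stderr + result.stdout


if __name__ == "__main__":
    banner = installed_java_version_output()
    print("JRE 1.7 found" if is_jre_1_7(banner) else "JRE 1.7 not found on PATH")
```

If the check fails, install JRE 1.7 before continuing with the installation steps.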
About Tool:
The Data Profiler Tool creates files that can be used to analyze the quality of data against the data migration rules.
Link for JRE7 download
The link for JRE7 installable is available in the link
Instruction to find 32 or 64 bit operating system
The following instructions are taken from http://windows.microsoft.com/en-us/windows7/32-bit-and-64-bit-Windows-frequently-asked-questions
To find out if your computer is running a 32-bit or 64-bit version of Windows in Windows 7 or Windows Vista, do the following:
1. Open System by clicking the Start button, right-clicking Computer, and then clicking Properties.
2. Under System, you can view the system type.
If your computer is running Windows XP, do the following:
1. Click Start.
2. Right-click My Computer, and then click Properties.
If you don't see "x64 Edition" listed, then you're running the 32-bit version of Windows XP.
If "x64 Edition" is listed under System, you're running the 64-bit version of Windows XP.
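The same information can also be read from a script. The sketch below is an alternative to the manual steps above, not something the tool requires. It relies on the standard Windows environment variables PROCESSOR_ARCHITECTURE and PROCESSOR_ARCHITEW6432 (the latter is set only for a 32-bit process running on 64-bit Windows). The function name is illustrative.

```python
import os


def windows_bitness(env):
    """Return 64 or 32 for a Windows environment mapping, or None if unknown."""
    arch = env.get("PROCESSOR_ARCHITECTURE", "").upper()
    wow64_arch = env.get("PROCESSOR_ARCHITEW6432", "")
    if not arch:
        # The variable is missing, so this is probably not Windows.
        return None
    # A 32-bit process on 64-bit Windows sees arch == "X86" but has
    # PROCESSOR_ARCHITEW6432 set to the real architecture.
    if wow64_arch or arch.endswith("64"):
        return 64
    return 32


if __name__ == "__main__":
    bits = windows_bitness(os.environ)
    if bits is None:
        print("Could not determine the operating system type")
    else:
        print("{}-bit operating system".format(bits))
```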
Instructions to Install
1. Create a folder called DataProfiler.
2. Download the DataProfilingTool.zip file from the site given by DoP and save the .zip file to the DataProfiler folder.
3. Extract the zip file and put its contents in the DataProfiler folder created in step 1.
4. Open the userdpt.reg file present in the DataProfiler folder, replace (local) in the line "Server"="(local)" with the name of the MS SQL Server holding the Sanchay Post data, and save the file. For example, if the server name is SQLSERVER, then that line has to be
"Server"="SQLSERVER"
How to execute Data Profiler Tool
Preconditions:
Before executing the Data Profiler Tool (DPT), the following needs to be done.
a) DBA discrepancies should be selected before the DPT reports are generated, since a few discrepancies are updated automatically.
b) The DPT should be run after the day end process, when the SQL server is in idle state, as it is a time consuming process.
Ensure that the day end process is completed and MS SQL Server is in idle state, and then follow these steps:
1. Double-click dpt.bat in the DataProfiler folder created in the “Instructions to Install” section. The following screen will be displayed.
5. Enter the sa password and press Enter.
a. The password is the sa password used for connecting to MS SQL Server.
6. Once the connection to MS SQL Server is established with the given password, the following screen will be displayed. Wait for further details to appear on the screen.
7. Once the execution is completed, the following screen will be displayed. Press the Enter key.
Note:
Execution takes a minimum of 1 to 5 minutes, depending on the volume of data. Please wait until you see the message “Press any key to continue”.
Refer to the log file for details of any errors.
8. Every time the tool is executed, the output files are generated in the DataProfiler folder (created in step 1 of “Instructions to Install”) under a new name in current date time format (for example, 20130503113819 if the tool is run on 3rd May 2013 at 11:38:19).
9. All files except the overall summary report will be in PDF format.
10. Open the folder to view the generated output files. The picture below is a screenshot of the generated files.
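The date time name in step 8 can be decoded with a short script, for instance to find the most recent run. This is a sketch only. It assumes the name is always 14 digits in year, month, day, hour, minute, second order, which is inferred from the single example 20130503113819. The function names are illustrative.

```python
from datetime import datetime

# Assumed layout, inferred from the example 20130503113819.
RUN_NAME_FORMAT = "%Y%m%d%H%M%S"


def parse_run_timestamp(name):
    """Convert a 14-digit run name to a datetime; raise ValueError otherwise."""
    if len(name) != 14 or not name.isdigit():
        raise ValueError("not a run name: {!r}".format(name))
    return datetime.strptime(name, RUN_NAME_FORMAT)


def latest_run(names):
    """Return the most recent run name among names, ignoring other entries."""
    runs = []
    for name in names:
        try:
            runs.append((parse_run_timestamp(name), name))
        except ValueError:
            continue  # e.g. dpt.bat or userdpt.reg
    return max(runs)[1] if runs else None


if __name__ == "__main__":
    print(parse_run_timestamp("20130503113819"))  # 2013-05-03 11:38:19
```

To use it on a real installation, pass the entries of the DataProfiler folder (for example from `os.listdir`) to `latest_run`.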
Reports
There are two types of report:
1. Summary report
2. Detailed report
Summary report
The summary report, named “Overall Summary Report <date time>_dcy.csv”, contains the scheme-wise discrepancy count for each rule along with its description, as shown below.
Product Name | Rule name | Rule Description | Record Count
SB | SB_Accounts_Minor_status_as_Y_but_no_DOB_Count | Count of Accounts with Minor status is Y but no DOB | 4
SB | SB_Ledger_Entries_without_index_entries_Count | Count of Ledger Entries that has no entries in Index table | 1
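Because the summary report is a CSV file, the discrepancy counts can be totalled per product with a short script. The sketch below assumes the file is comma separated and that its first row carries the four column names shown above. The actual file layout is not confirmed by this manual. The function name is illustrative, and the sample rows are the two rows shown above.

```python
import csv
import io


def discrepancy_totals(csv_text):
    """Sum the Record Count column per Product Name."""
    reader = csv.DictReader(io.StringIO(csv_text))
    totals = {}
    for row in reader:
        product = row["Product Name"].strip()
        totals[product] = totals.get(product, 0) + int(row["Record Count"])
    return totals


# Sample content built from the two rows shown in the manual.
SAMPLE = (
    "Product Name,Rule name,Rule Description,Record Count\n"
    "SB,SB_Accounts_Minor_status_as_Y_but_no_DOB_Count,"
    "Count of Accounts with Minor status is Y but no DOB,4\n"
    "SB,SB_Ledger_Entries_without_index_entries_Count,"
    "Count of Ledger Entries that has no entries in Index table,1\n"
)

if __name__ == "__main__":
    print(discrepancy_totals(SAMPLE))  # {'SB': 5}
```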