Top Banner
Data Extraction, Visualization and Processing Minor Project Presentation Santosh Ghimire – 066 BCT 533 Subit Raj Pokharel – 066 BCT 538 Sudip Kafle – 066 BCT 539 1
18

Data Extraction, Visualization and Processing with application to census and election of Nepal

Jan 15, 2015

Download

Documents

Sudip Kafle

Minor Project Presentation of Bachelor's in Computer Engineering 3rd year at IOE, Pulchowk Campus.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Data Extraction, Visualization and Processing with application to census and election of Nepal

Data Extraction, Visualization and Processing

Minor Project Presentation

Santosh Ghimire – 066 BCT 533Subit Raj Pokharel – 066 BCT 538Sudip Kafle – 066 BCT 539

Page 2: Data Extraction, Visualization and Processing with application to census and election of Nepal

2

Data Extraction

Visualization

Processing

Page 3: Data Extraction, Visualization and Processing with application to census and election of Nepal

3

System Block Diagram

Page 4: Data Extraction, Visualization and Processing with application to census and election of Nepal

Data Extraction

Page 5: Data Extraction, Visualization and Processing with application to census and election of Nepal

Different Set of Data available in Different file Formats

1. District Population Based on Ethnicity

1. Election Data2. GIS Data for

Coordinate of districts

1. Data for District Level Indicators

Page 6: Data Extraction, Visualization and Processing with application to census and election of Nepal

Database

Extraction

Parser extracts data from file and saves to database

Page 7: Data Extraction, Visualization and Processing with application to census and election of Nepal

7

Parsing

XML file has Tree-node structure› Required data present in between opening and

closing tags PDF has no standard format for storing

data› The file first converted to simple text

HTML has DOM structure.› Data may not be structurally represented

unlike XML. Data extracted using Regular expressions : PDF

and HTML

Page 8: Data Extraction, Visualization and Processing with application to census and election of Nepal

8

Data Management

Admin needs to login for Data Management

Can Add, Update and Delete Data by searching based on various criteria

Only admin can register new admin

Page 9: Data Extraction, Visualization and Processing with application to census and election of Nepal

Data Visualization

Page 10: Data Extraction, Visualization and Processing with application to census and election of Nepal

10

Population in NepalDistrict Population

Achham 228990Arghakhanchi202462Baglung 268240Baitadi 234002

Page 11: Data Extraction, Visualization and Processing with application to census and election of Nepal

11

Visualization on Map

Used Google Map API JavaScript used at Client side jQuery and JSON used to implement AJAX

Map shown on Web Page

Web Server

User sets new criteria for Map

Server Acknowledge request and sends map data in JSON format

New Map

Page 12: Data Extraction, Visualization and Processing with application to census and election of Nepal

12

Visualization with Tag Cloud

Shows overview of scattering of data. One dimension represented by Text

displayed (e.g. Name of district) Other dimension by weight(Font size and

Color) of Text › Implemented using CSS.

Weight of Tags statistically calculated based on population.

Page 13: Data Extraction, Visualization and Processing with application to census and election of Nepal

Data Processing

Page 14: Data Extraction, Visualization and Processing with application to census and election of Nepal

14

State No. 1State No. 2

State No. 3State No. 5 State No. 4

State No. 6

State No. 7State No. 8

Page 15: Data Extraction, Visualization and Processing with application to census and election of Nepal

15

Analyzing Feasibility of Federal States

Districts can be selected to form new state.

Aggregate data for each state obtained from database. › Data can be

Top Caste, Top Parties in election, development index

Coefficient of Variation used to see if it is feasible

Page 16: Data Extraction, Visualization and Processing with application to census and election of Nepal

16

Facts Finder

Informative facts extracted from raw data in Database

User allowed to choose from multiple criteria

Nested SQL queries used

Page 17: Data Extraction, Visualization and Processing with application to census and election of Nepal

17

Methodology

Programming Languages› C# with ASP .Net, JavaScript, jQuery

MS-SQL Server 2008 as Database Engine

Web Technologies› JSON, AJAX

Google Map API

Page 18: Data Extraction, Visualization and Processing with application to census and election of Nepal

18

Project Management

Each phase divided into small chunks.› Assigned to team members.

Online Repository created on BitBucket.org› Using Mercurial based TortoiseHg› Works synchronized among each member

Weekly discussion with Senior Developer at YIPL Nepal.