Top Banner
A Collaborative Approach to Data Quality at the University at Buffalo 2015 NEAIR 11/3/2015 11:10 AM (Willsboro Room) Jonathan Havey Information Analyst SUNY Buffalo Office of Institutional Analysis
22

A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Apr 18, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

A Collaborative Approach to

Data Quality at the University at Buffalo

2015 NEAIR

11/3/2015

11:10 AM (Willsboro Room)

Jonathan Havey

Information Analyst

SUNY Buffalo Office of Institutional Analysis

Page 2: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Introduction

SUNY Buffalo has launched a Data Quality Initiative (DQI)

originating in its Office of Institutional Analysis

employing a collaborative approach with

business office and technical staff

• No project plan

• No organizational meetings

• No direct mandate from senior administration

Page 3: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

IntroductionOur Approach

1. Find a starting point and dive in

2. Draw upon good working relationships

3. Find a tool to make the DQI process easy to implement

4. Repeat steps 1-3 until done

5. You’re never done

Page 4: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Find a Starting PointRubber Freshmen

We began with a required New York State reporting data element

called “Higher Education History” , in which students are classified as

new or continuing undergraduates among other designations.

We were having to scrub too many of these records before submission

to the state. Many “new” students had cumulative credits earned at

SUNY Buffalo that suggested they were hardly new.

Page 5: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Find a Starting PointDiagnosis?

When we opened up the hood of the

ETL that generates data for state

reporting and analyzed the code

deriving the values for Higher

Education History, we realized that

it was relying heavily on a field

called Admit Term to determine if a

student was new or continuing.

Page 6: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

RelationshipsNot my department

At this point, an IR shop might consider its work done. We identified

incorrect data being fed into code that had been developed, tested,

and approved for state reporting years before.

An email to the Registrar’s Office

might normally be the only next

step in due diligence.

Or a code change to the ETL

could be requested of the technical staff.

Page 7: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

At SUNY Buffalo, the Office of Institutional Analysis,

the Registrar’s Office, and the technical support office

all report through different channels. In addition, most data

updating is distributed throughout the university’s colleges

and departments.

Since the data element Admit Term has NO impact on any

process other than reporting, it is naturally not the highest

priority for other offices.

RelationshipsHerding cats, anyone?

Page 8: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

RelationshipsA better next step

There is a piece of antiquated office equipment I find very useful in

such situations. Using it, I called the Associate Registrar.

Talking about the data issue in depth revealed the nature of the

challenge they faced—and opened the door to a solution.

The next slide demonstrates why this error is so easy to make.

Page 9: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Relationships

Page 10: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

RelationshipsDistributed security

There are nearly 180 users with access to updating the Program/Plan

pages in our system.

Yearly training still doesn’t prevent all the errors.

Since Higher Education History identifies new vs. continuing

students, up-to-date accuracy in this data element is critical from an

enrollment management perspective.

Page 11: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

RelationshipsPossible solutions

Edit checks are one way to solve this problem. In ERP software

packages, each edit check—if not delivered with the product—is a

potentially expensive customization.

Page 12: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

RelationshipsPossible solutions

Cleaning up hundreds of records after the fact during the busy

beginning of term for Census is an unpopular activity in hectic

business offices.

Page 13: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

RelationshipsReal time scrub

In the case of Higher Education History, mass updates near Census

create a dramatic adjustment in new student numbers that is

alarming in certain quarters. In short, retention goes up but yield

goes down when “new” students become reclassified as “continuing”.

What is needed is an as-you-go solution that can also be leveraged for

providing teachable moments to transactional staff.

Page 14: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Tool to the RescueData Integrity Checker

The technical team at UB had previously demo’ed a tool for checking

data integrity to the system support office.

Only three steps required:

Create a query in the system to identify bad data

Designate email recipients

Schedule the job using autosys

Page 15: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Tool to the RescueConfiguration

UB Tech staff developed an easy-to-use interface.

Note that standards of naming were developed in anticipation of the

tool being used in other capacities.

Page 16: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Tool to the RescueNotifications

The email alias members receive messages alerting them to the

presence of data errors.

Page 17: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Tool to the RescueThe Goods

Clicking the link prompts the recipient to log in, after which they are taken to the query output.

The Associate Registrar can then contact the user who performed the transaction and leverage a teachable moment.

Page 18: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Tool to the RescueSome more RTS examples

Students in odd combinations of majors:

Undecided Major and English Major

French Major and Non-matriculated Major

Math Minor only—no major

Minor in the position of first major

While harmless to system functionality, these combinations throw off

our ranking hierarchy when preparing data for state reporting

Page 19: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Tool to the RescueSummary of Updates One Year Out

359

46

78

44

89

6

48

64

25

5

91

76

0 50 100 150 200 250 300 350 400

(MAJORS) DEGREE AND NONDEGREE‐SEEKING

(MAJORS) NO MAJOR AND MAJOR

(MAJORS) MAJOR RANKING INCORRECT

(MAJORS) MINOR COUNTED AS MAJOR

(MAJORS) NO ACTIVE MAJOR

(MAJORS) ACCEPTED/NOT ACCEPTED MAJOR

(FINANCIAL AID) OFFER/ACCEPT/DISBURSE OUT OF SYNCH

(BIODEMO DATA) MISSING GENDER

(BIODEMO DATA) MISSING ETHNICITY

(ADMISSIONS) CONTINUING COUNTED AS NEW

(ADMISSIONS) FALSE NEW UNDERGRAD

(ADMISSIONS) FALSE NEW GRAD

Results of Thirteen Month Data Quality Initiative at SUNY Buffalo (931 Total Errors Fixed)

Page 20: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Repeat Until DoneFuture Plans for DQI

Undergraduate Admissions Data

• New staff in charge of processing presents an opportunity

• Implementation of comprehensive document management

software adds urgency

• High School GPA and hand-entered SAT scores have been

historical challenges in this area

Page 21: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Repeat Until DoneFuture Plans for DQI, continued

Missing Class Section and Instructor Data

• PeopleSoft’s rich dimensionality in this module provides an

opportunity to get a full picture of course section staffing

• Space utilization will also be targeted

• New staff in Scheduling Office provides opportunity for a fresh

look at old processes

Page 22: A Collaborative Approach to Data Quality at the University at Buffalo · 2020-02-11 · The technical team at UB had previously demo’ed a tool for checking data integrity to the

Q&AShare your triumphs, tears, frustrations