Printer Friendly
The Free Library
14,718,400 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Good data in, good data out: IHEs put in systems to capture quality data, and keep those records clean.


When database records indicated that 200 students had signed up to play on the 2004 football team at DePauw University DePauw University, at Greencastle, Ind.; coeducational; United Methodist; est. 1832, chartered 1837. The school opened in 1838 as Indiana Asbury College, and in 1884 the present name was adopted.  (Ind.), Administrative Upgrade Project Director Daniel Pfeifer realized there was either something seriously wrong with how data was being handled, or the university would be ordering a lot more uniforms.

The difficulty, it turned out, was with a spiffy spiffy - /spi'fee/ 1. Said of programs having a pretty, clever, or exceptionally well-designed interface. "Have you seen the spiffy X version of empire yet?" This was common mainstream slang during the 1940s.

2.
 new database for the Athletic department that was designed to simplify sports registrations and gym facilities access. Instead, it ended up duplicating student records in the larger system at a rate of 10 to 15 per day.

By the time Pfeifer realized what was happening, hundreds of duplicate records were in the university's general database, with some students sporting three or four records.

Although the records themselves could be untangled manually, DePauw also had data coming in about potential candidates from numerous other sources, such as high school visits, PSAT PSAT Preliminary Scholastic Aptitude Test
PSAT Puget Sound Action Team
PSAT Particulate Source Apportionment Technology
PSAT Predicted Site Acquisition Table
PSAT Princeton South Asian Theatrics
PSAT Pacific Situation Assessment Team (DoD) 
 and SAT scores, and a university marketing company. Having a system that allowed duplicate records would become a nightmare, and quickly.

"It's taken us about two years to really figure out how to stop duplicates, and make sure we're inputting quality data in the first place," says Pfeifer. "Now that we have it down, it's a huge help in knowing our numbers are right."

DePauw is far from alone in its quest to manage multiple databases, a task that's challenging for any sized institution of higher education higher education

Study beyond the level of secondary education. Institutions of higher education include not only colleges and universities but also professional schools in such fields as law, theology, medicine, business, music, and art.
. Data about prospective students mingles with alumni records, parent information, fundraising databases, and even financial data. Keeping it all straight takes effort and communication, but as many IHEs have discovered, the task is crucial for keeping costs in check and boosting data accuracy.

Information Overload A symptom of the high-tech age, which is too much information for one human being to absorb in an expanding world of people and technology. It comes from all sources including TV, newspapers, magazines as well as wanted and unwanted regular mail, e-mail and faxes.  

As nearly anyone associated with a college or university knows, databases abound in an academic environment. Not only are student records rife with grade reports and tuition statements, but schools have data on everything from how many desks are in a certain classroom to which donors have contributed in the past year.

The result for some institutions is an abundance of databases that contain similar information. The Bursar bur·sar  
n.
An official in charge of funds, as at a college or university; a treasurer.



[Middle English burser, from Medieval Latin burs
 office might have student records online, but so does the Admissions office, as well as individual undergraduate or graduate program offices, and even Food Services food services Hospital services A 24/7 department in a hospital that provides for the nutritional needs of inpatients–eg, those needing special diets, preparing meals and transporting them to the floor and, through the cafeteria, the hospital staff and .

As these databases are created and expanded, eliminating duplicates can be costly work, sometimes requiring days of manual deletion. If the databases are used to send out catalogs or other mail, bad addresses increase the university's costs.

"It's a huge issue for any institution," says Nancy Krogh, registrar at the University of Idaho The university was formed by the territorial legislature of Idaho on January 30, 1889, and opened its doors on October 3, 1892 with an initial class of 40 students. The first graduating class in 1896 contained two men and two women. . "Having multiple databases has become a bigger and bigger problem over the past few years, as more information is being input and used to make decisions on both the departmental and university-wide level."

Unifying Resources

Although IHEs may have data pulled from a number of sources, many administrators have found that centralizing the information is key to managing it.

At DePauw, university staff spent four years integrating data into a central repository, known as Client Information Service, which also serves as a university-wide address book. In addition to records for the school's 2,400 current students and their parents are records for about 60,000 prospective students and their parents, plus around 80,000 alumni and donor records.

Prior to unifying the information, the university had a legacy system where prospectives, current students, and alumni were stored in separate systems. But record overlap became a significant problem, says Pfeifer. Not only did some prospective students enroll, subsequently appearing in two databases, but alumni tended to send their children to the school as well--meaning a parent could conceivably land in all three databases.

To manage the data and eliminate duplicates, DePauw's IT department wrote programs that crawl the database every day, searching for matches based on different fields like address or phone number. Also created were relationship algorithms that evaluate connections between records. For example, a student's parents are checked separately for errors but also evaluated against each other, to see if they live at the same address or are still married.

"In a sense, we have to triangulate See triangulation.  people, relationships, and their location," says Pfeifer. "Because we started from scratch, we were able to design a system specifically for university communication and the typical relationships that are meaningful in a university context."

Although in-house applications like DePauw's can be developed, there are also many "cleaning solutions" on the market, usually in the form of a service. At Yale University Yale University, at New Haven, Conn.; coeducational. Chartered as a collegiate school for men in 1701 largely as a result of the efforts of James Pierpont, it opened at Killingworth (now Clinton) in 1702, moved (1707) to Saybrook (now Old Saybrook), and in 1716 was , for example, records are sent out to AlumniFinder, which puts together address and other contact information from multiple sources.

Centralized databases are also easier to lock down. As IHEs like the University of Connecticut The University of Connecticut is the State of Connecticut's land-grant university. It was founded in 1881 and serves more than 27,000 students on its six campuses, including more than 9,000 graduate students in multiple programs.

UConn's main campus is in Storrs, Connecticut.
, Boston College Boston College, main campus at Chestnut Hill, Mass.; coeducational; Jesuit; est. and opened 1863. Actually a university, the school's Chestnut Hill campus comprises colleges of arts and sciences and business administration, the graduate school, and schools of nursing , Stanford University Stanford University, at Stanford, Calif.; coeducational; chartered 1885, opened 1891 as Leland Stanford Junior Univ. (still the legal name). The original campus was designed by Frederick Law Olmsted. David Starr Jordan was its first president.  (Calif.), and Tufts University Tufts University, main campus at Medford, Mass.; coeducational; chartered 1852 by Universalists as a college for men. It became a university in 1955. Jackson College, formerly a coordinate undergraduate college for women, merged with the College of Liberal Arts in  (Mass.) have found, data stores are attractive targets for hackers that can sell the information on the digital black market. Putting data in an extremely secure environment and letting users pull only selected records can help prevent breaches.

Because a database wouldn't be spread out on multiple servers, it would be easier in many cases for IT staff to create firewalls or limit access, experts say. Even physical access to database servers could be strengthened if the servers are brought together within the same data center.

Data Input

Centralizing databases into one comprehensive warehouse is crucial--but also a challenge in making sure that what's being input is quality data worth saving in the first place. In a classic example of "garbage in, garbage out (humour) Garbage In, Garbage Out - (GIGO) /gi:'goh/ Wilf Hey's maxim expressing the fact that computers, unlike humans, will unquestioningly process nonsensical input data and produce nonsensical output. ," duplicates and bad information can be created not just by systems clashing with each other, but by questionable data entry.

Often, users themselves input their own information incorrectly, Pfeifer notes, or hit Submit too many times on a web form, essentially duplicating themselves in the system. "Whenever we allow clients to enter their own information, we get duplicate records," he says. "People put in their ZIP codes wrong, or transpose trans·pose
v.
To transfer one tissue, organ, or part to the place of another.
 letters, or sometimes even spell their names wrong if they're typing too fast."

Lack of proper data ownership can also present difficulties, says Krogh of Idaho. She has seen departments attempting to create "shadow databases," pulling information from a central database so that records can be manipulated on a desktop and reports generated from it. The problem with such a tactic is that data is often changed on an individual's computer. If the record gets put back in the system, it doesn't match what's in the central storehouse.

"To keep data clean, you have to establish a data owner, almost a custodian of data," Krogh notes. "These are the people that control the data, who can create new fields or enter new data."

Although such a severe clampdown clamp·down  
n.
An imposing of restrictions or controls: "Advertisers and broadcasters would raise howls of protest against any strong clampdown" Wall Street Journal.
 on who gets to change data is likely to take effort, especially at an institution used to having every department able to do input, the tactic can reduce the need for intense strategies to reduce duplication.

At Yale, 10 people are in charge of updating contact information, tracking down addresses, and data input, says Angelyn Singer, manager of alumni records. Keeping changes confined to just those staff members keeps Yale's formidable database as clean as it can be. Considering that the staff makes more than 5,000 address changes per month, having that information changed by other departments would create a sizeable problem with knowing what's accurate and what isn't.

"It's an incredible challenge trying to keep up with where people are," says Singer. "It can take a very concerted effort sometimes." The department has even written to relatives of alumni to try and get current information and make sure that the database is correct.

The effort is worthwhile, Singer adds. "Clean data is so important, and that's why you need multiple strategies for making sure that what you put in is accurate," she points out.

Team Effort

In addition to having technological systems set up properly, IHEs must also navigate the much murkier waters of interdepartmental in·ter·de·part·men·tal  
adj.
Involving or representing different departments, as of a business, an academic institution, or a government: "the petty interdepartmental squabbling that surrounds the making of . . .
 communication if they want databases to be managed expertly.

Many databases sprout from within each university department in order to keep records straight, and often these data storehouses are either simple Microsoft Excel (tool) Microsoft Excel - A spreadsheet program from Microsoft, part of their Microsoft Office suite of productivity tools for Microsoft Windows and Macintosh. Excel is probably the most widely used spreadsheet in the world.

Latest version: Excel 97, as of 1997-01-14.
 spreadsheets with loads of data stuffed into them, or homegrown applications that address a specific department's needs. Integrating these various databases into a centralized framework will go a long way toward reducing duplication and assuring quality data, but such a switch can also spark frustration and even downright resentment from those who believe they should have more control over the databases they used to manage. Administrators who don't put at least some thought into preventing hurt feelings could have a harder time making database management into a group effort.

"Today's information systems are integrated," says Dwight Fischer, CIO CIO: see American Federation of Labor and Congress of Industrial Organizations.


(Chief Information Officer) The executive officer in charge of information processing in an organization.
 of Plymouth State University Plymouth State University is a coeducational, residential university with an enrollment of approximately 4,192 undergraduate students and 1,072 graduate students. The school was founded as Plymouth Normal School in 1871.  (N.H.). "They require teams to work integrated with one database, one set of standards, one means of making changes. The teams that are most successful are those that come to realize if they work together they can deliver better services. On the other hand, teams that continue to focus more on their individual offices rather than seamless services tend to have far more organizational friction."

To help ease user grumbling, training is important--and not utilized often enough at schools, believes Tim Cooper, vice president of Sales for Higher Education Technology at Oracle. "Getting everybody to work together to see data as one source represents a paradigm shift A dramatic change in methodology or practice. It often refers to a major change in thinking and planning, which ultimately changes the way projects are implemented. For example, accessing applications and data from the Web instead of from local servers is a paradigm shift. See paradigm. ," Cooper says. "At many universities, there are too many people who want to control the data." In Cooper's view, IHEs could create more goodwill among users by ramping up education and training efforts, rather than just spending most of their budget money on technology itself. "If users understand their role, and how to get data back out of the system, they won't be frustrated," he says.

While conflict is being smoothed at the user level, IHEs should also be contemplating ways to keep data management policies strong for the future. A systems user group was created at Plymouth State to represent each department with a stake in how the central database is utilized. Included are the Registrar, Bursar, Financial Aid, Admissions, Graduate School, and IT offices. This steering committee steer·ing committee
n.
A committee that sets agendas and schedules of business, as for a legislative body or other assemblage.


steering committee
Noun
 creates data standards, and policies about who can extract data and what long-term goals Long-term goals

Financial goals expected to be accomplished in five years or longer.
 need to be achieved.

"When you implement a new, integrated information system, you agree on standards, on data and business practices, and you make that the context within which everyone works," notes Fischer. "When everyone reads from the same sheet music, the harmony is wonderful."

In addition to the larger, executive-level group, a subgroup can be established that deals with database particulars, says Krogh of the University of Idaho. "Rather than build on the fly, you should have a team working together to think about what fields are needed, or which categories are just one-time use," she says. "If you have a mishmash mish·mash  
n.
A collection or mixture of unrelated things; a hodgepodge.



[Middle English misse-masche, probably reduplication of mash, soft mixture; see mash.
 of fields that people created because they needed them once, you can't see any kind of longitudinal trends or build a comprehensive data store."

Outside Help

Software, whether it's purchased or created in-house, can help to streamline database management, but some IHEs have decided to go with managed services An umbrella term for third-party monitoring and maintaining of computers, networks and software. The actual equipment may be inhouse or at the third-party's facilities, but the "managed" implies an ongoing effort; for example, making sure the equipment is running at a certain quality  as well. Benefits to outsourcing this type of work can include more frequent de-duplication of records, access to multiple information storehouses, and manual data entry that's done by the service rather than by university staff.

An entire database doesn't have to be taken off campus just to get some of the perks that come with managed services, though. For example, Melissa Data Melissa Data Corp.
Melissa Data Corp. – based in Rancho Santa Margarita, CA – was founded in 1985 by Raymond Melissa, a computer industry veteran. The firm provides address and phone verification, postal encoding and data enhancement services, with an emphasis on
 offers multiple options for cleaning up existing databases through the use of postal service postal service, arrangements made by a government for the transmission of letters, packages, and periodicals, and for related services. Early courier systems for government use were organized in the Persian Empire under Cyrus, in the Roman Empire, and in medieval  databases, de-duplication programs, and telephone records.

IHEs can spend anywhere from $1,000 for a one-time duplicate check to more than $10,000 for regular cleaning, notes Jack Schember, the company's marketing manager. "There are many tiers for these type of services. Universities can take advantage of address correction features without spending much money."

Some also appreciate the consulting expertise that comes with buying managed services, notes Fred Sift, CIO at the University of Cincinnati The University of Cincinnati is a coeducational public research university in Cincinnati, Ohio. Ranked as one of America’s top 25 public research universities and in the top 50 of all American research universities,[2] . When his school blended several legacy systems, Sift turned to dbaDIRECT, so staffers could focus on infrastructure issues rather than spend much of their time on data cleansing See address cleansing and data hygiene.  and other routine tasks.

"Many times, universities don't have time to aggregate data and think about performance issues," says John Bostick of dba-DIRECT. "There are a number of layers, and they might be more interested in expanding their tactical skills instead of focusing on the whole data warehousing aspect of the task."

Whether service firms are used heavily or only occasionally, the input from an "outside" source can be helpful for an IHE IHE Integrating the Healthcare Enterprise
IHE Institutions of Higher Education
IHE International Institute for Infrastructural, Hydraulic and Environmental Engineering (historical acronym only, replaced by: IHE Delft, the Foundation) 
 trying to manage all aspects of a major database, Sift notes.

"With something as complex as bringing together systems like SAP and Oracle, it's helpful to have assistance in handling that transition," he says. "You need to have a strong team at the university, but you should also have an expert in more broadranging database issues. When everyone can get together for the effort, you'll be much more successful with implementing a database that works for everybody."

Getting Financials Straight

In addition to managing databases containing multiple layers of student, parent, alumni, and donor records, institutions of higher ed must also keep their financial databases as clean and efficiently working as possible. Many have implemented homegrown processes to keep budgets balanced and financials straight. Officials at The University of Vermont decided to took beyond the school's walls and even the higher ed industry for an appropriate product.

They shopped around for toots toots  
n. Slang
Babe; sweetie.



[Perhaps short for tootsie.]
 used in the corporate arena for the best option, explains Mike Gower, the school's CFO See Chief Financial Officer.  and finance vice president. Strategic Finance from Hyperion, which lets the university unite art of its financial centers, from philanthropy to state appropriations to salaries and benefits, was selected. Financial information went into a single, interrelated in·ter·re·late  
tr. & intr.v. in·ter·re·lat·ed, in·ter·re·lat·ing, in·ter·re·lates
To place in or come into mutual relationship.



in
 database that allows administrators to see how a change in one area affects others. For example, it can forecast how tuition increases will influence research awards or renovation plans.

UVM UVM University of Vermont
UVM Universidad del Valle de México
UVM Universitas Viridis Montis (University of the Green Mountains aka University of Vermont)
UVM Universal Voice Module (Cisco) 
 had started as many schools do, with a series of Microsoft Excel spreadsheets tracking information in various departments, Gower notes. But complicated formulas, interlinked documents, and complex interactions soon bogged down the system. "We realized that it was taking all our time to manage the day-to-day," he says. "We wanted to be coming up with 10-year strategic plans that included issues like how our endowment would grow, or how different debt levels would affect us."

In particular, officials were looking to do a significant amount of debt financing Debt Financing

When a firm raises money for working capital or capital expenditures by selling bonds, bills, or notes to individual and/or institutional investors. In return for lending the money, the individuals or institutions become creditors and receive a promise to repay
 to support capital projects, and having proper forecasting tools was key. Strategic Finance, used by few universities but scores of corporations, is a software application focused on modeling, so executives can understand the financial impact of different strategies. Some corporations have used the tool to determine whether to consolidate multiple partner entities, for example, or to create contingency plans if market fluctuations occur. The application gives users a long-range view of decisions, showing how it might impact the company next year, or even next decade.

Although it was fairly unproven in an academic setting, the application could take on university-specific financial data, Gower suspected, as well as integrate numerous spreadsheets into a comprehensive whole that works well.

"When you're looking at growth, you don't want to be chasing down linked spreadsheets," he says. Using financial statements as a base, UVM inputs other financial information like tuition, state grants, or endowments, and does multiple projections. To keep the data clean, there are regular spot checks of major calculations, although Gower is confident that after two years of use, the system is working as it should.

"Trying to project operating capital budgets is complicated, to say the least," he says. "There are so many variables. So it's useful to put a toot in place that can forecast property, and show how changes affect the entire university, not just one part of it."

Resources

21st Century Technologies, www.21stsoft.com

Alpha Software, www.alphasoftware.com

AlumniFinder, www.alumnifinder.com

BMC (BMC Software, Inc., Houston, TX, www.bmc.com) A leading supplier of software that supports and improves the availability, performance, and recovery of applications in complex computing environments. , www.bmc.com

Citrix, www.citrix.com

Cogito This article is about the philosophical magazine. For the software used in the extended version of the current Linux revision system git, see Cogito (software). For the famous philosophical saying by Descartes, see cogito ergo sum. , www.cogitoinc.com

dbaDIRECT, www.dbadirect.cpm

Hyperion, www.hyperion.com

Melissa Data, www.melissadata.com

Microsoft Access, http://office.microsoft.com

Oracle, www.oracle.com

SAP, www.sap.com

Virtual Nerds custom developers, www.virtualnerds.com

Elizabeth Millard, a freelance writer based in Saint Louis Park Saint Louis Park, city (1990 pop. 43,787), Hennepin co., SE Minn., a suburb of Minneapolis; settled 1854, inc. 1886. There is printing and publishing, machining, food processing, and the manufacture of rubber products and furniture. , Minn., specializes in covering technology.
COPYRIGHT 2006 Professional Media Group LLC
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2006, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Millard, Elizabeth
Publication:University Business
Date:Aug 1, 2006
Words:2735
Previous Article:Endowment management new and proven strategies: keeping a pulse on the economy helps today's endowment managers in their quest to bring their...
Next Article:CMS: sticking with tried and true: institutions are boosting their content management efforts with the use of commercial software while looking at...



Related Articles
IT spending dips: budgets are bigger at private colleges.(Update)
Student database poses privacy concerns: but some say it would provide better data on higher ed trends.(IN THE NEWS)
Growing pains: as network usage increases, IHEs find ways to manage demand and keep the traffic flowing.
Tomorrow's I.D. card program: IHEs are looking to make their campus card services positively futuristic.
Back from the breach: IHEs find that recovery from security breaches must be part of every IT plan.
Building a financial fundraising case: a successful fundraising plan involves sharing data, tying donors' values to the cause, and then careful...
The outs (and ins) of facility management: outsourcing facilities management isn't an all-or-nothing decision on today's campuses.
Virginia schools in a privacy pickle: a well-intentioned state law raises significant data privacy risks for institutions and their...
Institutional impact: economic studies a popular tool.(BEHIND the NEWS)
Reducing business risk through cost-effective e-mail management: "can legacy systems effectively manage today's data volumes and varying data...

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles