Intelligent Enterprise

Better Insight for Business Decisions

Intelligent Enterprise - Better Insight for Business Decisions
search Intelligent Enterprise
Advanced Search
RSS
Webcasts
Whitepapers
Subscribe
Home




July 23, 2001



8 Steps to Better Customer Identification

The first step in retaining profitable customers is to know who they are

By David Cameron

Continued from Page 1

ALLOW FOR COUNTRY-BASED BUSINESS RULES

Marketers must account for national name and address conventions when identifying customers. In New York City, the address "20 5th Avenue 203" probably means house number 20 on 5th Avenue, unit 203. In Holland, "Vanhelsdingenlaan 16" means house number 16 on Vanhelsdingenlaan. Such varied use of numerics after street names requires different business rules to ensure you match the correct fields. In Spain, multiple middle names are common. In Quebec, women routinely retain their maiden names. In Britain, first initials replace first names in most instances. Without country-by-country adjustments to the matching rules, marketers can quickly and extensively introduce errors into the system and corrupt customer identification.

MINIMIZE TOTAL ERROR

In traditional merge-purge processes, the goal is to minimize undermatching. What is undermatching? Here's an example: Two identical records fail to match only because one has an apartment number and one doesn't. One common way to fix this error is to remove the requirement that apartment numbers match, which prevents duplicate mail from being sent to the same individual. What happens, unfortunately, is that matching rules are biased to minimize undermatching at the expense of overmatching, which results in erroneous matches of completely different records.

But pushing too far to fix one problem can inadvertently cause another. "Fixing" an undermatch causes too much overmatch in other areas, such as when the absence of an apartment number keeps legitimately different records separate from one another. Most often marketers never notice that they have actually increased error in an effort to reduce it. By adjusting rules so that overmatching and undermatching are treated equally, marketers can reduce total error.

INCLUDE DATA QUALITY IN THE PROCESS

Most errors in matching are not caused by inadequate business rules. In fact, confusion in identifying address elements causes most matching errors. If you identify a house number incorrectly, you can't use it in a match. This result is especially true when cleaning up business names. An important step is removing common words both universally (for example, "the," "a," "an", and so on) and geographically (for instance, "Bay State" in Massachusetts names, or the name of the street from the business name, like 31st Street Antiques). These matching errors are data quality issues.

You also need to flag records based on common words. For example, key words like Corp., Inc., GmBh, SA, and so forth, connote corporations, whose names you can't use elsewhere. The absence of this suffix (for example, Town Pizza, which occurs in virtually every town in America) is also significant. Other flags to set and account for include common words flags (where a name consists of all common words, such as "Boston Consulting Group") and franchise flags (McDonalds, Burger King, Pizza Hut, and so on). These flags play a crucial role in the match.

Also, creating a separate copy of the address purely for matching purposes is extremely important. For instance, you must not change Beacon Hill to Boston for a mailing because it often angers customers, but for matching reasons, you need to substitute the true city for this vanity reference.

Incorporating data quality into the matching process answers all these issues and dramatically improves accuracy.

IMPLEMENT BOTH CALLABLE AND BATCH-RULE EXECUTION

This practice is critical to realtime relationship identification and successful one-to-one and permission marketing on the Internet. Basically, marketers must have the capability of running matching rules on demand or periodically.

Most batch processes run periodically to clean up duplicates in customer databases or incorporate new activity into the customer view. Although this step is necessary, equally important are identifying individuals in real time while you interact with them and linking them to the total customer view in order to establish a dialog. Customer identification must be done consistently across all processes and touch customer points, regardless of platform. Technically, this process requires that your matching rules are available in library form so that you can match one record at a time into a database - also known as callable matching. You should use the same business rules in many-to-many batch matching processes.

Most systems have inconsistent business rules for callable and batch matching and inconsistent rules across different platforms. Such discrepancy means it is impossible to consistently match data in both real time and during periodic updates, which prevents the establishment of good customer dialogs.

IMPLEMENT A TIERED RULE SYSTEM

The most effective matching systems are multitiered and involve multiple points of control.

The content of data fields is the first point of control, such as data quality. The second point of control is which fields you use - for example, name and address; name and email; name, address, and email. It should be possible to choose any set of fields for matching. The third point of control is the thresholds for a match when comparing any one field. For instance, comparing two email addresses requires an exact match, but when comparing two last names, one or two errors is acceptable. The fourth point of control is the combination of matches required to associate two records. For example, if the first and last name are exact, but the city is totally different while the street address is close, that might be considered a match. However, if errors occur in the first and last name, the city might increase in importance to overcome those errors.



Rate This Article

Comments:

Optional e-mail address:

These complex points of control ensure that you can implement sophisticated business rules to accommodate special situations.

In summary, the realtime requirements of Internet marketing and the rise of global marketing have forced significant changes on traditional customer identification systems. Marketers need to retrain themselves in order to achieve scalability and extensibility, forethought, and a well-designed architecture. Reliance on traditional methods will result in an inability to cross from the present generation of marketing science to the next.

DAVID CAMERON [david.cameron@wheelhouse.com] is vice president of data integration and analytics for Wheelhouse Corp., a Burlington, Mass.-based marketing infrastructure services provider. Cameron is a frequent consultant, speaker, and publisher of topics in this field.


RESOURCES

Related Articles on IntelligentEnterprise.com:

"The Model Customer," January 30, 2001

"I Buy, Therefore I Am," April 28, 2000







IE Weekly Newsletter
Subscribe to the newsletter
    Email Address







InformationWeek Business Technology Network
InformationWeekInformationWeek 500InformationWeek 500 ConferenceInformationWeek AnalyticsInformationWeek CIO
InformationWeek EventsInformationWeek ReportsInformationWeek MagazinebMightyByte and SwitchDark Reading
Digital LibraryIntelligent EnterpriseInternet EvolutionNetwork ComputingNo Jitter
space
Techweb Events Network
InteropVoiceConWeb 2.0 ExpoWeb 2.0 SummitEnterprise 2.0 ConferenceMobile Business ExpoSoftware ConferenceCSI - Computer Security Institute
Black HatGTECEnergy CampMashup CampStartup Camp
space
Light Reading Communications Network
Light ReadingLight Reading EuropeUnstrungLight Reading's Cable Digital NewsConstantinopleInternet Evolution
Heavy ReadingLight Reading Live!Light Reading InsiderEthernet ExpoOptical ExpoTeleco TVTower Technology Summit
space
Financial Technology Network
Advanced TradingBank Systems & TechnologyInsurance & TechnologyWall Street & TechnologyAccelerating Wall StreetBank Systems & Technology Executive SummitBuyside Trading SummitInsurance & Technology Executive Summit
space
Microsoft Technology Network
MSDN MagazineTechNetThe Architecture Journal
space