Flexing your super financial sleuth power: here's how to pump up the detective power of Benford's law.EXECUTIVE SUMMARY * Benford's law Noun 1. Benford's law - a law used by auditors to identify fictitious populations of numbers; applies to any population of numbers derived from other numbers; "Benford's law holds that 30% of the time the first non-zero digit of a derived number will be 1 and it will holds that the leftmost left·most adj. Farthest to the left: in the leftmost lane of traffic. Adj. 1. leftmost - farthest to the left; "the leftmost non-zero digit" digit in many types of numerical data Numerical data (or quantitative data) is data measured or identified on a numerical scale. Numerical data can be analysed using statistical methods, and results can be displayed using tables, charts, histograms and graphs. is a 1 nearly one-third of the time, with probability inversely proportional See See also: Inversely to the value of each increasing digit. * This phenomenon can be harnessed to analyze data to detect values that deviate from normal distribution and thus could indicate fraud. An Excel template that applies z-score values to Benford's analyses of subsets of data can assess the probability that any one employee, customer or vendor among many may have committed fraud. * The template is flexible by allowing users to set a z-score threshold above which the Benford's analysis flags suspect data. ********** Worldwide fraud is on the rise. The magnitude of the problem prompted the AICPA AICPA See American Institute of Certified Public Accountants (AICPA). and the Association of Certified Fraud Examiners Established in 1988 the Association of Certified Fraud Examiners is the professional organization that governs professional fraud examiners. Its activities include producing fraud information, tools and training. to create the Institute for Fraud Prevention. To assist in detecting fraud, auditors need to employ innovative techniques like Benford's law, which predicts the frequency of digits 1 through 9 in the first four places of any number. American physicist Frank Benford Frank Albert Benford, Jr., (29 May 1883 Johnstown, Pennsylvania - 1948) was an American electrical engineer and physicist best known for rediscovering and generalizing Benford's Law, a statistical statement about the occurrence of digits in lists of data. in the 1930s observed that lower digits, beginning with 1, appear more frequently than higher ones, starting from the leftmost position of many types of collections of numbers. Provided with a large data set, auditors can use Benford's law to help detect fraud by analyzing all account transactions to see if they fall into the expected pattern (see "I've Got Your Number," JofA, May 99, page 79). Another JofA article ("Turn Excel Into a Financial Sleuth," Aug. 03, page 58) presented a Benford analysis using a "Fraud Buster" Excel template on aggregate data. To pump up auditors' detective powers, we present an Excel application that simultaneously applies the technique to each employee, and can be easily adapted for other groups including customers and vendors. A Big Four firm is evaluating this approach, which has been successfully used by an internal auditing department of a large international company. Our application has received positive feedback because it is easy to use and can help identify fraudsters, particularly when used in combination with other methods. An internal auditor Internal auditor An employee of a company who analyzes the company's accounting records to that the company is following and complying with all regulations. from an international retail chain said the procedure allowed him to "work smarter instead of harder--the key to success when you are dealing with vast amounts of data." The method could help auditors in such activities as SAS (1) (SAS Institute Inc., Cary, NC, www.sas.com) A software company that specializes in data warehousing and decision support software based on the SAS System. Founded in 1976, SAS is one of the world's largest privately held software companies. See SAS System. no. 99's requirement that they discuss the potential for a material misstatement mis·state tr.v. mis·stat·ed, mis·stat·ing, mis·states To state wrongly or falsely. mis·state ment n. in financial statements due to fraud (see
"Auditors' Responsibility for Fraud Detection," JofA,
Jan. 03, page 28).
PUMPING UP WITH Z-SCORES Benford's law is used by several countries and states (for example, California and New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of ) to identify tax defrauders. Several software companies--for example, Apex Analytix and Cost Recovery Solutions--use Benford's law to identify suspicious vendors. Using such software, internal auditors at a large international manufacturing firm run a Benford's law test for each vendor based on amounts that exceed user-defined exception levels. One of these auditors indicated that the company was experimenting to find the optimum level of deviations from Benford's law. Exactly how do we increase the detective power of Benford's law? For each employee, customer, vendor or other party to a transaction we calculate a z-score for each leading digit (1 to 9). The z-score is a statistical measure of how many standard deviations In statistics, the average amount a number varies from the average number in a series of numbers. (statistics) standard deviation - (SD) A measure of the range of values in a set of numbers. a number is from the mean and allows the auditor to empirically determine--not guess--whether deviations from the pattern are statistically significant. The larger the z-score, the less likely it is that unexpected frequencies are the result of chance. The auditor selects the maximum allowable z-score corresponding to the level of error that he or she is willing to accept. For example, if the auditor is willing to accept a 5% chance of drawing the wrong conclusion, the auditor would set the maximum allowable z-score to 1.96. Any z-score that exceeds the auditor's maximum allowable z-score may indicate fraud and must be investigated further. Thus our approach eliminates the need to experiment with the appropriate exception level we just leave it up to statistics! HOMING IN ON FAKE SALES RETURNS While our workbook work·book n. 1. A booklet containing problems and exercises that a student may work directly on the pages. 2. A manual containing operating instructions, as for an appliance or machine. 3. can be used in many situations, we selected the setting used by the internal auditor quoted above, who investigated sales returns at a retail store. Selected employees are authorized to process sales returns, with management approval required for returns of $500 or more. Our dataset includes 56,000 hypothetical sales returns over a six-month period and contains the following fields: transaction number, date, employee number, sales return amount, and, if required, manager number. Exhibit 1 shows the result of a Benford analysis with the commonly used ACL See access control list. 1. ACL - Access Control List. 2. ACL - Association for Computational Linguistics. 3. ACL - A Coroutine Language. A Pascal-based implementation of coroutines. ["Coroutines", C.D. commercial data analysis software, performed on the entire population of sales returns. The z-score (Zstat ratio) amounts are relatively small, less than ACEs 1.96 default z-score. Therefore, an auditor would not suspect any fraud, because the sales return counts fall within the expected pattern and the count of each leading digit falls within its acceptable range. However, the dataset contains a fraud perpetrated by Amy (employee 8136 in the Excel workbook available online at www.aicpa.org/ download/pubs/jofa/june2007/SuperBenford.xls) who wrote 60 fake returns (1.1% of her 5,320 returns) for amounts just under the $500 managerial level. To increase the chance of detecting fraud, we propose an automated method that applies Benford's law to each employee by analyzing the frequency of the first digit of every transaction amount. [ILLUSTRATION OMITTED] EXCEL TO THE RESCUE (AGAIN) The Using This Workbook tab in the workbook guides you step by step through an evaluation of our sales return dataset. The steps include (1) extracting the first digit using two text functions (Exhibit 2), (2) creating a PivotTable to calculate the actual frequencies for each employee using the COUNT function (Exhibit 3), and (3) calculating a z-score using the actual counts and expected frequencies for each combination of digits for each employee (Exhibit 4). The spreadsheet displays "yes" for any z-score that exceeds the maximum allowable z-score in cell K2, currently set at 1.28. Based on this z-score value, we can conclude that there is less than a 20% chance that frequencies identified in column L with a "yes" are the result of chance. [ILLUSTRATIONS OMITTED] Want to increase the power of your test? Just change the z-score. If you are willing to accept only a 10% probability that the unexpected frequencies are the result of chance, then set the z-score to 1.65. For a 5% or 1% chance, set the z-score to 1.96 or 2.58, respectively Amy's fraud (issuing fake returns near $500) has a maximum z-score of 2.59 for the 4 digit, as shown in cell E17. Given that Amy's score exceeds 2.58, there is less than a 1% probability that her actual return frequencies are due to chance--indicating that fraud is very likely WORKING WITH YOUR DATA Using the instructions on the Using Your Data worksheet, you can import your data into the application. If the expected frequencies of your dataset don't follow Benford's law (see www.theiia.org/ ITAudit/index.cfm?act=itaudit.archive &fid fid n. 1. Nautical A square bar used as a support for a topmast. 2. A large tapering pin used to open the strands of a rope before splicing. [Origin unknown.] =58), your expected frequencies can be entered in row 5 of the Evaluation worksheet. For example, a higher percentage of sales at a retail grocery store may begin with the digits 1 and 2, with fewer sales beginning with the digits 3 to 9. Got a large dataset? Although Excel easily handled our 56,000-line dataset, yours may overpower o·ver·pow·er tr.v. o·ver·pow·ered, o·ver·pow·er·ing, o·ver·pow·ers 1. To overcome or vanquish by superior force; subdue. 2. To affect so strongly as to make helpless or ineffective; overwhelm. 3. Excel. No problem! Programs such as Microsoft Access A database program for Windows, available separately or included in the Microsoft Office suite. Access is programmable using Visual Basic for Applications (VBA). Access can read Paradox, dBASE and Btrieve files, and using ODBC, Microsoft SQL Server, SYBASE SQL Server and Oracle data. and ACL can identify the leading digit and create a cross-tabulation that can be copied into our workbook's Actual worksheet. Access users can create a crosstab query to replace Excel's PivotTable. ACL users can substitute that program's LEADING function for the formula in step 1 to extract the first digit, then create a cross-tabulation table. A MORE DYNAMIC ANALYSIS Internal auditors, external auditors The examples and perspective in this article or section may not represent a worldwide view of the subject. Please [ improve this article] or discuss the issue on the talk page. and managers are under increasing pressure to identify fraud. The combination of the three text functions and Excel's PivotTable allows a more dynamic data analysis when using Benford's law. However, like most audit tests, Benford's law cannot be relied upon to catch all frauds, which often requires a combination of approaches. For example, Benford's law did not identify the fraud committed by Tom, who scattered his 100 fake returns (2.1% of his 4,833 returns) randomly between $100 and $500 (see employee 1981 in the workbook). Certainly, Tom's maximum z-score of 0.88 (for the 4 digit) is larger than other employees' but not large enough to exceed the acceptable z-score. Benford's law does not work with small datasets, data with assigned numbers (standard) assigned numbers - The RFC STD 2 documenting the currently assigned values from several series of numbers used in network protocol implementations. This RFC is updated periodically and, in any case, current information can be obtained from the Internet Assigned Numbers or artificial minimums and maximums, and numbers that are truly random. Despite these limitations, Benford's law can still be a powerful financial sleuth. Both auditors and managers must use tools such as Benford's law in combination with their professional judgment and investigative skills to uncover fraud. AICPA RESOURCES JofA articles * "Worldwide: Looters Have a Foothold," JofA, Jan. 07, page 32, www.aicpa.org/ pubs/jofa/jan2007/special1.htm. * "Turn Excel Into a Financial Sleuth," Aug. 03, page 58, www.aicpa.org/pubs/ jofa/aug2003/rose.htm. * "I've Got Your Number," May 99, page 79, www.aicpa.org/pubs/jofa/ may1999/nigrini.htm. Publications * CPAs Handbook for Fraud and Commercial Crime Prevention (#056504). * Fraud Detection in a GAAS See gallium arsenide. Audit (#006615). CPE (Customer Premises Equipment) Communications equipment that resides on the customer's premises. CPE - Customer Premises Equipment * Identifying Fraudulent Financial Transactions: A CPE Self-Study Course (#730546). For more information or to place an order, go to www.cpa2biz.com, or call the Institute at 888-777-7077. OTHER RESOURCES Web sites * Association of Certified Fraud Examiners, www.acfe.com. * AuditSoftware.net, www.auditsoftware.net. RELATED ARTICLE: Astronomer's keen eye paves way for law. Who would've thought that an astronomer and a physicist could develop a tool for use in accounting? What came to be called Benford's law was discovered in 1881 by the American astronomer Simon Newcomb, who observed that the pages of printed logarithmic logarithmic pertaining to logarithm. logarithmic relationship when the logs of two variables plotted against each other create a straight line. tables starting with the number 1 were much more worn than later pages. Newcomb then analyzed how numbers were distributed in naturally occurring data and derived the frequencies of what is now called Benford's law. Unfortunately Newcomb's discovery went unnoticed until 1938, when Frank Benford, a physicist, rediscovered the same worn pattern of the logarithmic tables. Benford analyzed 20,229 sets of numbers, including baseball statistics Statistics are very important to baseball, perhaps as much as they are for cricket, and more than almost any other sport. Since the flow of baseball has natural breaks to it, the game lends itself to easy record keeping and statistics. , areas of rivers and numbers in magazines. Surprisingly, these number sets all follow the same first-digit pattern. Mark Lehman, CPA, Ph.D., and Marcia Weidenmier Watson, CPA, Ph.D., are associate and assistant professors, respectively, of accountancy at Mississippi State University Mississippi State University, at Mississippi State, near Starkville; land-grant and state supported; coeducational; chartered 1878 as an agricultural and mechanical college, opened 1880. From 1932 to 1958 it was known as Mississippi State College. , Starkville, Miss. Their e-mail addresses See Internet address. e-mail address - electronic mail address are mark.lehman@msstate.edu and mweidenmier@cobilan.msstate.edu, respectively. Tim Jones William Timothy Jones (born December 1, 1962, in Sumter, South Carolina) was a Major League Baseball infielder. He is an alumnus of The Citadel. Drafted by the St. Louis Cardinals in the 2nd round of the 1985 MLB amateur draft, Jones would make his Major League Baseball , MPA MPA medroxyprogesterone acetate. , is a graduate student at Mississippi State University. Benford's Law 1 30.1% 2 17.6% 3 12.5% 4 9.7% 5 7.9% 6 6.7% 7 5.8% 8 5.1% 9 4.6% For many kinds of data, the leftmost digit is a 1 nearly one-third of the time, with the probability inversely proportional to increasing digit value. Note: Table made from bar graph. |
|
||||||||||||||||

ment n.
Printer friendly
Cite/link
Email
Feedback
Reader Opinion