A meta-analysis of the diagnostic accuracy of the SCOFF.
Although anorexia nervosa (AN) and bulimia nervosa (BN) are well-known disorders, in community studies they are not so prevalent. Most cases belong to the category of Eating Disorders Not Otherwise Specified (EDNOS) (Eddy et al., 2010) of the DSM-IV-R (APA, 2002). Although EDNOS can be even more severe and enduring, often they are not diagnosed until the disorder is well established, delaying the diagnosis and appropriate treatment. Above all, because the symptoms are presented first to non-specialists such as family doctors, they are not immediately recognized as related to an ED (Ogg, Millar, Pusztai, & Thom, 1997). Thus, it is crucial to identify the subpopulation at risk in order to encourage a higher awareness of the availability of treatments (Becker et al., 2005; Herpertz-Dahlmann, Wille, Holing, Vloet, & Ravens-Sieberer, 2008). There is a great interest in the public health sector in improving effective screening and the strategies of secondary prevention for ED in adolescents. High-quality tools are required for effective screening in epidemiological studies or in primary care, given that because ED has a relatively low prevalence in the general population, the community studies that assess the diagnostic efficacy of screening tests need large samples to ensure adequate power and accuracy.
One of the tests most often recommended is the SCOFF questionnaire, which was developed as a quick and reliable tool for the screening of ED. Morgan, Reid, and Lacey (1999) designed the SCOFF as a test that could be administered by non-specialists able to detect symptoms of ED. According to the authors the proposed cut-off allows for a clinically appropriate balance between false positives and false negatives. This cut-off is a cue for suspicion that an ED exists and it should be followed by additional questioning about the patient's weight and their eating attitudes and behaviors (Hill, Reid, Morgan, & Lacey, 2010).
Several studies have examined the psychometric characteristics of the SCOFF, both in the original version and in its translation into several other languages (see Table 1). In those studies acceptable trade-offs between sensitivity and specificity have been found, along with high levels of reliability. Today there are enough assessment studies available to conduct a quantitative synthesis. A meta-analysis provides for integration of the results of several studies that assess the diagnostic efficacy of a tool (Botella & Huang, 2012; Deville et al., 2002; Gatsonis & Paliwal, 2006; Zweig & Campbell, 1993), which can also be done for other psychometric characteristics (Botella, Suero, & Gambara, 2010). The results are expressed mainly as the values of sensitivity and specificity of the pooled estimates that combine the estimates provided by primary studies. Our own meta-analysis also attempted to partially account for the variability observed, analyzing the role of several moderator variables.
In short, our aim was to perform a systematic review and meta-analysis to summarize the literature on assessments of the diagnostic efficacy of the SCOFF for detecting ED.
The acronym refers to the SCOFF's questions, which are related to five key features in ED (Sick, Control, One, Fat and Food; see appendix A). Individuals must answer Yes/No to each question, scoring one point for each positive answer (Morgan et al., 1999). The authors proposed a cut-off of two or more points for recommending the clinician to follow up with a deeper and more rigorous assessment.
We performed a systematic search of the literature using five international databases: Medline, PsycINFO, EMBASE, Web of Science (Science Citation Index Expanded and Social Sciences Citation Index) and Cochrane Library. Two of the authors (ARS and HG) searched for all the papers written in English, German or Spanish, published in peer-reviewed journals between 1999 and August 2011. The list of key words for the search included: SCOFF questionnaire combined with each of the terms 'eating disorder', 'anorexia nervosa', 'bulimia nervosa', 'screening', 'primary care', 'validation', 'psychometrics' and 'prevalence'. Each of those combinations was also combined with the term 'screening'. More than 50 combinations of words were searched in each of those data bases. We also did manual searches of the references cited in the selected papers.
Once we had read the abstracts we recovered copies of the relevant papers. Most of the retrieved studies reported the use of the SCOFF in a sample of participants in intervention studies or prevention programmes, but they did not provide relevant information related to diagnostic efficacy.
Inclusion and exclusion criteria
After the process described in the previous section we still had 30 papers for possible inclusion in the meta-analysis. Studies were finally selected if they reported the use of the SCOFF in a sample of ED and other comparison sample, besides any measurement with a diagnostic reference. We excluded studies as follows: (a) those written in languages other than English, German or Spanish; (b) those not published in peer-review journals; (c) those which reported results for the SCOFF as a recommended tool for screening, including psychometric studies; (d) those which compared two forms of administration (oral versus written); (e) those in which although sensitivity and specificity were reported, the size of both samples was not reported, so that proper statistical treatment was not allowed; (f) those which related to a sample voluntarily participating in intervention programmes, as the data were likely to be biased.
No restrictions were placed on the gender and age of the participants, the type of sample, or the type of reference employed as a diagnostic reference. A total of 15 studies met all the inclusion criteria and were finally selected. They were performed in eight different countries and in seven different languages. Most of them are assessments of new translations to a different language. This gives us an opportunity to assess generalizability. The advantage of a tool with high generalizability is that it allows for comparisons and synthesis of the results from studies performed in different languages and countries.
The decision criteria for exclusion were independently applied by two of the authors (JB and ARS), with a high inter-rater agreement (coefficients equal to 1 for most variables, and all above .85).
Diagnostic references for the Gold Standard
Studies that assess the SCOFF's efficacy for screening have employed a large variety of tools as the diagnostic reference (gold standard). In 60% of cases individual diagnostic interviews are used which employ several tools (CIDI, DSM-IV, MINI, SCAN). Although they are not 100% reliable, their performance is good enough for them to be accepted as a gold standard.
Some studies, however, have chosen as diagnostic references specific cut-offs in psychometric tools, such as EAT-40, EAT-26 and EDI-2. Given that these procedures are less appropriate for the role of a gold standard, lower levels of diagnostic efficacy are to be expected compared with studies based on interviews.
Extraction of basic data
The data from each paper were systematically and independently extracted by two of the authors using a structured database; they reached perfect agreement for the four raw frequencies in the studies. The data related to the tool were transformed, when necessary, to reach the four frequencies: true positives (TP), false positives (FP), false negatives (FN) and true negatives (TN). When the frequencies were reported separately according to the disorder (AN, BN, EDNOS) they were summed to reach the totals. In most cases the papers provided sensitivity and specificity, plus the size of each group, so that the frequencies were obtained from those quantities.
The following fields were coded: year of publication, country, language of the version employed, size of the samples, mean age, gender (percentage of women) and the tool employed for the diagnostic reference. We abandoned some initial fields because the number of studies providing the relevant information was very low (e.g. duration of the illness, mean body mass index).
Table 1 identifies each of the studies, with the values of the most relevant variables analyzed in the present research. The sources are marked with an asterisk in the references.
We have adjusted hierarchical models of the summary ROC curve (Gatsonis & Paliwal, 2006; Macaskill, 2004) employing the NLMIXED procedure of SAS (2008). These models provide combined estimates of the parameters that reflect diagnostic efficacy, the criterion for classification and the scale parameter. The parameters allow combined estimates of the performance indexes of the test to be derived, within the framework of Random Effects models. Random Effects models assume that the individual data sets are drawn from a distribution of populations. This is more credible than the assumption that all them come from a single population, as in Fixed Effect models (Borenstein, Hedges, Higgins, & Rothstein, 2010).
Besides the basic model we have explored several models that include as covariates the moderators for which we had enough information (mean age, gender, type of reference). These are Mixed models, as they involve a main Random Effect component, whereas the moderators are included as a Fixed Effect.
We have also obtained descriptive summaries and several types of figures from Review Manager (2008) and METADISC (Zamora, Abraira, Muriel, Kahn, & Coomarasamy, 2006).
Characteristics of the studies
The set of 15 studies aggregated 882 cases with an ED and 4350 healthy controls, according to the diagnostic references employed. The main characteristics of the selected studies are described in Table 2.
Women predominate in the composition of the samples (between 50% and 100%). The average age is reported in only two-thirds of the studies (although all report the range).
Very varied procedures were employed for classification as the diagnostic reference. In 60% they are interviews, often structured by several well-known tools (CIDI, DSM-IV, MINI, SCAN). In the other 40% the classification was done by means of a cut-off in another test (EAT, EDI, EDEQ).
Diagnostic efficacy of the SCOFF
First of all we checked for any threshold effect. This effect is produced by shifts of the criteria for classification between different studies. It is reflected in a negative correlation between sensitivity and specificity. Although this effect is typical in tests with an implicit criterion for classification, it is also achievable with an explicit criterion that reflects a latent variable (such as the SCOFF). The Spearman's correlation between the sensitivities and the specificities in our 15 studies was not statistically significant ([r.sub.se] = .377; p = .166), so we discarded this effect in our set of studies.
We fitted hierarchical models of the summary ROC curve (Gatsonis & Paliwal, 2006; Macaskill, 2004) by means of the NLMIXED procedure (SAS, 2008). Assuming logistic distributions, this model estimates several parameters, although there are really just two that characterize the performance of a binary classification tool. The first one, alpha, reflects the ability of the test for discrimination. The second one, theta, reflects how conservative or liberal the threshold for classification is. The model estimates the parameters jointly, so that it takes into account the joint variation inherent in any potential threshold effect (even if it is small and non-significant). Then the estimates of the parameters are converted into pooled estimates of the sensitivity and specificity.
The basic hierarchical model (without covariates) fitted with the 15 studies provides combined estimates of the parameters that involve a sensitivity of .801 and a specificity of .934. The diagnostic odds ratio for these estimates is 56.96. The scale parameter, beta, is statistically different from zero (p = .046). This means that the summary curve is asymmetric (alpha and theta are not independent). The values for sensitivities show higher heterogeneity than those for specificities. This is a consequence of the fact that the size of the samples of targets is generally lower than the size of the samples of controls.
The homogeneity test for the values of sensitivity and specificity in the 15 studies shows that in both indexes the observed variability is larger than expected [sensitivity, Q(14) = 139.98, p < .001; specificity, Q(14) = 202.55, p < .001]. Figure 1 shows the combined forest plot of both indexes.
We fitted HSROC models that incorporate the moderator variables mean age, gender and type of diagnostic reference as covariates associated with the parameters alpha and/or theta. Whereas the models with the mean age did not show a significant association with alpha or theta, those including the gender and the type of diagnostic reference did show marginally significant associations with the parameter alpha, but not with theta. As regards gender, we did not have any a priori hypothesis for the direction of any eventual association. The observed association is positive. That is, the ability of the SCOFF to discriminate between cases and normals is greater the larger the percentage of women in the sample. The efficacy of the test is marginally higher for women than for men (p = .052).
As regards the type of tool employed as the diagnostic reference, we expected a greater ability for discrimination when tools based on individual interviews were employed as compared with psychometric tools. Actually, the psychometric tools are not a real gold standard, given that their results are not beyond question. They have an imperfect reliability that underestimates the efficacy for classification of the tool for screening, as they classify some individuals (cases and normals) in the wrong group. This covariate has been included in the model as a dummy variable. As expected, it is associated with the test's capability for discrimination. The diagnostic efficacy is marginally greater when the diagnostic reference is based on an interview than when it is based on a psychometric test (p = .057).
Given that the diagnostic efficacy is significantly different according to the type of diagnostic reference and that the measurements based on interviews are clearly superior, the best estimates we can afford of the performance of the test are the estimates based on this last type of measurement. Fitting a new HSROC model only to the nine studies that employed diagnostic references based on interviews gives pooled estimates of .882 for sensitivity and .925 for specificity. The corresponding odds ratio for those estimates is 92.19.
Some of the primary studies have highlighted that perhaps the sensitivity of the SCOFF could be different for AN, BN and EDNOS (Garda-Campayo et al., 2005). In order to study this issue we selected the studies that provide relevant data. Of course, those are only the studies in which the measure of the diagnostic reference is based on any form of interview, given that the psychometric tests do not discriminate between the three diagnostics. Eight from the nine studies that employ some form of interview provide frequencies of positive classification by the SCOFF, disaggregated according to the diagnostic (see Table 1).
After testing as potential covariates the frequency of AN and BN we have concluded that they are not significantly different. When the proportion of EDNOS in the sample is included, however, a significant association with diagnostic efficacy is apparent. Specifically, the diagnostic efficacy is significantly better for EDNOS than for AN/BN (p = .011). This difference can be associated with a greater tendency of the AN/BN cases to deny the illness, hiding the symptoms when answering the SCOFF. It can also be related to a lower sensitivity in individuals with previous low weight. Specifically, question 3 of the SCOFF makes reference to a recent extreme weight loss. It is possible that patients with AN or BN consider that it has not happened recently, given that these patients reduce the Body Mass Index (BMI) step by step over a long period. Furthermore, the SCOFF was developed as a tool for detecting patients with ED in an early stage in the general population, not for identifying whether the adolescent has a BMI of 17.5 kg/[m.sup.2], a key criterion for diagnosing a AN.
The joint analysis of 15 studies that assess the diagnostic efficacy of the SCOFF shows that the test is highly effective as a quick screening tool for ED. Its brevity and simplicity have allowed easy translation into multiple languages that have shown comparable values of performance. The pooled estimates from the present 15 studies are .801 for sensitivity and .934 for specificity.
The analysis of the moderator variables shows that the diagnostic efficacy is associated with the gender and the type of diagnostic reference. Specifically, the efficacy increases as the percentage of women in the sample increases and if the diagnostic reference is based on an interview. The best estimates from these data are those combining the nine studies based on interviews; they provide values of .882 for sensitivity and .925 for specificity. The comparison of these values with those provided by the combined estimates of the 15 studies reflects another interesting characteristic. When interviews instead of psychometric tests are used the specificity remains practically unchanged; however, the sensitivity rises from .801 to .882. This increase reveals the ability of a professional (even a non-specialist) to detect symptoms, without any associated increase in the false positives. That is, the effect is not based on a shift of the criteria, but on the genuine advantage of the interview over the psychometric tests in terms of discriminating between the cases and the controls.
Nevertheless, the effects associated to gender and the type of diagnostic reference have been detected with statistical tests only marginally significant (p < .06). Convergent evidence is needed to reinforce the role of both moderators.
In short, the results from the present meta-analysis reinforce the idea that the set of five questions of the SCOFF constitute a highly efficient tool for the detection of ED, even by a non-specialist, in several languages. Its use as a screening tool is highly recommended.
Correspondence concerning this article should be addressed to Juan Botella. Facultad de Psicologia. Departamento de Psicologia Social y Metodologia. Universidad Autonoma de Madrid. Cantoblanco s/n. 28049. Madrid (Spain).
This research was supported by the project PSI2009-12071, from the Spanish Ministerio de Educacion y Ciencia.
American Psychiatric Association (2002). Diagnostic and statistical manual of mental disorders (DSM-IV-TR; 4th Ed). Washington, DC: American Psychiatric Association.
Becker A. E., Thomas J. J., Franko D. L., & Herzog D. B. (2005). Disclosure patterns of eating and weight concerns to clinicians, educational professionals, family, and peers. International Journal of Eating Disorders, 38, 18-23. http://dx.doi.org/10.1002/eat.20141
Berger U., Wick K., Holling H., Schlack R., Bormann B., Brix C., ..., & Strau B. (2011). Screening riskanten Essverhaltens bei 12-j a hrigen M a dchen und Jungen: Psychometrischer Vergleich der deutschsprachigen Versionen von SCOFF und EAT-26. [Screening of Disordered Eating in 12-Year-Old Girls and Boys: Psychometric Analysis of the German Versions of SCOFF and EAT-26]. Psychotherapie, Psychosomatik & Medizinische Psychologie, 61, 311-318. http://dx.doi.org/10.1055/ s-0031-1271786
Borenstein M., Hedges L. V., Higgins J. P. T., & Rothstein H. R. (2010). A basic introduction to fixed-effect and random-effects models for meta-analysis. Research Synthesis Methods, 1, 97-111. http://dx.doi.org/10.1002/jrsm.12
Botella J., & Huang H. (2012). Procedimientos de meta-analisis de la precision de clasificaciones binarias. [Procedures for performing meta-analyses of the accuracy of tools for binary classification]. Psicothema, 24, 133-141.
Botella J., Suero M., & Gambara H. (2010). Psychometric inferences from a meta-analysis of reliability and internal consistency coefficients. Psychological Methods, 15, 386-397. http://dx.doi.org/10.1037/a0019626
Caamano F., Aguiar M., Lopez-Otero M. J., & Takkouche B. (2002). Validation of SCOFF questionnaire among preteenagers. British Medical Journal, (rapid response). Retrieved from http://www.bmj.com/search/ Validation%20of%20SCOFF%20questionnaire%20 among%20preteenagers?filters=type:bmj_rapid_response
Cotrufo P., Barretta V., & Monteleone P. (1998). Un estudio epidemiologico sobre los trastornos de la conducta alimentaria en dos institutos de segunda ensenanza en Napoles. [An epidemiological study on eating disorders in two high schools in Naples] European Psychiatry, 12, 342-344. http://dx.doi.org/10.1016/S0924-9338(97) 80003-X
Cotton M. A., Ball C., & Robinson P. (2003). Four simple questions can help screen for eating disorders. Journal of General Internal Medicine, 18, 53-56. http://dx.doi.org/ 10.1046/j.1525-1497.2003.20374.x
Deville W. L., Buntinx F., Bouter L. M., Montori V. M., de Vet H. C., van der Windt D. A., & Bezemer P. D. (2002). Conducting sytematic reviews of diagnsotic studies: Didactic guidelines. BMC Medical Research Methodology, 2, 2-9. http://dx.doi.org/10.1186/1471-2288-2-9
Eddy K. T., Swanson S. A., Crosby R. D., Franko D. L., Engel S., & Herzog D. B. (2010). How should DSM-V classify eating disorder not otherwise specified (EDNOS) presentations in women with lifetime anorexia or bulimia nervosa? Psychological Medicine, 40, 1735-1744. http://dx.doi.org/10.1017/S0033291709992200
Favaro A., Ferrara S., & Santonastasio P. (2003). The spectrum of eating disorders in young women: A prevalence study in a general population sample. Psychosomatic Medicine, 65, 701-708. http://dx.doi.org/ 10.1097/01.PSY.0000073871.67679.D8
Garcia F. D., Grigioni S., Allais E., Houy-Durand E., Thibaut F., & Dechelotte P. (2011). Detection of eating disorders in patients: Validity and reliability of the French version of the SCOFF questionnaire. Clinical Nutrition, 30, 178-181. http://dx.doi.org/10.1016Zi.dnu.2010.09.007
Garcia F. D., Grigioni S., Chelali S., Meyrignac G., Thibaut F., & Dechelotte P. (2010). Validation of the French version of SCOFF questionnaire for screening of eating disorders among adults. World Journal of Biological Psychiatry, 11, 888-893. http://dx.doi.org/10.3109/15622975. 2010.483251
Garcia-Campayo J., Sanz-Carrillo C., Ibanez J. A., Lou S., Solano S., & Alda M. (2005). Validation of the Spanish version of the SCOFF questionnaire for the screening of eating disorders in primary care. Journal of Psychosomatic Research, 59, 51-55. http://dx.doi.org/10.1016/j. jpsychores.2004.06.005
Gatsonis C., & Paliwal P. (2006). Meta-analysis of diagnostic and screening test accuracy evaluations: Methodological primer. American Journal of Roentgenology, 187, 271-281. http://dx.doi.org/10.2214/AJR.06.0226
Herpertz-Dahlmann B., Wille N., Holling H., Vloet T. D., & Ravens-Sieberer U. (2008). Disordered eating behavior and attitudes, associated psychopathology and health-related quality of life: Results of the BELLA study. European Child & Adolescent Psychiatry, 17, 82-91. http://dx.doi.org/10.1007/s00787-008-1009-9
Hill L. S., Reid F., Morgan J. F., & Lacey J. H. (2010). SCOFF, the development of an eating disorder screening questionnaire. International Journal of Eating Disorders, 43, 344-351. http://dx.doi.org/10.1002/eat.20679
Lahteenmaki S., Aalto-Setala T., Suokas J. T, Saarni S. E., Perala J., Saarni S. I., ..., & Suvisaari J. M. (2009). Validation of the Finnish version of the SCOFF questionnaire among young adults aged 20 to 35 years. BMC Psychiatry, 9. http://dx.doi.org/10.1186/1471-244X-9-5
Luck A. J., Morgan J. F., Reid F., O'Brien A., Brunton J., Price C., ..., & Lacey J. H. (2002). The SCOFF Questionnaire and clinical interview for eating disorders in general practice: Comparative study. British Medical Journal, 325, 755-756. http://dx.doi.org/10.1136/bmj.325.7367.755
Macaskill P. (2004). Empirical Bayes estimates generated in a hierarchical summary ROC analysis agreed closely with those of a full Bayesian analysis. Journal of Clinical Epidemiology, 57, 925-932. http://dx.doi.org/10.1016/j. jclinepi.2003.12.019
Mond J. M., Myers T. C., Crosby R. D., Hay P. J., Rodgers B., Morgan J. F., ..., & Mitchell J. E. (2008). Screening for eating disorders in primary care: EDE-Q versus SCOFF. Behaviour Research and Therapy, 46, 612-622. http://dx. doi.org/10.1016/j.brat.2008.02.003
Morande G., Celada J., & Casas J. J. (1999). Prevalence of eating disorders in a Spanish school age population. Journal of Adolescent Health, 24, 212-219. http://dx.doi. org/10.1016/S1054-139X(98)00025-1
Morgan J. F., Reid F., & Lacey J. H. (1999). The SCOFF questionnaire: Assessment of a new screening tool for eating disorders. British Medicine Journal, 319, 1467-1468. http://dx.doi.org/10.1136/bmj.319.7223.1467
Muro-Sans P., Amador-Campos J. A., & Morgan J. F. (2008). The SCOFF-C: Psychometric properties of the Catalan version in a Spanish adolescent sample. Journal of Psychosomatic Research, 64, 81-86. http://dx.doi.org/ 10.1016/j.jpsychores.2007.06.011
Ogg E. C., Millar H. R., Pusztai E. E., & Thom A. S. (1997). General practice consultation patterns preceding diagnosis of eating disorders. International Journal of Eating Disorders, 22, 89-93. http://dx.doi.org/10.1002/(SICI)1098108X(199707)22:1<89::AID-EAT12>3.0.CO;2-D
Pannocchia L., Di Fiorino M., Giannini M., & Vanderlinden J. A. (2011). Psychometric exploration of an Italian translation of the SCOFF questionnaire. European Eating Disorders Review, 19, 371-373. http://dx.doi.org/ 10.1002/erv.1105
Parker S. C., Lyons J., & Bonner J. (2005). Eating disorders in graduate students: Exploring the SCOFF questionnaire as a simple screening tool. Journal of American College Health, 54, 103-107. http://dx.doi.org/10.3200/JACH. 54.2.103-107
Perez-Gaspar M., Gual P., De Irala-Estevez J., Martinez-Gonzalez M. A., Lahortiga F., & Cervera S. (2000). Prevalencia de trastornos de la conducta alimentaria en las adolescentes Navarras. [Prevalence of eating disorders in a representative sample of female adolescents from Navarra-Spain] Medicina Clinica-Barcelona, 114, 481-486. http://dx.doi.org/10.1016/S0025-7753(00)71340-6
Review Manager (2008). Version 5.0. Copenhagen, Denmark: The Nordic Cochrane Centre, The Cochrane Collaboration.
Rojo L., Livianos L., Conesa L., Garcia A., Dominguez A., Rodrigo G., ..., & Vila M. (2003). Epidemiology and risk factors of eating disorders: A two-stage epidemiologic study in a Spanish population aged 12-18 years. International Journal of Eating Disorders, 34, 281-291. http://dx.doi.org/10.1002/eat.10179
Rueda G. E., Diaz L. A., Campo A., Barros J. A., Avila G. C., Orostegui L. T., ..., & Cadena L. P. (2005). Validation of the SCOFF questionnaire for screening of eating disorders in university women. Biomedica, 25, 196-202.
Rueda G. E., Diaz L. A., Ortiz D. P., Pinzon C., & Rodrigues J. (2005). Validation del cuestionario SCOFF para el cribado de los trastornos del comportamiento alimentario en adolescentes escolarizadas. [Validation of the SCOFF questionnaire for screening the eating behavior disorders of adolescents in school] Atencion Primaria, 35, 89-94.
Ruiz-Lazaro P. M., Alonso J. P., Velilla J. M., Lobo A., Martin A., Paumard C., & Calvo A. I. (1998). Estudio de prevalencia de trastornos de la conducta alimentaria en adolescentes de Zaragoza. [Study of the prevalence of eating disorders in adolescents in Zaragoza] Revista de Psiquiatria Infanto-Juvenil, 3, 148-162
SAS Institute Inc. (2008). The SAS System for Windows. Version 9.2. Cary, NC: SAS Institute Inc.
Steinhausen H. C., Winkler C., & Meier M. (1997). Eating disorders in adolescence in a Swiss epidemiological study. International Journal of Eating Disorders, 22, 147-151. http://dx.doi.org/10.1002/(SICI)1098-108X(199709) 22:2<147::AID-EAT5>3.3.CO;2-I
Zamora J., Abraira V., Muriel A., Khan K., & Coomarasamy A. (2006). Meta-Disc: A software for meta-analysis of test accuracy data. BMC Medical Research Methodology, 6. http://dx.doi.org/10.1186/1471-2288-6-31
Zweig M. H., & Campbell G. (1993). Receiver-operating characteristic (ROC) plots: A fundamental evaluation tool in clinical medicine. Clinical Chemistry, 39, 561-577.
The five questions of the SCOFF:
S Do you make yourself Sick because you feel uncomfortably full?
C Do you worry you have lost Control over how much you eat?
O Have you recently lost more than One stone in a three-month period?
F Do you believe yourself to be Fat when others say you are too thin?
F Would you say that Food dominates your Life?
Juan Botella, Ana Rosa Sepulveda, Huiling Huang and Hilda Gambara
Universidad Autonoma de Madrid (Spain)
Table 1. Identification and main characteristics of the 15 studies included in the meta-analysis Study Country Age Berger et al, 2011 Germany (a) 12 (12-12) Caamano et al, 2002 Spain (b) 12 ** (11-13) Cotton, Ball & Robison, 2003 UKC 29 (18-64) Garcia et al., 2010 France (d) 20.9 (18-35) Garcia et al, 2011 France (d) 22.1 (18-35) Garcia-Campayo et al, 2005 Spain (b) 29.2 (15-53) Lahteenmaki et al, 2009 Finland (e) ? (18-29) Fuck et al, 2002 UK (c) ? (18-50) Mond et al, 2008 USA (c) 27.6 (18M0) Morgan, Reid, & Facey, 1999 UK (c) ? (18M0) Muro-Sans, Amador-Campos, & Morgan, 2008 Spain (f) 13.5 (11-17) Pannocchia et al., 2011 Italy (g) ? Parker, Fyons, & Bonner, 2005 USA (c) ? (20-51) Rueda, Diaz, Campo et al, 2005 Colombia (b) 21.2 (17-35) Rueda, Diaz, Ortiz et al, 2005 Colombia (b) 14 (10-19) Study Gender Diagnostic Reference Berger et al, 2011 52.7 EAT-26 [less than or equal to] 20 (I) Caamano et al, 2002 ? EAT-40 [less than or equal to] 30 (I) Cotton, Ball & Robison, 2003 77 Q-EDD1 Garcia et al., 2010 100 MINI & interview DSM-IV (II) Garcia et al, 2011 100 MINI & interview DSM-IV (II) Garcia-Campayo et al, 2005 100 interview SCAN (II) Lahteenmaki et al, 2009 57.7 interview SCID (II) Fuck et al, 2002 100 interview DSM-IV (II) Mond et al, 2008 100 EDE telephone & interview; EDEQ (II) Morgan, Reid, & Facey, 1999 100 interview DSM-IV (II) Muro-Sans, Amador-Campos, & Morgan, 2008 49.8 EDI-2 (3 subscales) (I) Pannocchia et al., 2011 100 EDI-3 (I) Parker, Fyons, & Bonner, 2005 72.4 EDEQ & DSM-IV criteria (I) Rueda, Diaz, Campo et al, 2005 100 interview CIDI (II) Rueda, Diaz, Ortiz et al, 2005 100 interview CIDI (II) Study Sample * TP FP FN TN Berger et al, 2011 64 191 17 535 Caamano et al, 2002 25 32 14 218 Cotton, Ball & Robison, 2003 21 23 6 173 Garcia et al., 2010 8/29/0 35 19 2 363 Garcia et al, 2011 67/45/0 106 6 6 108 Garcia-Campayo et al, 2005 29/47/55 128 4 3 68 Lahteenmaki et al, 2009 7 66 2 466 Fuck et al, 2002 1/3/9 11 34 2 294 Mond et al, 2008 0/5/20 18 33 7 89 Morgan, Reid, & Facey, 1999 68/48/0 116 12 0 84 Muro-Sans, Amador-Campos, & Morgan, 2008 57 195 21 681 Pannocchia et al., 2011 29 8 1 55 Parker, Fyons, & Bonner, 2005 32 16 28 220 Rueda, Diaz, Campo et al, 2005 0/8/43 40 46 11 144 Rueda, Diaz, Ortiz et al, 2005 4/14/54 59 36 13 133 (a) German; (b) Spanish; (c) English; (d) French; (e) Finish; (f) Catalan; (g) Italian. (I) Psychometric test; (II) interview. * The three values are the frequencies of cases diagnosed with the diagnostic reference being AN/BN/EDNOS. ** The mean has been estimated as the average between the higherand lower limits, given that the interval is very narrow. Table 2. Primary studies' characteristics Size of the samples; mean (range) Targets, 58.8 (9-131) Controls, 290 (63-876) Gender (k = 14); mean of the % of 86.4 (49.8-100) women (range) Age (k = 10); mean (range) 20.1 (12-29.2) Type of diagnostic reference Based on interview 9 (60%) Test or questionnaire 6 (40%) Figure 1. Forest plot of the 15 studies included in the meta-analysis. Sensitivity Specificity Berger et al, 2011 0.79 [0.69, 0.86] 0.74 [0.70, 0.77] Caamano et al, 2002 0.64 [0.48, 0.77] 0.87 [0.82, 0.91] Cotton et al, 2003 0.77 [0.59, 0.89] 0.88 [0.83, 0.92] Garcia et al, 2010 0.93 [0.81, 0.98] 0.95 [0.93, 0.97] Garcia et al. 2011 0.94 [0.88, 0.97] 0.94 [0.89, 0.97] Garcia-campayo 2005 0.97 [0.93, 0.99] 0.94 [0.86, 0.97] Lahteenmaki et al, 2009 0.75 [0.44, 0.92] 0.88 [0.84, 0.90] Luck et al, 2002 0.82 [0.56, 0.94] 0.90 [0.86, 0.92] Mond et 0.71 [0.52, 0.85] 0.73 [0.64, 0.80] al, 2008 Morgan et al, 1999 1.00 [0.96, 1.00] 0.87 [0.79, 0.92] Muro-Sans et al, 2008 0.73 [0.62, 0.81] 0.78 [0.75, 0.80] Pannocchia et al, 2011 0.95 [0.81, 0.99] 0.87 [0.76, 0.93] Parker et al, 2005 0.53 [0.41, 0.65] 0.93 [0.89, 0.96] Rueda et al, 2005a 0.78 [0.65, 0.87] 0.76 [0.69, 0.81] Rueda et al, 2005b 0.82 [0.71, 0.89] 0.79 [0.72, 0.84]
|Printer friendly Cite/link Email Feedback|
|Author:||Botella, Juan; Sepulveda, Ana Rosa; Huang, Huiling; Gambara, Hilda|
|Publication:||Spanish Journal of Psychology|
|Date:||Jan 1, 2013|
|Previous Article:||Relapse situations according to Marlatt's Taxonomy in smokers.|
|Next Article:||Confirmatory factor analysis of the generalized self-efficacy scale in Brazil and Portugal.|