More diseases tracked by using Google Trends.

To the Editor: The idea that populations provide data on their influenza status through information-seeking behavior on the Web has been explored in the United States in recent years (1,2). Two reports showed that queries to the Internet search engines Yahoo and Google could be informative for influenza surveillance (2,3). Ginsberg et al. scanned the Google database and found that the sum of the results of 45 queries that most correlated with influenza incidences provided the best predictor of influenza trends (3). On the basis of trends of Google queries, these authors put their results into practice by creating a Web page dedicated to influenza surveillance. However, they did not develop the same approach for other diseases. To date, no studies have been published about the relationship of search engine query data with other diseases or in languages other than English.

We compared search trends based on a list of Google queries related to 3 infectious diseases (influenza-like illness, gastroenteritis, and chickenpox) with clinical surveillance data from the French Sentinel Network (4). Queries were constructed through team brainstorming: each participant listed queries likely to be used for searching information about these diseases on the Web. The query time series from January 2004 through February 2009 for France were downloaded from Google Insights for Search, which, together with Google Trends, is 1 of the 2 websites that enable downloading search trends from the Google database (5). Pearson correlation coefficients (ρ) between these series and the weekly incidence rates (no. cases/100,000 inhabitants) of the 3 diseases provided by the Sentinel Network were calculated for different lag periods.
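The lagged correlation described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the lag range, and the weekly values are all assumptions made for the example, and the sign convention (a positive lag means the query series trails the incidence series) is chosen here for convenience.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def lagged_correlation(queries, incidence, lag):
    """Correlate incidence[t] with queries[t + lag].

    lag > 0: the query series trails incidence by `lag` weeks.
    lag < 0: the query series leads incidence.
    """
    if lag >= 0:
        q, i = queries[lag:], incidence[:len(incidence) - lag]
    else:
        q, i = queries[:lag], incidence[-lag:]
    return pearson(q, i)


def best_lag(queries, incidence, max_lag=3):
    """Return (lag, rho) with the highest correlation over -max_lag..max_lag."""
    results = {lag: lagged_correlation(queries, incidence, lag)
               for lag in range(-max_lag, max_lag + 1)}
    lag = max(results, key=results.get)
    return lag, results[lag]


# Made-up weekly values for illustration only (not Sentinel or Google data).
incidence = [10, 12, 30, 80, 150, 90, 40, 15, 11, 10]
queries = [5, 5, 6, 14, 41, 74, 46, 19, 8, 6]
print(best_lag(queries, incidence))
```

Trimming both series to their overlapping weeks before correlating keeps each pair of values aligned on the intended offset; with short series, large lags leave few points and the coefficient becomes unstable.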

The highest correlation with influenza-like illness was obtained with the query grippe -aviaire -vaccin, the French words for influenza, avian, and vaccine, respectively (ρ = 0.82, p<0.001). The minus sign removed queries that contained the terms avian or vaccine. Use of the query word grippe alone resulted in a lower correlation (ρ = 0.34, p<0.001). The high double peak in 2005-2006 and the smaller peaks preceding annual epidemics observed with the query word grippe alone were decreased by this specification. However, the unusual double-peak shape of the 2005-2006 epidemic remained (online Appendix Figure, panel A, available from www. htm).

The highest correlation with acute diarrhea was obtained when we searched for the French word for gastroenteritis (ρ = 0.90, p<0.001). Various spellings were used to account for the presence/absence of an accent or a hyphen. The Google database was searched for gastro-enterite + gastroenterite + gastroenterite + gastroenterite + (gastro enterite) + (gastro enterite). The + sign coded for "or," enabling searches for queries containing ≥1 of the terms. The second highest correlation was obtained when the keyword gastro was used (ρ = 0.88, p<0.001) (online Appendix Figure, panel B). The highest correlation with chickenpox was obtained with the French word for chickenpox (varicelle) (ρ = 0.78, p<0.001) (online Appendix Figure, panel C).
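To make the minus and + operators concrete, the sketch below applies them to individual search strings. It is a simplified, hypothetical matcher written for illustration (whole-word comparison, parentheses treated as "all of these words"); it is an assumption about the semantics described in the text and does not reproduce how Google actually evaluates queries.

```python
def parse_query(query):
    """Split a query into alternatives of (required words, excluded words)."""
    alternatives = []
    for part in query.split("+"):           # "+" separates "or" alternatives
        words = part.replace("(", " ").replace(")", " ").lower().split()
        required = {w for w in words if not w.startswith("-")}
        excluded = {w[1:] for w in words if w.startswith("-")}
        if required:                         # ignore empty alternatives
            alternatives.append((required, excluded))
    return alternatives


def matches(query, search):
    """True if `search` satisfies at least one alternative of `query`."""
    words = set(search.lower().split())
    return any(required <= words and not (excluded & words)
               for required, excluded in parse_query(query))


# Example search strings are invented for illustration.
print(matches("grippe -aviaire -vaccin", "symptomes grippe"))
print(matches("grippe -aviaire -vaccin", "grippe aviaire"))
print(matches("gastro-enterite + gastroenterite + (gastro enterite)",
              "gastro enterite enfant"))
```

Under these assumptions, a search is counted when any one alternative has all of its words present and none of its excluded words, which is how a single query can cover several spellings while filtering out unrelated topics.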

A time lag of 0 weeks gave the highest correlations between the best queries for influenza-like illness and acute diarrhea and the incidences of these diseases; the peak of the time series of Google queries occurred at the same time as that of the disease incidences. The best query for chickenpox had a 1-week lag, i.e., was 1 week behind the incidence time series.
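One simple way to read these lag results is to compare the weeks at which each series peaks. The sketch below does only that; it is an illustration with invented numbers and hypothetical function names, not the correlation-based lag estimate used in the letter.

```python
def peak_week(series):
    """Index of the week with the largest value (the first one if tied)."""
    return max(range(len(series)), key=series.__getitem__)


def peak_offset(queries, incidence):
    """Weeks by which the query peak trails (+) or leads (-) the incidence peak."""
    return peak_week(queries) - peak_week(incidence)


# Made-up weekly values for illustration only.
incidence = [3, 8, 20, 35, 22, 9, 4]
queries_same_week = [2, 5, 14, 30, 18, 6, 3]
queries_one_week_late = [1, 2, 5, 14, 30, 18, 6]

print(peak_offset(queries_same_week, incidence))
print(peak_offset(queries_one_week_late, incidence))
```

A peak comparison uses a single point per series, so it is more sensitive to noise than a correlation computed over the whole time series.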

In conclusion, for each of 3 infectious diseases, 1 well-chosen query was sufficient to provide time series of searches highly correlated with incidence. We have shown the utility of Internet search engine query data for surveillance of acute diarrhea and chickenpox in a non-English-speaking country. Thus, the ability of Internet search engine query data to predict influenza in the United States presented by Ginsberg et al. (3) appears to have a broader application for surveillance of other infectious diseases in other countries.

This study was supported by the Institut National de la Sante et de la Recherche Medicale.

Camille Pelat, Clement Turbelin, Avner Bar-Hen, Antoine Flahault, and Alain-Jacques Valleron

Author affiliations: Institut National de la Sante et de la Recherche Medicale, Paris, France (C. Pelat, C. Turbelin, A.-J. Valleron); Universite Pierre et Marie Curie-Paris 6, Paris (C. Pelat, C. Turbelin, A.-J. Valleron); Universite Paris Descartes, Paris (A. Bar-Hen); and Ecole des Hautes Etudes en Sante Publique, Paris (A. Flahault)

DOI: 10.3201/eid1508.090299


1. Eysenbach G. Infodemiology: tracking flu-related searches on the web for syndromic surveillance. AMIA Annu Symp Proc. 2006:244-8.

2. Polgreen PM, Chen Y, Pennock DM, Nelson FD. Using internet searches for influenza surveillance. Clin Infect Dis. 2008;47:1443-8. DOI: 10.1086/593098

3. Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature. 2009;457:1012-4. DOI: 10.1038/nature07634

4. Valleron AJ, Bouvet E, Garnerin P, Menares J, Heard I, Letrait S, et al. A computer network for the surveillance of communicable diseases: the French experiment. Am J Public Health. 1986;76:1289-92. DOI: 10.2105/AJPH.76.11.1289

5. Google insights for search, 2009 [cited 2009 Feb 27]. Available from http://www.

Address for correspondence: Camille Pelat, Institut National de la Sante et de la Recherche Medicale, Unite Mixte de Recherche S 707, Faculte de Medecine Pierre et Marie Curie, Site Saint-Antoine, Porte 807, 27 rue Chaligny, 75571 Paris CEDEX 12, France; email: pelat@
COPYRIGHT 2009 U.S. National Center for Infectious Diseases
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2009 Gale, Cengage Learning. All rights reserved.

Article Details
Title Annotation:LETTERS
Author:Pelat, Camille; Turbelin, Clement; Bar-Hen, Avner; Flahault, Antoine; Valleron, Alain-Jacques
Publication:Emerging Infectious Diseases
Article Type:Letter to the editor
Geographic Code:1USA
Date:Aug 1, 2009