NIST HOSTS TOPIC DETECTION AND TRACKING (TDT) WORKSHOP ON TEXT ORGANIZATION.NIST (National Institute of Standards & Technology, Washington, DC, www.nist.gov) The standards-defining agency of the U.S. government, formerly the National Bureau of Standards. It is one of three agencies that fall under the Technology Administration (www.technology. hosted the third annual TDT Evaluation Workshop in November 2000. The workshop involved 30 academic and corporate researchers who participated in the NIST-administered TDT evaluation that occurred during the fall of 2000. The workshop followed the NIST Text Retrieval Conference For other uses of "TREC", see TREC. The Text REtrieval Conference (TREC) is an on-going series of workshops focusing on a list of different information retrieval (IR) research areas, or tracks. (TREC TREC Texas Real Estate Commission TREC Text Retrieval Conference TREC Technique de Randonnée Equestre de Compétition TREC Tropical Research and Education Center TREC T-cell Receptor Excision Circle TREC Teachers and Researchers Exploring and Collaborating ), a related conference in the text retrieval research field, also hosted by the division. The TDT program develops technologies that search, organize, and structure news-oriented textual materials from a variety of broadcast news media in both the English and Mandarin languages the spoken or colloquial language of educated people in China. See also: Mandarin . The research-driven program uses controlled laboratory simulations of hypothetical systems to test the efficacy of potential technologies to access the continuously flowing information that is available from news-producing entities. In earlier studies reported at last years TDT meeting, research showed that it was possible to track and organize events in news data even though the data was multilingual mul·ti·lin·gual adj. 1. Of, including, or expressed in several languages: a multilingual dictionary. 2. . The TV and radio broadcasts were transcribed using automatic speech recognition, and the Mandarin text was converted to English using COTS (Commercial Off-The-Shelf) Refers to ready-made merchandise that is available for sale. See MOTS. (software) COTS - commercial off-the-shelf. See commercial software. Mandarin-to-English software. The focus of TDT changed in 2000 to the development of core techniques that organize news data. A second shift was to bring more of the real world into the evaluation by requiring TDT systems to operate on broadcast news audio data that has been transcribed, translated, and segmented into stories without human intervention. The Web site is http:llwww.nist.gov/TDT. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion