A readability checker with supervised learning using deep indicators.Checking for readability or simplicity of texts is important for many institutional and individual users. Formulas for approximately measuring text readability have a long tradition. Usually, they exploit surface-oriented indicators like sentence length, word length, word frequency, etc. However, in many cases, this information is not adequate to realistically approximate the cognitive difficulties a person can have to understand a text. Therefore we use deep syntactic and semantic indicators in addition. The syntactic information is represented by a dependency tree, the semantic information by a semantic network (data) semantic network - A graph consisting of nodes that represent physical or conceptual objects and arcs that describe the relationship between the nodes, resulting in something like a data flow diagram. . Both representations are automatically generated by a deep syntactico-semantic analysis. A global readability score is determined by applying a nearest neighbor See point sampling. algorithm on 3,000 ratings of 300 test persons. The evaluation showed that the deep syntactic and semantic indicators lead to promising results comparable to the best surface-based indicators. The combination of deep and shallow indicators leads to an improvement over shallow indicators alone. Finally, a graphical user interface graphical user interface (GUI) Computer display format that allows the user to select commands, call up files, start programs, and do other routine tasks by using a mouse to point to pictorial symbols (icons) or lists of menu choices on the screen as opposed to having to was developed which highlights difficult passages, depending on the individual indicator values, and displays a global readability score. Keywords: readability, syntactic and semantic analysis Semantic analysis may refer to:
Povzetek: Strojno ucenje z odvisnosmimi drevesi je uporabljeno za ugotavljanje berljivosti besedil. 1 Introduction Readability checkers are used to highlight text passages that are difficult to read. They can help authors to write texts in an easy-to-read style. Furthermore they often display a global readability score which is derived by a readability formula. Such a formula describes the readability of a text numerically. There exists a large amount of readability formulas (13). Most of them use only surface-oriented indicators like word frequency, word length, or sentence length. Such indicators have only indirect and limited access to judging real understandability. Therefore, we use deep syntactic and semantic indicators (1) in addition to surface-oriented indicators. The semantic indicators operate mostly on a semantic network (SN); in contrast, the syntactic indicators mainly work on a dependency tree containing linguistic categories and surface text parts. The SNs and the dependency trees are derived by a deep syntactico-semantic analysis based on word-class functions. Furthermore, we collected a whole range of readability criteria from almost all linguistic levels: morphology, lexicon, syntax, semantics, and discourse (2) (7). To make these criteria operable operable /op·er·a·ble/ (op´er-ah-b'l) subject to being operated upon with a reasonable degree of safety; appropriate for surgical removal. op·er·a·ble adj. , each criterion is underpinned by one or more readability indicators that have been investigated in the (psycho-)linguistic literature and can be automatically determined by NLP (Natural Language Processing) The capability of understanding human language. If the language is spoken, voice recognition plays an important role in converting the sounds to individual words. Then, natural language processing figures out what the words mean. tools (see (11) for details). Two typical readability indicators for the syntactic readability criterion of syntactic ambiguity
Syntactic ambiguity is a property of sentences which may be reasonably interpreted in more than one way, or reasonably interpreted to mean more than one thing. are the center embedding In linguistics, center embedding refers to the process of embedding a phrase in the middle of another phrase of the same type. This often leads to difficulty with parsing which it would not be parsimonious to explain on grammatical grounds. depth of subclauses and the number of argument ambiguities (concerning their syntactic role (3)). 2 Related work There are various methods to derive a numerical representation Numerical representation (computers) Numerical data in a computer are written in basic units of storage made up of a fixed number of consecutive bits. of text readability. One of the most popular readability formulas is the so-called Flesch Reading Ease (4). The formula employs the average sentence length and the average number of syllables for estimating readability. The sentence length is intended to roughly approximate sentence complexity, while the number of syllables approximates word frequency since usually long words are less used. Later on, this formula was adjusted to German (1). Despite of its age, the Flesch formula is still widely used. Also, the revised Dale-Chall readability index (2) mainly depends on surface-oriented indicators. Actually, it is based on sentence length and the occurrences of words in a given list of words which are assumed to be difficult to read. Recently, several more sophisticated approaches which use advanced NLP technology were developed. They determine for instance the embedding depth of clauses, the usage of active/passive voice or text cohesion (17; 9; 21). The methods of (3; 22) go a step beyond pure analysis and also create suggestions for possible improvements. Some approaches, e.g., (20), integrate their readability checkers into a graphical user interface, which is vital for practical usage. [FIGURE 1 OMITTED] As far as we know, all approaches for determining text readability are based on surface or syntactic structures but not on a deep semantic representation which represents the cognitive difficulties for text understanding more adequately. Readability formulas usually combine several so-called readability indicators like sentence or word length by a linear combination. Examples for non-linear approaches are the nearest neighbor approach of Heilman et al. (9) and the employment of support vector machines by Larsson (15) to separate the vectors of indicator values for given texts into the three different readability classes easy, medium, and difficult. A drawback of the latter method is that this classification is rather rough. 3 System architecture A text is processed in several steps (see Figure 1) by our readability checker DeLite (an association of Lite as in light/easy reading and De as in Deutsch/German; there is also a prototype EnLite for English). First, the Controller passes the text to a deep syntactico-semantic analysis (WOCADI (4) parser, (6)), which is based on a word-class functional analysis and is supported by a large semantically oriented lexicon (8). The parser output for each sentence is a morpho-lexical analysis, one or more (in case of ambiguities) syntactic dependency trees, one or more SNs, and intrasentential and intersentential coreferences determined by a hybrid rule-statistical coreference resolution module. The resulting SNs follow the MultiNet formalism (multilayered extended semantic network, (10), example in Figure 2). On the basis of this analysis, the text is divided into sentences, phrases, and words in the Preparation Layer. The individual indicator values are determined by the Calculation Layer. DeLite currently uses 48 morphological, lexical, syntactic, and semantic indicators; below we concentrate on some deep syntactic and semantic ones. Each indicator is attached to a certain processing module depending on the type of required information: words, phrases, sentences, or the entire document. Each module iterates over all objects of its associated type that exist in the text and triggers the calculation of the associated indicators. Examples for indicators operating on the word level are the indicators number of word characters or number of word readings. Semantic and syntactic indicators usually operate on the sentence level. As the result of this calculation step an association from text segments to indicator values is established. In the Evaluation Layer, the values of each indicator are averaged to the so-called aggregated indicator value This term is ambiguous: Ellenberg's indicator values are simple ordinal classes of organisms (initially plants) with a similar realized ecological niche along a gradient. The latest edition of Ellenberg's indicator values contain values on a 9 point scale for soil acidity, . Note that there exists for each indicator only one aggregated indicator value per text. The readability score is then calculated (see Sect. 4) by the k-nearest neighbor algorithm of the machine learning toolkit RapidMiner (18). In contrast to surface-oriented indicators, a deep indicator can usually only be determined for a given sentence (most deep indicators operate on sentences) if certain prerequisites are met (e.g., full parse or chunk parse is available). If this is not the case, the associated sentence is omitted for determining the aggregated indicator value. If an indicator could not be calculated for any sentence of the text at all, its value is set to some fixed constant. Finally, all this information is marked up in XML XML in full Extensible Markup Language. Markup language developed to be a simplified and more structural version of SGML. It incorporates features of HTML (e.g., hypertext linking), but is designed to overcome some of HTML's limitations. and in a user-friendly HTML HTML in full HyperText Markup Language Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web. format and is returned to the calling process by the Exportation Layer. 4 Deriving a readability score using the k-nearest neighbor algorithm A nearest neighbor algorithm is a supervised learning method. Thus, before this method can be applied to new data, a training phase is required. In this phase, a vector of aggregated indicator values is determined by RapidMiner (see Sect. 4) for each text of our readability study comprising 3,000 ratings from 300 users. The vector components are normalized and multiplied by weights representing the importance of the individual indicators where the weights are automatically learned by an evolutionary algorithm. All vectors are stored together with the average user ratings for the associated texts. To derive a readability score for a previously unseen text, the vector of weighted and normalized aggregated indicator values is determined for this text first (see above). Afterwards, the k vectors of the training data with the smallest distance to the former vector are extracted. The readability score is then given as a weighted sum of the user ratings associated with those k vectors. 5 Syntactic indicators 5.1 Clause Center Embedding Depth A sentence is difficult to read if the syntactic structure is very complex (5). One reason for a high complexity can be that the sentence contains deeply embedded subordinate clauses. The difficulty can be increased if the subordinate clause is embedded into the middle of a sentence since the reader has to memorize the superior clause until its continuation after the termination of the subordinate clause (12), for example: Er verliess das Haus, in dem die Frau, die er liebte, wohnte, sofort. (literal translation from German: He left the house where the woman he loved lived immediately.) In contrast to (21), two separate indicators are employed for the embedding depth: one measuring embedding depth in general and one focusing only on center embedding depth which allows it to compare both effects. In our experiments only center embedding depth was considerably correlated to the readability ratings from the participants. Center embedding depth is calculated for each main verb in the following way. First, we determine the path from the root of the dependency tree to each main verb. Then, we count the occurrences of the dependency relations for relative or other subordinated clauses on this path. However, they are only taken into account if the embedded clause is not located on the border of the superior clause. 5.2 Distance between Verb and Separable sep·a·ra·ble adj. Possible to separate: separable sheets of paper. sep Prefix In German, so-called separable prefix verbs are split into two words in clauses with main clause word order, for example einladen (invite) [??] Er ladt ... ein. (He invites...). If the verb is far away from the verb prefix, it can be difficult for readers to associate both parts. 5.3 Number of Words per Nominal Phrase According to (19), long NPs degrade readability. Hence, some information from the long NP should better be placed in a subordinate clause or a new sentence. Therefore we count the average number of words contained in an NP where a larger number results in a worse readability score. Note that we only consider maximal NPs (i.e., NPs not contained in a larger NP); otherwise a large indicator value for the long NP could be compensated by small indicator values for the contained NPs which should be avoided. 6 Semantic indicators 6.1 SN Quality An incomplete parse from WOCADI is mainly caused by syntactic or semantic defects of the sentence since the parser builds the syntactic structure as a dependency tree and the semantic representation as an SN in parallel. Therefore, the indicator SN quality is a mixed one: semantic and syntactic. Consider for instance the two sentences Das Werk kam vor allem bei jungen Theatergangern an. Schulbusse reisten an, um es sich anzusehen. (5) (The work was very well accepted by young visitors of the theater. School buses arrived to watch it.) The second sentence, which is syntactically correct, is semantically incorrect and therefore difficult to read. The semantic lexicon employed by the parser requires that the first argument (which plays the semantic role of the agent) of ansehen.1.1 (6) (to watch) is of type human. Thus, this sentence is rejected by the parser as incorrect. In other cases the sentence might be accepted but considered as semantically improbable. This information, which is provided by the parser, is used by DeLite and turned out to be very valuable for estimating text readability. Three parse result types are differentiated: complete parse (around 60% of the sentences; note that this means complete syntactic structure and semantic representation at the same time), chunk parse (25%), failure (15%). (7) These three cases are mapped to different numerical values of the indicator SN quality. Additionally, if a full parse or a chunk parse is available, the parser provides a numerical value specifying the likelihood that the sentence is semantically correct which is determined by several heuristics. This information is incorporated into the quality score of this indicator too. (8) 6.2 Number of Propositions per Sentence DeLite also looks at the number of propositions in a sentence. More specifically, all SN nodes are counted which have the ontological sort si(tuation) (10, p. 412) or one of its subsorts. In a lot of cases, readability can be judged more accurately by the number of propositions than by sentence length of similar surface-oriented indicators. Consider for instance a sentence containing a long list of NPs: Mr. Miller, Dr. Peters, Mr. Schmitt, Prof Kurt, ... were present. Although this sentence is quite long it is not difficult to understand (14). In contrast, short sentences can be dense and contain many propositions, e.g., concisely expressed by adjective or participle par·ti·ci·ple n. A form of a verb that in some languages, such as English, can function independently as an adjective, as the past participle baked in We had some baked beans, clauses. [FIGURE 2 OMITTED] 6.3 Number of Connections between SN Nodes/Discourse Entities The average number of nodes which are connected to an SN node is determined. A large value often indicates a lot of semantic dependencies. For this indicator, the arcs leading to and leaving from an SN node are counted. Note that the evaluation showed that better results (stronger correlation and higher weight) are achieved if only SN nodes are regarded which are assigned the ontological sort object (10, p. 409-411). These SN nodes roughly represent the discourse entities of a sentence. 6.4 Length of Causal and Concessive con·ces·sive adj. 1. Of the nature of or containing a concession. 2. Grammar Expressing concession, as the conjunction though. Chains Argumentation is needed to make many texts readable. But if an author puts too many ideas in too few words, the passage becomes hard to read. For example, the following sentence from a newspaper corpus has been automatically identified as pathologic because it contains three causal relations (CAUS CAUS Causative CAUS Citizens Against UFO Secrecy (Peter Gersten, Esq, Founder) CAUS Color Association of the United States CAUS College of Architecture & Urban Studies (Virginia Tech) and CSTR CSTR Centre for Speech Technology Research CSTR Canister CSTR Continually Stirred Tank Reactor CSTR Center for Software Testing Research (Florida Tech) CSTR Combat System Trial Rehearsal (US DoD) in Figure 2) chained together: Das konnte bewirken, dass der Fahrer aus Angst vor den Nachbarn die Geschwindigkeit reduziert. (This could achieve that the driver reduces the speed for fear of the neighbors.). Again, length measurements on the surface will not help to detect the readability problem, which exists for at least some user groups. Splitting such a sentence into several ones is a way out of too dense argumentation. 7 Evaluation To judge the viability of our approach, we conducted an online readability study with 500 texts, more than 300 participants, and around 3,000 human ratings for individual texts. The participants rated the text readability on a 7 point Likert scale Likert scale A subjective scoring system that allows a person being surveyed to quantify likes and preferences on a 5-point scale, with 1 being the least important, relevant, interesting, most ho-hum, or other, and 5 being most excellent, yeehah important, etc (16). Almost 70% of the participants were between 20 and 40 years old; the number of participants over 60 was very small (3 %). The participants were mainly well-educated. 58 % of them owned a university or college degree. There is none who had no school graduation at all. Our text corpus originated from the municipal domain and differs significantly from newspaper corpora corpora plural form of corpus. corpora albicantia see corpus albicans. corpora arenacea sandy or gritty bodies, found in the pineal body; appear to be of glial or stromal origin; have the structure of , which are widely used in computational linguistics. It contains a lot of ordinances with legal terms and abbreviations, e.g., g 65 Abs. I Satz l Nr. 2 i. V.m. g 64 Abs. 1 Satz 2 LWG LWG Left Waist Gunner LWG Logistics Working Group LWG Light-Weight Group LWG Liquid Water in Grams LWG Laminated Waveguide LWG Life Without God LWG Lightweight Gun LWG Loverdan Web Gestion (web advertising company) LWG Lethality Working Group NRW NRW Nordrhein-Westfalen (German Federal State; Capital Düsseldorf) NRW Non-Revenue Water NRW Northern Right Whale NRW Nicolson-Ross-Weir (measurement technique) NRW Nonradioactive Waste (section 65.1.1 (2) in connection with section 64.1.2 LWG NRW). This corpus has been chosen because local administrations in Germany have committed themselves to make their web sites accessible; one central aspect of accessibility is simple language. Figure 4 shows the mean absolute error (MAE (1) (Metropolitan Area Exchange) Originally known as Metropolitan Area Ethernets, MAEs are junction points on the Internet where data is exchanged between carriers. See IXP and NAP. ) and the root mean square error (RMSE) of DeLite's global readability score in contrast to the average user rating determined by a 10 fold cross-validation over all 500 test documents. The ordinate ordinate: see Cartesian coordinates. (mathematics) ordinate - The y-coordinate on an (x,y) graph; the output of a function plotted against its input. x is the "abscissa". See Cartesian coordinates. contains MAE and RMSE, the abscissa abscissa: see Cartesian coordinates. (mathematics) abscissa - The horizontal or x coordinate on an (x, y) graph; the input of a function against which the output is plotted. The vertical or y coordinate is the "ordinate". See Cartesian coordinates. , on a logarithmic scale, the number of neighbors used. The lowest errors (MAE: 0.122, RMSE: 0.148) and highest correlation (0.528) were obtained with 30 nearest neighbors. The nearest neighbor algorithm determined the weights of each indicator using an evolutionary algorithm. The resulting indicator weights, in the case all indicators are used simultaneously, are given in Table 1. The correlations of the indicators in comparison with the user ratings are displayed in Table 2. Correlation and weights of deep syntactic and semantic indicators turned out to be quite comparable to surface-oriented indicators. [FIGURE 3 OMITTED] Finally as a baseline, DeLite was compared to the readability index resulting from employing the nearest neighbor approach only on the indicators of the Flesch readability index, i.e. average sentence length and number of syllables per word. The correlation of DeLite with the user ratings is 0.528, which clearly outperforms the Flesch indicators (0.432). 8 User interface Besides a low-level server interface, DeLite provides a graphical user interface for comfortable usage. In Figure 3, a screenshot See screen shot. of this interface is shown. (9) The types of readability problems found in the text are displayed on the right side. If the user clicks on such a type, the associated difficult-to-read text segments are highlighted. Additional support for the user is provided if he/she wants to have more information about the readability problem. By moving the mouse pointer over the highlighted text segment, a fly-over help text with a more detailed description is displayed. Moreover, if the user clicks on the highlighted segment, additional text segments are marked in bold face. These additional segments are needed to fully describe and explain specific readability problems. Figure 3 shows the readability analysis of a verb which is too far away from its separable prefix (see Sect. 5.2). The prefix ein- is highlighted as problematic and additionally the main verb ladt is marked in bold face for better understanding. [FIGURE 4 OMITTED] 9 Conclusion An overview of some typical examples of deep syntactic and semantic readability indicators has been given. In our evaluation, it turned out that these indicators have weights and correlations comparable to the best surface-based indicators in accurately judging readability. In the future, the parser employed in DeLite will be continually improved. Currently, DeLite is only a diagnosis tool; we will investigate how DeLite can propose reformulations for improving readability. Finally, the automatic distinction between real ambiguities that exist for humans and spurious ambiguities that exist only for machines (e.g., NLP methods like PP attachment and interpretation) must be sharpened. Deep syntactic and semantic indicators turned out to be quite valuable for assessing readability and are expected to be a vital part of future readability checkers. Acknowledgments We wish to thank our colleagues Christian Eichhom, Ingo Glockner, Johannes Leveling, and Rainer Osswald for their support for this work. The research reported here was in part funded by the EU project Benchmarking Tools and Methods for the Web (BenToWeb, FP6-004275). Received: May 11, 2008 References [1] Amstad, T. (1978). Wie verstandlich sind unsere Zeitungen? Ph.D. thesis, Universitat Zurich, Zurich, Switzerland. [2] Chall, J. and Dale, E. (1995). Readability Revisited: The New Dale-Chall Readability Formula. Brookline, Massachusetts: Brookline Books. [3] Chandrasekar, R. and Srinivas, B. (1996). Automatic induction of rules for text simplification. Technical Report IRCS IRCS Institute for Research in Cognitive Science (University of Pennsylvania) IRCS Indian Red Cross Society IRCS Infrared Camera and Spectrograph IRCS International Radio Call Sign IRCS IRC-Over-SSL IRCS Improved Radar Calibration System Report 96-30, University of Pennsylvania (body, education) University of Pennsylvania - The home of ENIAC and Machiavelli. http://upenn.edu/. Address: Philadelphia, PA, USA. , Philadelphia, Pennsylvania. [4] Flesch, R. (1948). A new readability yardstick. Journal of Applied Psychology Journal of Applied Psychology is a publication of the APA. It has a high impact factor for its field. It typically publishes high quality empirical papers. www.apa. , 32:221-233. [5] Groeben, N. (1982). Leserpsychologie: Textverstandnis--Textverstandlichkeit. Munster, Germany: Aschendorff. [6] Hartrumpf, S. (2003). Hybrid Disambiguation dis·am·big·u·ate tr.v. dis·am·big·u·at·ed, dis·am·big·u·at·ing, dis·am·big·u·ates To establish a single grammatical or semantic interpretation for. in Natural Language Analysis. Osnabruck, Germany: Der Andere Verlag. [7] Hartrumpf, S.; Helbig, H.; Leveling, J.; and Osswald, R. (2006). An architecture for controlling simple language in web pages, eMinds: International Journal on Human-Computer Interaction, 1(2):93-112. [8] Hartrumpf, S.; Helbig, H.; and Osswald, R. (2003). The semantically based computer lexicon HaGenLex--Structure and technological environment. Traitement automatique des langues, 44(2):81-105. [9] Heilman, M. J.; Collins-Thompson, K.; Callan, J.; and Eskenazi, M. (2007). Combining lexical and grammatical features to improve readability measures for first and second language texts. In Proceedings of the Human Language Technology Conference. Rochester, New York This article is about the city of Rochester in Monroe County. For the town in Ulster County, see Rochester, Ulster County, New York. Rochester, once known as The Flour City, and more recently as The Flower City or . [10] Helbig, H. (2006). Knowledge Representation and the Semantics of Natural Language. Bedin, Germany: Springer. [11] Jenge, C.; Hartrumpf, S.; Helbig, H.; Nordbrock, G.; and Gappa, H. (2005). Description of syntactic-semantic phenomena which can be automatically controlled by NLP techniques if set as criteria by certain guidelines. EU-Deliverable 6.1, FernUniversitat in Hagen. [12] Kimball, J. (1973). Seven principles of surface structure parsing in natural language. Cognition, 2:15-47. [13] Klare, G. (1963). The Measurement of Readability. Ames, Iowa: Iowa State University Academics ISU is best known for its degree programs in science, engineering, and agriculture. ISU is also home of the world's first electronic digital computing device, the Atanasoff–Berry Computer. Press. [14] Langer, I.; von Thun, F. S.; and Tausch, R. (1981). Sich verstandlich ausdrucken. Munchen, Germany: Reinhardt. [15] Larsson, P. (2006). Classification into Readability Levels. Master's thesis, Department of Linguistics Noun 1. department of linguistics - the academic department responsible for teaching and research in linguistics linguistics department academic department - a division of a school that is responsible for a given subject and Philology phi·lol·o·gy n. 1. Literary study or classical scholarship. 2. See historical linguistics. [Middle English philologie, from Latin philologia, love of learning , University Uppsala, Uppsala, Sweden. [16] Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology, 140:1-55. [17] McCarthy, P.; Lightman, E.; Dufty, D.; and McNamara, D. (2006). Using Coh-Metrix to assess distributions of cohesion and difficulty: An investigation of the structure of high-school textbooks. In Proc. of the Annual Meeting of the Cognitive Science Society. Vancouver, Canada. [18] Mierswa, I.; Wurst, M.; Klinkenberg, R.; Scholz, M.; and Euler, T. (2006). Yale: Rapid prototyping for complex data mining tasks. In Proc. of KDD KDD Knowledge Discovery and Data Mining (International Conference) KDD Knowledge Discovery in Databases KDD Kokusai Denshin Denwa (Japan) KDD Key Distribution Device . Philadelphia, Pennsylvania. [19] Miller, G. (1962). Some psychological studies of grammar. American Psychologist, 17:748-762. [20] Rascu, E. (2006). A controlled language approach to text optimization in technical documentation. In Proc. of KONVENS 2006, pp. 107-114. Konstanz, Germany. [21] Segler, T. M. (2007). Investigating the Selection of Example Sentences for Unknown Target Words in ICALL ICALL Intelligent Computer-Aided Language Learning ICALL Informational-Communicative Assistance and Lobbing League Reading Texts for L2 German. Ph.D. thesis, School of Informatics, University of Edinburgh (body, education) University of Edinburgh - A university in the centre of Scotland's capital. The University of Edinburgh has been promoting and setting standards in education for over 400 years. , Edinburgh, UK. [22] Siddharthan, A. (2003). Syntactic simplification and text cohesion. Ph.D. thesis, Computer Laboratory, University of Cambridge, Cambridge, UK. Tim vor der Bruck, Sven Hartrumpf, Hermann Helbig Intelligent Information and Communication Systems (IICS IICS International Interactive Communications Society IICS International Institute for Christian Studies IICS International Ion Chromatography Symposium IICS Istanbul International Community School (Turkey) ) FernUniversitat in Hagen, 58084 Hagen, Germany E-mail: firstname.lastname@fernuni-hagen.de (1) an indicator is called deep if it requires a deep syntactico-semantic analysis. (2) In this article, discourse criteria are subsumed under the heading semantic because they form only a small group and rely directly on semantic information. (3) Such ambiguities can occur in German because of its relatively free constituent order. (4) WOCADI is the abbreviation of Word-Class based Disambiguating. (5) from the newspaper Schleswig-Holstein am Sonntag, 2007 (6) Note that the readings of a lexeme (grammar) lexeme - A minimal lexical unit of a language. Lexical analysis converts strings in a language into a list of lexemes. For a programming language these word-like pieces would include keywords, identifiers, literals and punctutation. are distinguished by numerical suffixes. (7) The absence of a complete parse is problematic only for a part of the indicators, mainly deep syntactic and semantic ones. And even for some of these indicators, one can define fallback fall·back n. 1. a. Something to which one can resort or retreat. b. A retreat. 2. Computer Science strategies to approximate indicator values by using partial results (chunks). (8) Naturally, this indicator depends strongly on the applied parser. A different parser might lead to quite different results. (9) Note that the classification of indicators is slightly different in the screenshot than in this article. This is caused by the fact that we want to evaluate surface-oriented indicators in comparison to linguistically informed indicators. Table 1: Indicators with largest weights in our readability function (Syn=syntactic, Sem=semantic, and Sur=surface indicator type). Indicator Weight Type Number of words per sentence 0.679 Sur Passive without semantic agent 0.601 Syn/Sem Number of word readings 0.520 Sem Distance between verb and comple- 0.518 Syn ment SN quality 0.470 Syn/Sem Number of connections between dis- 0.467 Sem course entities Inverse concept frequency 0.453 Sem Clause center embedding depth 0.422 Syn Number of sentence constituents 0.406 Syn Maximum path length in the SN 0.395 Sem Number of causal relations in a chain 0.390 Sem Number of compound simplicia 0.378 Sur ... ... ... Word form frequency 0.363 Sur ... ... ... Number of connections between SN 0.326 Sem nodes Table 2: Indicators most strongly correlated with user ratings (Syn=syntactic, Sem=semantic, and Sur=surface indicator type). Indicator Corr. Type Number of words per sentence 0.430 Sur SN quality 0.399 Syn/Sem Inverse concept frequency 0.330 Sem Word form frequency 0.262 Sur Number of reference candidates for a 0.209 Sem pronoun Number of propositions per sentence 0.180 Sem Clause center embedding depth 0.157 Syn Passive without semantic agent 0.155 Syn/Sem Number of SN nodes 0.148 Sem Pronoun without antecedent 0.140 Sem Number of causal relations in a chain 0.139 Sem Distance between pronoun and 0.138 Sem antecedent Maximum path length in the SN 0.132 Sem Number of connections between 0.132 Sem discourse entities |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion