SI-PRON pronunciation Lexicon: a new language resource for Slovenian.We present the efforts involved in designing SI-PRON, a comprehensive machine-readable pronunciation lexicon for Slovenian. It has been built from two sources and contains all the lemmas from the Dictionary of Standard Slovenian (SSKJ SSKJ Slovar Slovenskega Knjižnega Jezika (standard Slovenian dictionary) ), the most frequent inflected word forms found in contemporary Slovenian texts, and a first pass of inflected word forms derived from SSKJ lemmas. The lexicon file contains the orthography, corresponding pronunciations, lemmas and morphosyntactic descriptors of lexical entries in a format based on requirements deigned by the W3C Voice Browser Activity. The current version of the SI-PRON pronunciation lexicon contains over 1.4 million lexical entries. The word list determination procedure, the generation and validation of phonetic transcriptions, and the lexicon format are described in the paper. Along with Onomastica, SI-PRON presents a valuable language resource for linguistic studies and research of speech technologies for Slovenian. The lexicon is already being used by the Proteus Slovenian text-to-speech synthesis system and for generating audio samples of the SSKJ headwords.
Povzetek: Clanek opisuje novjezikovni vir za slovenscino, slovar izgovarjav SI-PRON.
Keywords: language resources, pronunciation lexicon, PLS See playlist.
Consistent specification of word pronunciation is critical to the success of many speech technology applications. Most state-of-the-art Automatic Speech Recognition (ASR (Automatic Speech Recognition) Using voice recognition to replace keypad entry for telephone voice menus. Typically used to speak the digits 0 through 9 insted of keying them, ASR systems may be able to recognize a limited vocabulary. See voice recognition and AVSR. ) and Text-To-Speech (TTS (1) See text-to-speech.
(2) (Transaction Tracking System) Software that monitors a transaction until completion. In the event of a hardware or software failure, it ensures that the database is brought back to its former state before the attempt to ) systems rely on lexicons, which contain pronunciation information for many words. To provide for a maximum coverage of the words, multi-word expressions or even phrases, which commonly occur in a given application-domain, application-specific word or phrase pronunciations may be required, especially for application-specific proper nouns, such as personal names or location names.
Several guidelines have been reported to define the structure of a pronunciation lexicon, ranging from simple two-column ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. lexicons providing the mapping between graphemic and phonemic pho·ne·mic
1. Of or relating to phonemes.
2. Of or relating to phonemics.
3. Serving to distinguish phonemes or distinctive features. transcriptions, to more general de-facto standards and new standardization attempts, which are also handling multiple orthographies and multiple pronunciations.
The ISO-TC37 initiative, which started at LREC LREC Language Resources and Evaluation Conference
LREC Local Research Ethics Committee (United Kingdom)
LREC Louisiana Real Estate Commission
LREC Lake Region Electric Cooperative
LREC Leicester Racial Equality Council 2002, initiated work on a family of ISO standards This is a list of ISO standards that are discussed in Wikipedia articles. For a list of all the more than 16,000 ISO standards (as of 2007), see the ISO Catalogue.
About 300 of the standards produced by ISO and IEC's Joint Technical Committee 1 (JTC1) have been made freely/publicly related to natural language processing Natural language processing
Computer analysis and generation of natural language text. The goal is to enable natural languages, such as English, French, or Japanese, to serve either as the medium through which users interact with computer systems such as (Romary et al., 2006). Currently these standards are available in working drafts of high-level specifications for word segmentation, feature structures, annotations, and also for lexicons. The high-level specifications build on lower-level specifications in form of language and country codes, data categories, code scripts, and Unicode. Lexicon specifications are covered by the "Lexical Markup Framework" under ISO (1) See ISO speed.
(2) (International Organization for Standardization, Geneva, Switzerland, www.iso.ch) An organization that sets international standards, founded in 1946. The U.S. member body is ANSI. 24613 (Romary et al., 2006). The same description structure in terms of morphology, syntax and semantics (and translation) applies to monolingual mon·o·lin·gual
Using or knowing only one language.
mono·lin up to multilingual lexicons. Multi-word expressions are given special attention.
Another initiative, the W3C Voice Browser Activity, has recently issued a last-call working draft of the Pronunciation Lexicon Specification The Pronunciation Lexicon Specification (PLS) is a W3C Specification, which is designed to enable interoperable specification of pronunciation information for both speech recognition and speech synthesis engines within voice browsing applications. (PLS) Version 1.0 (W3C PLS Version 1.0, 2006), which is expected to be soon submitted as a W3C candidate recommendation. The PLS document was designed to enable interoperable specification of pronunciation information for both ASR and TTS engines within voice browsing applications. The mark-up language allows one or more pronunciations for a word or phrase to be specified using a standard pronunciation alphabet or if necessary using vendor specific alphabets. Pronunciations are grouped together into the PLS document which may be referenced from other markup languages, such as the Speech Recognition Gramrnar Specification (SRGS SRGS Speech Recognition Grammar Specification
SRGS Stimulated Raman Gain Spectroscopy
SRGS Survivable Relay Ground Stations ) and the Speech Synthesis Markup Language Speech Synthesis Markup Language (SSML) is an XML-based markup language for speech synthesis applications. It is a recommendation of the W3C's voice browser working group. SSML is often embedded in VoiceXML scripts to drive interactive telephony systems. (SSML SSML Speech Synthesis Markup Language ).
The Pronunciation Lexicon Markup Language markup language
Standard text-encoding system consisting of a set of symbols inserted in a text document to control its structure, formatting, or the relationship among its parts. The most widely used markup languages are SGML, HTML, and XML. , based on PLS, is designed to allow open, portable specification of pronunciation information for speech recognition and speech synthesis speech synthesis
Generation of speech by artificial means, usually by computer. Production of sound to simulate human speech is referred to as low-level synthesis. High-level synthesis deals with the conversion of written text or symbols into an abstract representation of engines. The language is intended to be easy to use by developers while supporting the accurate specification of pronunciation information for international use.
The LC-STAR project consortium published another set of recommendations for speech technology lexicons, with an emphasis on application in machine translation, speech recognition and speech synthesis (Shamas & van den Heuvel, 2004; Fersoe et al., 2004). A Slovenian lexicon, produced at the University of Maribor The University of Maribor (Slovenian Univerza v Mariboru) is the second university in Slovenia, established in 1961. It currently has 15 faculties. Organization
These are the 15 faculties in which the university is divided into:
In Slovenian, lexical stress can be located on almost any syllable and it obeys hardly any rules. The stressed syllable in Slovenian may form the ultimate, the penultimate or the preantepenultimate syllable of a polysyllabic word Noun 1. polysyllabic word - a word of more than three syllables
word - a unit of language that native speakers can identify; "words are the blocks from which sentences are made"; "he hardly said ten words all morning" . Speakers of Slovenian have to learn lexical stress positions along with learning the language. As a consequence, a pronunciation lexicon that indicates lexical stress positions for as many Slovenian words as possible is crucial for the development of speech technology applications and linguistic research. Such a lexicon can be used either in its full-blown form or as a training material for machine learning techniques aimed at automatically predicting word pronunciations.
Several attempts towards pronunciation lexicon construction for Slovenian have been reported so far (Derlic & Kacic, 1997; Gros & Mihelic, 1999; Gros et al., 2001; Sef et al., 2002; Verdonik et al., 2002; Mihelic et al., 2003). However, none of them has used the full lemma lemma (lĕm`ə): see theorem.
(logic) lemma - A result already proved, which is needed in the proof of some further result. set as given in the Dictionary of Standard Slovenian (SSKJ) (SSKJ, 1991).
The paper describes the construction of a comprehensive reference pronunciation lexicon for Slovenian based on two sources: the information from the SSKJ and another list of the most frequent inflected word forms, which has been derived by an analysis of contemporary Slovenian text corpora corpora
plural form of corpus.
see corpus albicans.
sandy or gritty bodies, found in the pineal body; appear to be of glial or stromal origin; have the structure of .
2 The SI-PRON Lexicon
2.1 SI-PRON Wordlist
The work on designing a new pronunciation lexicon begins with the selection of words, multi-word expressions or phrases, which will be represented in the lexicon. Several word-list selection procedures are known (Ziegenheim, 2003).
The construction of the SI-PRON lexicon started with the complete lemma word list of 93,154 entries from the SSKJ provided by the Fran Ramovs Institute of the Slovenian Language Slovenian or Slovene (slovenski jezik or slovenščina) is an Indo-European language that belongs to the family of South Slavic languages. It is spoken by approximately 2 million speakers worldwide, the majority of whom live in Slovenia. , furnished with basic lexical stress information on the stressed vowels and pronunciation exceptions. The complete word pronunciations still had to be determined.
In order to further expand the SI-PRON word list, we are augmenting the SSKJ lemma descriptions with part-of-speech information and declension/conjugation categories (Toporisic, 1991), specifying the inflectional paradigms of the lemmas. Irregular inflected word forms are processed separately. Using automatic procedures, we are fully expanding the lemmas into inflected word forms. So far, over 1 million lexemes containing lexical stress information have been derived.
Since SSKJ contains many words derived from literary texts, not so common in everyday situations, we decided to upgrade the SI-PRON pronunciation lexicon with a list of 50,000 most frequent inflected word forms whose lemmas are not covered not covered Health care adjective Referring to a procedure, test or other health service to which a policy holder or insurance beneficiary is not entitled under the terms of the policy or payment system–eg, Medicare. Cf Covered. by the SSKJ word list. This additional word list has been derived from a statistical analysis of a contemporary Slovenian text corpus In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis, checking occurrences or validating linguistic rules on a specific universe. . The corpus comprising over 3 million Slovenian words was composed mainly from fiction and mainstream Slovenian newspaper texts: Delo, Vecer, and the former Slovenec. After tokenization and the elimination of numerals, named entities, acronyms, and abbreviations, the remaining text corpus included over 3 million tokens. Acronyms, abbreviations, and named entities were stored into separate word lists.
A statistical analysis performed on the text corpus showed that about 50.000 most frequent words accounted for approaching 95% of all non-SSKJ words used in the text corpus (Gros & Mihelic, 1999). These words form the main additional word list. They were equipped with part-of-speech tags indicating the part-of-speech function of the words in the text corpus.
2.2 Collocations and Multi-word Expressions
The identification of collocations, i.e. current combinations of words as they appear in context, can considerably increase the naturalness of synthetic speech synthetic speech
Speech that is produced by an electronic synthesizer activated by a keyboard, enabling individuals who are incapable of speech to communicate. . In human speech, collocations act as prosodic pros·o·dy
n. pl. pros·o·dies
1. The study of the metrical structure of verse.
2. A particular system of versification. units and are subject to a higher degree of reduction and internal coarticulation Co`ar`tic`u`la´tion
n. 1. (Anat.) The union or articulation of bones to form a joint. than they would be had they been ordinary, separate words. We have chosen a lexical approach for handling collocations. The most common collocations or multi-word expressions, reflexive verbs included, are stored in a separate pronunciation lexicon.
3 Phonetic Transcriptions
We have developed a tool to automatically derive word pronunciations for the SSKJ inflected words, by looking-up their stem pronunciation and appending that of the correct inflection from inflectional paradigms and morphological rules of Slovenian (Toporisic, 1991).
Therefore, the pronunciation of lexemes has been derived automatically for the SSKJ and SSKJ inflected word lists (about 2,500 entries, mainly words of foreign origin that do not obey the general Slovenian pronunciation rules, have been manually transcribed), and semi-automatically for the remaining part of the word list. Automatic lexical stress assignment and automatic grapheme-to-phoneme conversion rules have been used to process the latter.
3.1 Lexical Stress Assignment
The automatic lexical stress assignment algorithm for unseen words, which we applied is to a large extent determined by (un)stressable affixes, prefixes, and suffixes of morphs and is based upon observations by linguists (Toporisic, 1991).
For words that do not belong to these categories, the most probable stressed syllable is predicted using the results from a statistical analysis of stress position depending on the number of syllables within a word (Gros & Mihelic, 1999).
3.2 Grapheme-to-Phoneme Rules
Context-free grapheme-to-allophone rules from the Proteus standard words rule set (Zganec Gros, 2006) translate each grapheme A displayed or printed letter of the alphabet with all of its accent marks in place. See glyph. string into a series of allophones.
The rules are accessed sequentially until a rule that satisfies the current part of the input string is found. The transformation defined by that rule is then performed, and a pointer is incremented to point at the next unprocessed part of the input string. The procedure is repeated until the whole string has been converted.
The context free rules are rare and they include a one-to-one correspondence, two-to-one correspondence and one-to-two correspondence.
The vast majority of the rules for grapheme-to-allophone transcription for Standard Slovene are context-sensitive. This means that a grapheme or a string of graphemes is transcribed differently according to according to
1. As stated or indicated by; on the authority of: according to historians.
2. In keeping with: according to instructions.
3. its phonetic environment. Certainly all rules for determining which allophone al·lo·phone
1. Linguistics A predictable phonetic variant of a phoneme. For example, the aspirated t of top, the unaspirated t of stop, and the tt (pronounced as a flap) of batter of a certain phoneme phoneme
Smallest unit of speech distinguishing one word (or word element) from another (e.g., the sound p in tap, which differentiates that word from tab and tag). The term is usually restricted to vowels and consonants, but some linguists include differences of pitch, is to be used in a phonetic sequence are context-dependent.
Each context-sensitive rule consists of four parts: the left context, the string to be transcribed, its right context and the phonetic transcription. A number of writing conventions has been adopted in order to keep the number of rules relatively small and readable. The left and the right context may contain code characters describing larger phonetic sets, e.g.: '#' stands for vowels, '$' for consonants, '--' for white space.
The rules for consonants are rather straightforward, while those for vowels must handle vowel length and the variant realizations of the orthographic /e/ and the orthographic /o/ in stressed syllables.
A typical grapheme-to-allophone rule in the Proteus standard words rule set has the following structure:
left grapheme right allophone context string context string $ /er/ -- [@r] = /n/ k [N]
The first rule says that the word final /er/ preceded by a consonant is transcribed as [@r] (e.g. /gaber/ -> [*ga:.b@r]). The second rule implies that any /n/ followed by /k/ is transcribed into [N] ([N] is the allophone of [n] when followed by /k/ or /g/, e.g. in /anka/ -> [*a:N.ka]).
The initial rule set based on the one produced in 2001 (Gros et al., 2001) was built by taking into acconut various observations of expert linguists, e.g. (Toporisic, 1991), and other basic rule sets for Slovenian grapheme-to-allophone transcription (Gros & Mihelic, 1999).
The initial set of rules has been undergoing continuous refinement ever since and resulted in 194 rules of the Proteus standard words rule set (Zganec Gros, 2006). Rules for coarticulatory pronunciation corrections of words according to the words' left context and to the right context are included.
In the recent years, telecommunication applications of ASR and TTS have increased in importance, e.g. automatic telephone directory inquiry systems. Names of locations (cities, streets, etc.) and other proper names cannot be mentally reconstructed from the context when listening to the messages, and correct name pronunciation is required. The Proteus standard word rules developed for a standard Slovenian vocabulary do not lead to satisfactory results when applied to names. Therefore, additional 'name-specific' rules were added to the final Proteus standard words rule set resulting in the Proteus names rule set.
3.3 Transcription Accuracy Experiment
The phonemization errors were determined by comparing the automatic transcription outputs to manually verified pronunciation lexicon transcriptions.
A performance test applied on the SI-PRON SSKJ-based word list pronunciation lexicon showed error rates of about 25% in the stress assignment of unknown words and consequently in the phonetic transcription. If stress assignment and the transcriptions of graphemic /e/ and /o/ in stressed syllables was manually verified or known in advance, a transcription success rate of 99.1% was achieved for standard SSKJ words.
A closer examination of the mismatches revealed that the majority of the errors could be attributed to inconsistencies in manual labelling during the preparation of the original SSKJ.
As a consequence, we argue that, in order to semi-automatically derive phonetic transcriptions for Slovenian words not covered by the lexicon with a 0.3% error rate, manual validation of the stress position and its type have to be carried out, starting from automatically predicted stress positions. The rest can be performed automatically by applying our upgraded grapheme-to-phoneme conversion rule set.
4 SI-PRON Format
The SI-PRON lexicon format complies with the Pronunciation Lexicon Specification (PLS) Version 1.0, a W3C Voice Browser Activity working draft of syntax specification for pronunciation lexicons (W3C PLS Version 1.0, 2006). This lexicon specification has been recommended for use by speech recognition and speech synthesis engines in voice browser applications.
The element <lexeme> represents a lexical entry and may include multiple orthographies and multiple pronunciation information. An example of a simple lexicon file with a single lexeme (grammar) lexeme - A minimal lexical unit of a language. Lexical analysis converts strings in a language into a list of lexemes. For a programming language these word-like pieces would include keywords, identifiers, literals and punctutation. within SI-PRON would be as shown in Fig. 1.
Figure 1. An example of a simple lexicon file with a single lexeme within SI-PRON. <?xml version="1.0" encoding="UTF-8"?> <lexicon version="1.0" xml:lang="si-SI" alphabets="x-sampa-SI -reduced"> <lexeme> <grapheme>dober</grapheme> <phoneme>"d/o:-b@r</phoneme> <! -- This is an example of the x-sampa-SI-reduced string for the pronunciation of the Slovenian word: "dober", meaning "good" in English --> </lexeme> </lexicon>
In the Pronunciation Lexicon Specification, the pronunciation alphabet is specified by the alphabet attribute of the <phoneme> element. We are using the "x-sampa-SI-reduced" phonetic alphabet, a subset of the X-SAMPA set as defined for Slovenian (Zemljak et al., 2002), augmented with additional markers for Slovenian lexical stress accents (acute, circumflex circumflex /cir·cum·flex/ (serk´um-fleks) curved like a bow.
1. Curving or bending around.
curved like a bow. , and grave) and tonemic accents (tonemic acute and tonemic circumflex). Both primary and secondary stress positions are marked.
The <alias> element is used to provide the pronunciation of an acronym or an abbreviation in terms of an expanded orthographic representation.
Homographs or words with the same spelling but different pronunciations can be treated in two ways. If we do not want to distinguish between the two words then we can represent them as alternate pronunciations within the same <lexeme> element. In the opposite case, two different <lexeme> elements need to be used. In both cases the application, which is making use of the lexicon, will not be able to decide when to apply the first or the second transcription unless additional information, such as context-specific attributes or part-of-speech information is provided.
4.2 Multiple Pronunciations
Providing multiple pronunciations for items that share the same orthography and meaning is important for speech recognition lexicons because they provide information on variations of pronunciation within a language. Therefore, for many lexemes, words, and multi-word expressions, multiple standard pronunciations are specified, including those, which consider possible coarticulation effects at word boundaries. Multiple pronunciations are indicated by subsequent <phoneme> elements within one <lexeme> element.
Pronunciation preference--extensions needed?
In TTS applications, typically only one pronunciation among the multiple pronunciation possibilities is required. Therefore, to indicate default pronunciation variation, the prefer attribute can be used in PLS. In SI-PRON, unless marked otherwise, the default pronunciation is the first pronunciation from SSKJ.
However, sometimes several pronunciation variations in SSKJ are (almost) equally preferred, whereas the actual preferred pronunciation for the TTS engine may depend on the application. This is not to be confused with application-specific pronunciations, which can be handled in separate application-specific pronunciation lexica. What we have in mind is that there may exist several almost equally preferred pronunciations for a given grapheme, and the developers would like to have a mechanism that would enable them to systematically choose the preferred one.
Typically one of the two almost equally preferred pronunciations yields better rendering of input text if the application requires either overarticulated or fluent pronunciation. Therefore, we would welcome a new optional attribute to the <phoneme> element in PLS, the: pron-style attribute indicating the preferred pronunciation variation of a lexeme with respect to the desired pronunciation style. The two attribute values, which would be useful for SI-PRON, are "fluent" and "overarticulated".
In addition, the pron-style optional attribute would need to be introduced into SSML, as a defined attribute for the <voice>, <speak>, [p], and <s> elements.
For the same elements in SSML: <voice>, <speak>, [p], and <s>, another optional attribute, emotion, would be useful (e.g. for comuter games, where emotion changes occur frequently).
Example: For Slovenian male nouns, ending with a consonant followed by "ilec", SSKJ often provides one of the following single or multiple pronunciations of the "ilc" sequence within the genitive genitive (jĕn`ĭtĭv) [Lat.,=genetic], in Latin grammar, the case typically used to refer to a possessor. The term is used in the grammar of other languages, but the phenomenon referred to may not closely resemble a Latin genitive; thus a form of the noun: [iUts]/[ilts], [ilts]/[iUts], [ilts], or [iUts]; examples would be Slovenian words "nosilca", "krotilca", "darovalca", etc. Many other cases of such pronunciation variations are known for Slovenian, and are marked in SSKJ.
Whenever there are two pronunciation variations in SSKJ they typically account for an overarticulated (e.g. [ilts]) or a more fluent (e.g. [iUts]) pronunciation variation. The pronunciation order as indicated in SSKJ indicates a slight pronunciation preference in standard usage and should still be indicated by the prefer attribute. In order to enable high-quality TTS such pronunciation differentiations should be captured in the text rendering process.
This would avoid the contusion CONTUSION, med. jurisp. An injury or lesion, arising from the shock of a body with a large surface, which presents no loss of substance, and no apparent wound. If the skin be divided, the injury takes the name of a contused wound. Vide 1 Ch. Pr, 38; 4 Carr. & P. 381, 487, 558, 565; 6 Carr. of having a multitude of TTS pronunciation lexicons with different variations of the default pronunciation as given by the prefer attribute. The multiple lexicons are impossible to edit synchronously, and the proposed approach would allow us to use one master pronunciation lexicon.
4.3 Multiple Orthographies
Sometimes multiple orthographies of a word share the same meaning and pronunciation. They are presented with subsequent <grapheme> elements within a single <lexeme> element.
4.4 Part-of-Speech Tags
The most recent specification of the PLS focuses on the major features described in the PLS requirements document. Many more complex features, such as those providing morphological, syntactic and semantic information associated with pronunciations are expected to be introduced in a future revision of the PLS specification.
Therefore, proprietary <lemma> and <morphsynt> elements have been additionally defined for SI-PRON. Multext-East morphosyntactic descriptors for the Slovenian language, as described in (Erjavec, 2004), were used to provide the part-of-speech information of the lexemes, along with the lemmas.
5 SI-PRON Validation
Finally, the SI-PRON lexicon has been subjected to an automatic validation as a way to ensure that the structure of the document is well-formed and conforms with the chosen Document Type Definition (DTD (Document Type Definition) A language that describes the contents of an SGML document. The DTD is also used with XML, and the DTD definitions may be embedded within an XML document or in a separate file. ).
Additionally, manual validation of both phonemic transcriptions and morphosyntactic descriptions was performed on a subset of the lexicon comprising 5.000 lexical entries. A subset from the LC-STAR lexicon specifications for lexicon validation criteria was used (Shamas and den Heuvel, 2002).
A lexicon editing tool with a user-friendly interface has been designed to allow inspecting, editing, browsing and automatic validation of the pronunciation lexicon.
Due to free lexical stress position, pronunciation lexica are of crucial importance for development of speech technology applications and linguistic research for Slovenian. They are not only used for providing application-specific pronunciations or pronunciations of names, but are indispensable in any TTS or ASR system.
The task of constructing a master pronunciation lexicon is very tedious and time-consuming and should not be repeated often. Therefore, a master-lexicon approach is best suited for Slovenian TTS, in which many speaking-style pronunciation nuances are captured. We propose refined extensions to both PLS and SSML, which are described in section 4, and mainly deal with multiple pronunciations and morphosyntactic descriptions.
Along with Onomastica, SI-PRON presents a valuable language resource for linguistic studies as well as for research and development of speech technologies for Slovenian. The lexicon is already being used by the Proteus Slovenian text-to-speech synthesis system (Zganec Gros, 2006) and for generating audio samples of the SSKJ word list, which are available at the very end of every SSKJ lexical entry description (SSKJ audio, 2006).
A part of the presented work has been financed as an applied research project by the Slovenian Research Agency under contract No. 5405.
Received: August 12, 2006
 Derlic, R., Kacic, Z., (1996). Definition of pronunciation dictionary of names and letter-to-sound rules for Slovene language--project Onomastica. In Proceedings of the 2nd International Workshop on Speech dialog man-machine, Maribor, Slovenia, June 26-27, pp. 153-158.
 Erjavec, T. (2004). MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora. In Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC'04, Lisbon, Portugal, pp. 1535-1538.
 Fersoe, H., Hartikainen, E., van den Heuvel, H., Maltese G., Moreno A., Shammass S., Ziegenhain U. (2004). Creation and Validation of Large Lexica for Speech-to-Speech Translation Purposes. In Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC'04, Lisbon, Portugal.
 Gros, J., Mihelic, F., (1999). Acquisition of an extensive rule set for Slovene grapheme-to-allophone transcription. In Proceedings of the 6th European Conference on Speech Communication and Technology EUROSPEECH'99, Budapest, Hungary, pp. 2075-2078.
 Gros, J., Mihelic, F., Pavesic, N., Zganec, M., Mihelic, A., Knez, M., Mercun, A., Skerl, D., (2001). The phonetic SMS (1) (Storage Management System) Software used to routinely back up and archive files. See HSM.
(2) (Systems Management Server) Systems management software from Microsoft that runs on Windows NT Server. reader. In Proceedings of the Text, speech and dialogue 4th international conference, Zelezna Ruda, Czech Republic, Lecture notes in artificial intelligence, 2166. Berlin: Springer, pp. 334-340
 Mihelic, F., Zganec Gros, J., Dobrisek, S., Zibert, J. and Pavesic, N., (2003). "Spoken language resources at LUKS of the University of Ljubljana The University of Ljubljana (in Slovenian, Univerza v Ljubljani; in Latin, Universitas Labacensis) is the first and the largest university in Slovenia; with 56,000 enrolled students, it ranks among the biggest universities in the world. ", International Journal on Speech Technologies, Vol. 6, No. 3, pp. 221-232.
 PLS-W3C, (2006). Pronunciation Lexicon Specification (PLS) Version 1.0, W3C Working Draft 31 January 2006. http://www.w3.org/TR/pronunciation-lexicon/S4.7.
 Romary, L., Francopoulo, G., Monachini, M. and Salmon-Alt, S. (2006). Lexical Markup Framework: working to reach a consensual ISO standard on lexicons. To be presented at LREC'06 as a tutorial. Genoa, Italy.
 SSKJ audio (2006). available from http://bos.zrcsazu.si/sskj.html.
 Verdonik, D., Rojc, M., Kacic, Z., Horvat, B., (2002). Zasnova in izgradnja oblikoslovnega in glasovnega slovarja za slovenski knjizni jezik. In Zbornik konference Jezikovne tehnologije'02. Editors: Yomaz Erjavec, Jerneja Gros, Ljubljana, Slovenia, pp. 44-48.
 Verdonik, D., Rojc, M. and Kacic, Z., (2004). Creating Slovenian language resources for development of speech-to-speech translation components, In Proceedings of the Fourth International Conference on Language Resources and Evaluation LREC'04. Lisbon, Portugal, pp. 1399-1402.
 Shammass, S. & van den Heuvel, H., (2004). Specification of validation criteria for lexicons for recognition and synthesis, LC-STAR Deliverable D6.1. available from www.lc-star.com.
 SSKJ (1997). Slovar slovenskega knjiznega jezika (The Dictionary of Standard Slovenian). 2nd edition, Ljubljana: DZS DZS Digital Zone System .
 Sef, T., Gams, M., Skrjanc, M., (2002). Automatic lexical stress assignment of unknown words for highly inflected Slovenian language. In Zbornik 11. mednarodne Elektrotehniske in racunalniske konference ERK ERK Extracellular Signal-Regulated Kinase
ERK Electronic Records Keeping
ERK Externally Regulated Kinases 2002. Portoroz, Slovenija., pp. 247-250. in Slovenian.
 Toporisic, J. (1991). Slovenska Slovenica (Slovenian Grammar). Zalozba Obzorja Maribor.
 Zemljak, M., Kacic, Z., Dobrisek, S., Gros, J., Weiss, P., (2002). Raeunalniski simbolni foneticni zapis slovenskega govora. Slavisticna revija, Vol. 50, No. 2, pp. 159-169.
 Ziegenhain, U., (2003). Specification of corpora and word lists in 12 languages. LC-STAR Deliverable D1.1. available from www.lc-star.com.
 Zganec Gros, J., (2006). Text-to-speech synthesis for embedded speech user interfaces, In WSEAS WSEAS World Scientific and Engineering Academy and Society Transactions on Communications, No. 4, Vol. 5, pp. 543-548.
Jerneja Zganec Gros
Alpineon Research and Development, Ulica Iga Grudna 15, SI-1000 Ljubljana, Slovenia
E-mail: email@example.com, http://www.alpineon.si
Varja Cvetko-Oresnik and Primoz Jakopin
Fran Ramovs Institute of the Slovenian Language, Novi trg 4, SI-1000 Ljubljana, Slovenia
E-mail: firstname.lastname@example.org, http://isjfr.zrc-sazu.si/