|
|
|
| |
|
|
|
2009-12-16 16:42 |
| | Statistical Behavior of Fast Hashing of Variable-Length Text Strings / Savoy, Jacques In: SIGIR Forum (Special Interest Group on Information Retrieval), 1990, vol. 24, no. 3, p. 62-71. | | | | Résumé anglais: In information retrieval, we often have to store and search for a particular record into a large amount of information. For example, during a document indexing process or when a program is trying to spell a text, a dictionary has to be used in an efficient way. A solution to that problem resides in using a hash table. However, if we known many algorithms for manipulating or accessing hash tables [Knuth 73], [Standish 80], [Wiederhold 87], the main problem is to define a "good" hash [...] | | Mot clés anglais: key-to-adress transformation ; hashing ; hash function ; hash coding ; direct access method ; scatter storage ; dictionary lookup ; information retrieval | | Texte intégral (1.8 MB, 2009-12-16 16:42:25) | | Notice détaillée - Notices similaires |
2009-12-16 12:12 |
| | Approaches to Collection Selection and Results Merging for Distributed Information Retrieval / Rasolofo, Yves, Abbaci, Faïza, Savoy, Jacques In: Conference on Information and Knowledge Management (CIKM’01), 2001, p. 191-198. | | | | Résumé anglais: We have investigated two major issues in Distributed Information Retrieval (DIR), namely: collection selection and search results merging. While most published works on these two issues are based on pre-stored metadata, the approaches described in this paper involve extracting the required information at the time the query isprocessed. In order to predict the relevance of collections to a given query, we analyse a limited number of full documents (e.g., the top five documents) retrieved [...] | | Mot clés anglais: distributed information retrieval ; collection selection ; searchresults merging ; evaluation | | Texte intégral (370.3 KB, 2009-12-16 12:12:28) | | Notice détaillée - Notices similaires |
2009-12-16 10:46 |
| | Comparative Study of Monolingual and Multilingual Search Models for Use with Asian Languages / Savoy, Jacques In: ACM Transactions on Asian Language Information Processing (T.A.L.I.P.), 2005, vol. 4, no. 2, p. 163-189. | | | | Résumé anglais: Based on the NTCIR-4 test-collection, our first objective is to present an overview of the retrieval effectiveness of nine vector-space and two probabilistic models that perform monolingual searches in the Chinese, Japanese, Korean, and English languages. Our second goal is to analyze the relative merits of the various automated and freely available toolsto translate the English-language topics into Chinese, Japanese, or Korean, and then submit the resultant query in order to retrieve [...] | | Mot clés anglais: Algorithms ; Experimentation ; Measurement | | Texte intégral (606.9 KB, 2009-12-16 10:46:09) | | Notice détaillée - Notices similaires |
2009-12-16 09:20 |
| | Light stemming approaches for the French, Portuguese, German and Hungarian languages / Savoy, Jacques In: Proceedings of the 2006 ACM Symposium on Applied Computing (SAC’06), 2006, p. 1031-1035. | | | | Résumé anglais: This paper describes and evaluates various general stemmingapproaches for the French, Portuguese (Brazilian), German andHungarian languages. Based on the CLEF test-collections, wedemonstrate that light stemmers for the French, Portugueseand Hungarian languages perform well, and reasonably well forthe German language. Variations in mean average precisionamong the different stemming approaches are also evaluatedand sometimes they are found statistically significant. | | Mot clés anglais: stemming for French ; Portuguese ; German ; Hungarian ; stemmer ; natural language processing | | Texte intégral (291 KB, 2009-12-16 09:19:48) | | Notice détaillée - Notices similaires |
2009-12-16 08:56 |
| | Why do Successful Search Systems Fail for Some Topics / Savoy, Jacques In: Proceedings of the 2007 ACM Symposium on Applied Computing (SAC’07), 2007, p. 872-877. | | | | Résumé anglais: This paper describes and evaluates the vector-space and probabil-istic IR models used to retrieve news articles from a corpus writ-ten in the French language. Based on three CLEF test-collections and 151 queries, we classify the poor retrieval results of difficult topics under 6 categories. The explanations we obtain from this analysis differ from those suggested a priori by our students. We use the Web to manually or automatically find related search terms to the original query. We [...] | | Mot clés anglais: failure analysis ; robust evaluation | | Texte intégral (301.8 KB, 2009-12-16 08:55:47) | | Notice détaillée - Notices similaires |
2009-12-15 18:16 |
| | Investigation in statistical language-independent approaches for opinion detection in English, Chinese and Japanese / Zubaryeva, Olena, Savoy, Jacques In: Proceedings of the Third International Workshop on Cross Lingual Information Access (CLIAWS3): Addressing the Information Need of Multilingual Societies, 2009, p. 38-45. | | | | Résumé anglais: In this paper we present a new statistical approach to opinion detection and its' evaluation on the English, Chinese and Japanese corpora. Besides, the proposed method is compared with three baselines, namely Naïve Bayes classifier, a language model and an approach based on significant collocations. These models being language independent are improved with the use of language-dependent technique on the example of the English corpus. We show that our method almost always gives better [...] | | | Texte intégral (358.8 KB, 2009-12-15 18:16:24) | | Notice détaillée - Notices similaires |
2009-12-15 17:38 |
| | How effective is Google's translation service in search? / Savoy, Jacques, Dolamic, Ljiljana In: Communications of the ACM, 2009, vol. 52, no. 10, p. 139-143. | | | | Résumé anglais: In multilingual countries (Canada, Hong Kong, India, among others) and large international organizations or companies (such as, WTO, European Parliament), and among Web users in general, accessing information written in other languages has become a real need (news, hotel or airline reservations, or government information, statistics). While some users are bilingual, others can read documents written in another language but cannot formulate a query to search it, or at least cannot provide [...] | | | Texte intégral (290.7 KB, 2009-12-15 17:33:31) | | Notice détaillée - Notices similaires |
2009-12-10 09:36 |
| | Antimonate opaque glaze colours from the faience manufacture of Le Bois d'Épense (19th century, Northeastern France)* / Maggetti, Marino, Neururer, Christoph, Rosen, J. In: Archaeometry, 2009, vol. 51, no. 5, p. 791-807. | | | | Résumé anglais: Three types of antimony-based, opaque ceramic colours were used in the faience workshop of Le Bois d'Épense during the first decades of the 19th century; that is, yellow, tawny and green. Yellow is generated by lead antimonate crystals (Naples Yellow), which are incorporated into an uncoloured glass matrix. According to SEM–EDS measurements, these pigments contain iron. The tawny colour is the optical result of the combined presence of similar yellow, iron-bearing lead antimonate [...] | | | pdf (481.8 KB, 2009-12-10 09:36:27) | | Notice détaillée - Notices similaires |
2009-12-09 11:01 |
| | Algorithmic stemmers or morphological analysis? An evaluation / Fautsch, Claire, Savoy, Jacques In: Journal of the American Society for Information Science and Technology, 2009, vol. 60, no. 8, p. 1616-1624. | | | | Résumé anglais: It is important in information retrieval (IR), information extraction, or classification tasks that morphologically related forms are conflated under the same stem (using stemmer) or lemma (using morphological analyzer). To achieve this for the English language, algorithmic stemming or various morphological analysis approaches have been suggested. Based on Cross-Language Evaluation Forum test collections containing 284 queries and various IR models, this article evaluates these [...] | | | Texte intégral (344.5 KB, 2009-12-09 10:56:08) | | Notice détaillée - Notices similaires |
2009-12-09 09:44 |
| | Indexing and searching strategies for the Russian language / Dolamic, Ljiljana, Savoy, Jacques In: Journal of the American Society for Information Science and Technology, 2009, vol. 60, no. 12, p. 2540-2547. | | | | Résumé anglais: This paper describes and evaluates various stemming and indexing strategies for the Russian language. We design and evaluate two stemming approaches, a light and a more aggressive one, and compare these stemmers to the Snowball stemmer, to no stemming, and also to a language-independent approach (n-gram). To evaluate the suggested stemming strategies we apply various probabilistic information retrieval (IR) models, including the Okapi, the Divergence from Randomness (DFR), a [...] | | | Texte intégral (335.4 KB, 2009-12-09 09:43:41) | | Notice détaillée - Notices similaires |
|
|
|
|