dc.contributor.author | Μάστορα, Άννα | el_GR |
dc.contributor.author | Μονόπωλη, Μαρία | el_GR |
dc.contributor.author | Καπιδάκης, Σαράντος | el_GR |
dc.contributor.author | Mastora, Anna | en |
dc.contributor.author | Monopoli, Maria | en |
dc.contributor.author | Kapidakis, Sarantos | en |
dc.date.available | 2013-09-02T09:58:34Z | |
dc.date.issued | 2011 | |
dc.identifier.uri | http://hdl.handle.net/10797/13027 | en |
dc.description | Contains the full text | en |
dc.description.abstract | The aim of the study is to elaborate on the procedure needed to perform a morpho-syntactic analysis of queries containing typing errors that were submitted in Greek during the search process. In the context of our analysis, a failed query is a query that returned no hits. The analysis showed that failed queries represent 36% of the submitted queries. More specifically, 19.6% of failed queries occurred due to typing errors. We found that, for the morpho-syntactic analysis of a Greek text corpus, PoS tools need a rich tag set in order to work adequately. The Open Xerox tokenizer performed well, but only after significant pre-processing of the queries, and the analyzer seems to require additional tools to improve its performance. MS Word, which was used for spelling corrections, seems to perform satisfactorily. All tools were challenged by named-entity recognition. | en |
dc.language.iso | eng | en |
dc.rights | info:eu-repo/semantics/openAccess | en |
dc.source | Workshop on Digital Information Management - 1o | en |
dc.title | Failed Queries: a Morpho-Syntactic Analysis Based on Transaction Log Files | en |
dc.type | Conference Object | en |
dc.subject.uncontrolledterm | Failed queries | en |
dc.subject.uncontrolledterm | Morpho-syntactic analysis | en |
dc.subject.uncontrolledterm | PoS tagging | en |
dc.subject.uncontrolledterm | Typing errors | en |
dc.subject.JITA | Διαχείριση υπηρεσιών, λειτουργιών και τεχνικών πληροφόρησης, Γλώσσες ευρετηρίασης, διαδικασίες και σχήματα | el_GR |
dc.subject.JITA | Information treatment for information services, Information functions and techniques, Index languages, processes and schemes | en |
dc.identifier.JITA | IC | en |