Show simple item record

dc.contributor.authorΦράγκου, Παυλίναel_GR
dc.contributor.authorFragkou, Pavlinaen
dc.coverage.spatialGR - Κωen
dc.date.available2014-02-03T10:56:15Z
dc.date.issued2011
dc.identifier.urihttp://hdl.handle.net/10797/13757en
dc.descriptionΠεριέχει το πλήρες κείμενοel_GR
dc.description.abstractIn this paper we examine the benefit of performing named entity recognition and co-reference resolution to a Greek corpus used for text segmentation. Segments consist of portions among one of the 300 documents published by ten different authors in the Greek newspaper "To Vima". The aim here is to examine whether the combination of text segmentation and information extraction (and most specifically the named entity recognition and co-reference resolution steps) can prove to be beneficial for the identification of the various topics that appear in a document. Named entity recognition was performed using an already existing tool which was trained on a similar corpus. The produced annotations were manually corrected and enriched in order to cover four types of named entities (i.e. person name, organization, location and time). Coreference resolution and most specifically substitution of every reference of the same instance with the same named entity identifier was performed in a subsequent step. The evaluation using three well known text segmentation algorithms leads to the conclusion that, the benefit highly depends on the segment's topic, the number of named entity instances appearing in it, as well as the segment's length.en
dc.language.isoengen
dc.relation.ispartofSymposium on Information and Knowledge Managementen
dc.rightsinfo:eu-repo/semantics/openAccessen
dc.source1rd International Conference on Integrated Informationen
dc.titleText Segmentation Using Named Entity Recognition and Co-Reference Resolution in Greek Textsen
dc.typeConference Objecten
dc.subject.uncontrolledtermText segmentationen
dc.subject.uncontrolledtermNamed entity recognitionen
dc.subject.uncontrolledtermCo-reference resolutionen
dc.subject.uncontrolledtermInformation extractionen
dc.subject.JITAΔιαχείριση υπηρεσιών, λειτουργιών και τεχνικών πληροφόρησηςel_GR
dc.subject.JITAInformation treatment for information services, Information functions and techniquesen
dc.contributor.conferenceorganizer2nd AMICUS Workshopen
dc.contributor.conferenceorganizerMednet Hellas, The Greek Medical Networken
dc.contributor.conferenceorganizerNational And Kapodistrian University of Athensen
dc.contributor.conferenceorganizerUniversity of Peloponneseen
dc.contributor.conferenceorganizerEmerald Group Publishing Limiteden
dc.contributor.conferenceorganizerTechnological educational Institute of Athensen
dc.identifier.JITAIZen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record