dc.contributor.author | Φράγκου, Παυλίνα | el_GR |
dc.contributor.author | Fragkou, Pavlina | en |
dc.coverage.spatial | GR - Κω | en |
dc.date.available | 2014-02-03T10:56:15Z | |
dc.date.issued | 2011 | |
dc.identifier.uri | http://hdl.handle.net/10797/13757 | en |
dc.description | Περιέχει το πλήρες κείμενο | el_GR |
dc.description.abstract | In this paper we examine the benefit of
performing named entity recognition and co-reference
resolution to a Greek corpus used for text segmentation.
Segments consist of portions among one of the 300
documents published by ten different authors in the
Greek newspaper "To Vima". The aim here is to
examine whether the combination of text segmentation
and information extraction (and most specifically the
named entity recognition and co-reference resolution
steps) can prove to be beneficial for the identification of
the various topics that appear in a document. Named
entity recognition was performed using an already
existing tool which was trained on a similar corpus. The
produced annotations were manually corrected and
enriched in order to cover four types of named entities
(i.e. person name, organization, location and time). Coreference
resolution and most specifically substitution
of every reference of the same instance with the same
named entity identifier was performed in a subsequent
step. The evaluation using three well known text
segmentation algorithms leads to the conclusion that,
the benefit highly depends on the segment's topic, the
number of named entity instances appearing in it, as
well as the segment's length. | en |
dc.language.iso | eng | en |
dc.relation.ispartof | Symposium on Information and Knowledge Management | en |
dc.rights | info:eu-repo/semantics/openAccess | en |
dc.source | 1rd International Conference on Integrated Information | en |
dc.title | Text Segmentation Using Named Entity Recognition and Co-Reference Resolution in Greek Texts | en |
dc.type | Conference Object | en |
dc.subject.uncontrolledterm | Text segmentation | en |
dc.subject.uncontrolledterm | Named entity recognition | en |
dc.subject.uncontrolledterm | Co-reference resolution | en |
dc.subject.uncontrolledterm | Information extraction | en |
dc.subject.JITA | Διαχείριση υπηρεσιών, λειτουργιών και τεχνικών πληροφόρησης | el_GR |
dc.subject.JITA | Information treatment for information services, Information functions and techniques | en |
dc.contributor.conferenceorganizer | 2nd AMICUS Workshop | en |
dc.contributor.conferenceorganizer | Mednet Hellas, The Greek Medical Network | en |
dc.contributor.conferenceorganizer | National And Kapodistrian University of Athens | en |
dc.contributor.conferenceorganizer | University of Peloponnese | en |
dc.contributor.conferenceorganizer | Emerald Group Publishing Limited | en |
dc.contributor.conferenceorganizer | Technological educational Institute of Athens | en |
dc.identifier.JITA | IZ | en |