WelCome to Ambo University Institutional Repository!!

Automatic Relation Extraction Between Entities For Amharic Text

Show simple item record

dc.contributor.author Melkamu, Genet
dc.date.accessioned 2023-10-31T07:10:10Z
dc.date.available 2023-10-31T07:10:10Z
dc.date.issued 2021-05
dc.identifier.uri http://hdl.handle.net/123456789/3164
dc.description.abstract Automatic relation extraction is the task of identifying or recognizing the relations between entities. It is a sub component of information extraction among named entity recognition, co reference and anaphoric resolution, temporal and event detection and template filling. Among those information extraction components this study focused only Automatic relation extraction between entities for Amharic text using supervised machine learning approach. Once named entities are identified, relation extraction is the second step and sub-task of information extraction. For instance, “Engineer Ktaw Ejigu is the chief scientist of the American space science” contains the person-affiliation relation between the person Engineer Ktaw Ejigu and the organization American space science. From this the words “the chief scientist of” are the relation between the entities of the person and organization. The named entities are targeted predefined person, location and organization and the relations are also predefined which are existed between those corresponding entities to extract in the text. The problem of relation extraction from a text; the system developed for English language text or other foreign languages text cannot be applicable for Ethiopian Amharic or other text; this is due to the difference of nature or structure of the language. This study prepared own corpus from Walta information center website archive resources to obtain a suitable number of 30,466 words or tokens from 2000 sentences. To create the model of the system this work used supervised machine learning. For this study two models are created using support vector machine and conditional random field machine learning. In the SVM algorithm model using stochastic gradient descent classifier algorithm precision 49%, recall 10% and F1-score 13%, passive aggressive classifier algorithm precision 55%, recall 41% and F1-score of 48 and multinomial Naïve Bayes classifier algorithm is highly scorer among the tested algorithms and can obtain results of Precision 60%, Recall 41% and F1-score of 48%. But, by using conditional random field it can be achieved Precision 87%, Recall 87% and F-score 86% respectively. As the performance of the system indicates that; in this work CRF is a selected algorithm to train and create a model for Automatic relation extraction between entities for Amharic text proposed architecture. en_US
dc.language.iso en en_US
dc.publisher Ambo University en_US
dc.subject Information Extraction en_US
dc.subject Relation Extraction en_US
dc.subject Supervised Machine Learning en_US
dc.title Automatic Relation Extraction Between Entities For Amharic Text en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AmbouIR


Advanced Search

Browse

My Account