WelCome to Ambo University Institutional Repository!!

Afan Oromo Document Text Classification Using Single Layer Multisize Filters Convolutional Neural Network

Show simple item record

dc.contributor.author Kasahun, Teshome
dc.date.accessioned 2023-10-31T06:57:51Z
dc.date.available 2023-10-31T06:57:51Z
dc.date.issued 2021-10
dc.identifier.uri http://hdl.handle.net/123456789/3161
dc.description.abstract Text classification is one of the most widely used natural language processing technologies. It’s the technique which classifies unstructured text data into meaningful categorical classes. With the continuously increasing amount of online information, there is a pressing need to classify text for valuable information. Previously, many researchers have been done Afan Oromo text classification using machine learning methods. However, most of these traditional methods use TF-IDF, Bag of words to map some representation of the input data to predefined set of meaningful outputs but ignoring the context and internal hierarchy of the text and in addition, the traditional approach treats labels as independent individuals while ignores the relationships between them, which not reflect reality but also leads significant loss of semantic information, these limitations can be solved by deep learning methods. So, in this study, we use a Single layer Multi-Size Filters Convolutional Neural Network for document text classification and we collect dataset that contains 6450 documents organize into ten classes. We also look at how preprocessing approaches affect the performance of Single-layer Multi-Size Filters Convolutional Neural Networks. After hyperparameter tuning of our model, the performance of SMF-CNN evaluated using those different ways: Fast-Text pre-trained and Word2vec pre-trained word embedding, the other is without using pre-trained embedding. The experimental results show Single-layer Multi Size Filters Convolutional Neural Network performance can achieve both effectiveness and good scalability of the accuracy is 96.81%, it can be seen that only Fast-Text pre trained word embedding is greater accuracy. en_US
dc.language.iso en en_US
dc.publisher Ambo University en_US
dc.subject Text Classification en_US
dc.subject Afan Oromo en_US
dc.subject Convolutional Neural Network en_US
dc.title Afan Oromo Document Text Classification Using Single Layer Multisize Filters Convolutional Neural Network en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AmbouIR


Advanced Search

Browse

My Account