Please use this identifier to cite or link to this item:
http://repository.unizik.edu.ng/handle/123456789/580
Title: | Automatic Text Classification on Blogs Using Support Vector Machines (SVM) |
Authors: | Asogwa, D.C Efozia, F.N Chukwuneke, C.I Nnaekwe, K.U |
Keywords: | machine learning text classification feature extraction pre-processing algorithm supervised learning |
Issue Date: | Apr-2022 |
Publisher: | International Journal of Research Publication and Reviews |
Citation: | International Journal of Research Publication and Reviews, Vol 3, no 4, pp 805-809 |
Abstract: | Automatic Text Classification is a machine learning task that automatically assigns a given text document to a set of pre-defined categories based on the features extracted from its textual content. Most online communication forums, including social media, enable users to express themselves freely, and most times, anonymously. The ability to freely express oneself is a human right that should be cherished, but people always induce and spread hate or illegal words towards another group as an abuse of this liberty. For instance many online forums such as Facebook, YouTube, and Twitter consider hate speech harmful, and have policies to remove hate speech content. This paper attempts to automatically classify the textual entries made by bloggers on various topics into hate speech and non-hate speech. This was achieved by following steps like pre-processing, feature extraction and support vector machine classification. Empirical evaluation of this binary classification has resulted in an accuracy of approximately 83% over the test set. In addition to classifying the textual entries of the blogs, it is proposed that the extracted features themselves be further classified under more meaningful heads which results in generation of a semantic resource that lends greater understanding to the classification task. This semantic resource can be used for data mining requirements that arise in the future. |
Description: | Scholarly Works |
URI: | www.ijrpr.com http://repository.unizik.edu.ng/handle/123456789/580 |
ISSN: | 2582-7421 |
Appears in Collections: | Scholarly Works |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
automatic classification of blogs.pdf | 412.13 kB | Adobe PDF | View/Open |
Items in UnizikSpace are protected by copyright, with all rights reserved, unless otherwise indicated.