Text mining in unstructured text: techniques, methods and analysis

ABSTRACT

Text mining is a process to extract interesting and significant patterns to explore knowledge from textual data sources. Approximately, 90% of world’s data is held in unstructured format. In the 21^stcentury, unstructured data is growing exponentially. Computational text analysis has become an exciting research field with many applications in communication research. It can be a difficult method to apply, because it requires knowledge of various techniques, and the software required to perform most of these techniques is not readily available in common statistical software packages. This report takes a quick look at how to organize and analyze huge volume of unstructured text data using R programing language. The Coronavirus Corpus data set was used for the evaluation. Different features obtained from the data management part of tokenization, removal of punctuations, stemming and construction of the document-term matrix (DTM) were further used for the analysis. Visualization, finding associations, networks and groups among the extracted features are included in the analysis. Overall, this paper provides a practical demonstration of text mining using a real data set.

Support the magazine and subscribe to the content

This is premium stuff. Subscribe to read the entire article.

Gain access to all our Premium contents.
More than 3000+ articles.

Subscribe Now

Buy Article

Unlock this article and gain permanent access to read it.

Unlock Now

Text mining in unstructured text: techniques, methods and analysis

Author Muhammad Abdur Razzaqe, Tapati Basak, 174 (2022) 68-84

The Influence of Market-Based and Bank-Based Financial Systems on Economic Growth: An Evaluation of Nigeria and South Africa’s Data

Modelling of Global LNG Consumption at Some Optimal Prices

View free articles

View Articles

Last Articles

Condensed Geometry II: New Constraints, Temporal Confinement Phase and Structure and Interpretation of Space and Time

Assessment of the constraints in the environmental management plan of filling stations in Kaduna metropolis, Nigeria

Preparation and characterization of TiO2 and TiO2P25 nanomaterial and photocatalytic application

Popular Articles

About Us

Submit your Article

Jeevamrut – A Natural Fertilizer

Abstracting & Indexing

Guide for Authors

Careers

Menu

Other databases

EISSN 2392-2192

Welcome Back!

Retrieve your password

Are you sure want to unlock this post?

Are you sure want to cancel subscription?