VisualUrText

A Text Analytics Tool for Unstructured Textual Data

Zuraini Zainol, Mohd T.H. Jaymes, Nohuddin Puteri Nor Ellyza

Research output: Contribution to journalConference article

1 Citation (Scopus)

Abstract

The growing amount of unstructured text over Internet is tremendous. Text repositories come from Web 2.0, business intelligence and social networking applications. It is also believed that 80-90% of future growth data is available in the form of unstructured text databases that may potentially contain interesting patterns and trends. Text Mining is well known technique for discovering interesting patterns and trends which are non-trivial knowledge from massive unstructured text data. Text Mining covers multidisciplinary fields involving information retrieval (IR), text analysis, natural language processing (NLP), data mining, machine learning statistics and computational linguistics. This paper discusses the development of text analytics tool that is proficient in extracting, processing, analyzing the unstructured text data and visualizing cleaned text data into multiple forms such as Document Term Matrix (DTM), Frequency Graph, Network Analysis Graph, Word Cloud and Dendogram. This tool, VisualUrText, is developed to assist students and researchers for extracting interesting patterns and trends in document analyses.

Original languageEnglish
Article number012011
JournalJournal of Physics: Conference Series
Volume1018
Issue number1
DOIs
Publication statusPublished - 1 Jun 2018
Event1st International Conference on Big Data and Cloud Computing, ICoBiC 2017 - Kuching, Sarawak, Malaysia
Duration: 25 Nov 201727 Nov 2017

Fingerprint

trends
natural language processing
information retrieval
network analysis
linguistics
data mining
machine learning
intelligence
students
statistics
matrices

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Cite this

VisualUrText : A Text Analytics Tool for Unstructured Textual Data. / Zainol, Zuraini; Jaymes, Mohd T.H.; Puteri Nor Ellyza, Nohuddin.

In: Journal of Physics: Conference Series, Vol. 1018, No. 1, 012011, 01.06.2018.

Research output: Contribution to journalConference article

@article{b3f1aaff548f41e1bad544d3ca91cfbe,
title = "VisualUrText: A Text Analytics Tool for Unstructured Textual Data",
abstract = "The growing amount of unstructured text over Internet is tremendous. Text repositories come from Web 2.0, business intelligence and social networking applications. It is also believed that 80-90{\%} of future growth data is available in the form of unstructured text databases that may potentially contain interesting patterns and trends. Text Mining is well known technique for discovering interesting patterns and trends which are non-trivial knowledge from massive unstructured text data. Text Mining covers multidisciplinary fields involving information retrieval (IR), text analysis, natural language processing (NLP), data mining, machine learning statistics and computational linguistics. This paper discusses the development of text analytics tool that is proficient in extracting, processing, analyzing the unstructured text data and visualizing cleaned text data into multiple forms such as Document Term Matrix (DTM), Frequency Graph, Network Analysis Graph, Word Cloud and Dendogram. This tool, VisualUrText, is developed to assist students and researchers for extracting interesting patterns and trends in document analyses.",
author = "Zuraini Zainol and Jaymes, {Mohd T.H.} and {Puteri Nor Ellyza}, Nohuddin",
year = "2018",
month = "6",
day = "1",
doi = "10.1088/1742-6596/1018/1/012011",
language = "English",
volume = "1018",
journal = "Journal of Physics: Conference Series",
issn = "1742-6588",
publisher = "IOP Publishing Ltd.",
number = "1",

}

TY - JOUR

T1 - VisualUrText

T2 - A Text Analytics Tool for Unstructured Textual Data

AU - Zainol, Zuraini

AU - Jaymes, Mohd T.H.

AU - Puteri Nor Ellyza, Nohuddin

PY - 2018/6/1

Y1 - 2018/6/1

N2 - The growing amount of unstructured text over Internet is tremendous. Text repositories come from Web 2.0, business intelligence and social networking applications. It is also believed that 80-90% of future growth data is available in the form of unstructured text databases that may potentially contain interesting patterns and trends. Text Mining is well known technique for discovering interesting patterns and trends which are non-trivial knowledge from massive unstructured text data. Text Mining covers multidisciplinary fields involving information retrieval (IR), text analysis, natural language processing (NLP), data mining, machine learning statistics and computational linguistics. This paper discusses the development of text analytics tool that is proficient in extracting, processing, analyzing the unstructured text data and visualizing cleaned text data into multiple forms such as Document Term Matrix (DTM), Frequency Graph, Network Analysis Graph, Word Cloud and Dendogram. This tool, VisualUrText, is developed to assist students and researchers for extracting interesting patterns and trends in document analyses.

AB - The growing amount of unstructured text over Internet is tremendous. Text repositories come from Web 2.0, business intelligence and social networking applications. It is also believed that 80-90% of future growth data is available in the form of unstructured text databases that may potentially contain interesting patterns and trends. Text Mining is well known technique for discovering interesting patterns and trends which are non-trivial knowledge from massive unstructured text data. Text Mining covers multidisciplinary fields involving information retrieval (IR), text analysis, natural language processing (NLP), data mining, machine learning statistics and computational linguistics. This paper discusses the development of text analytics tool that is proficient in extracting, processing, analyzing the unstructured text data and visualizing cleaned text data into multiple forms such as Document Term Matrix (DTM), Frequency Graph, Network Analysis Graph, Word Cloud and Dendogram. This tool, VisualUrText, is developed to assist students and researchers for extracting interesting patterns and trends in document analyses.

UR - http://www.scopus.com/inward/record.url?scp=85048359193&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85048359193&partnerID=8YFLogxK

U2 - 10.1088/1742-6596/1018/1/012011

DO - 10.1088/1742-6596/1018/1/012011

M3 - Conference article

VL - 1018

JO - Journal of Physics: Conference Series

JF - Journal of Physics: Conference Series

SN - 1742-6588

IS - 1

M1 - 012011

ER -