Abstract:
Millions of structured, semi-structured, and unstructured documents are produced around the
globe a day today. Several research societies like IEEE, Elsevier, Springer, and Wiley that we
use to publish the scientific documents enormously and some individual's documents are sources
of such data. Thanks to their massive volume and ranging document formats, search engines face
problems in indexing such documents, thus retrieving inefficient, tedious, and time-consuming
data. Information extraction from such documents is among the most well-liked areas of research
in data/text mining. Because the number of such documents is increasing tremendously day by
day on a large scale that's why proper and more sophisticated information extraction techniques
are necessary to find out, this research focuses on reviewing and summarizing existing practices
in information extraction to highlighting their limitations.