Abstract:
Millions of structured, semi-structured, and unstructured documents are produced around the globe a day today. Several research societies like IEEE, Elsevier, Springer, and Wiley that we use to publish the scientific documents enormously and some individual's documents are sources of such data. Thanks to their massive volume and ranging document formats, search engines face problems in indexing such documents, thus retrieving inefficient, tedious, and time-consuming data. Information extraction from such documents is among the most well-liked areas of research in data/text mining. Because the number of such documents is increasing tremendously day by day on a large scale that's why proper and more sophisticated information extraction techniques are necessary to find out, this research focuses on reviewing and summarizing existing practices in information extraction to highlighting their limitations