Template-Based Metadata Extraction for Heterogeneous Collection

Jienfeng Tang
With the growth of the Internet and related tools, the number of online resources has increased rapidly. In particular, high-quality OCR (Optical Character Recognition) tools have made it easy to convert an existing corpus into digital form and make it available online. However, a number of organizations have legacy collections that lack metadata. The lack of metadata hampers not only the discovery and dissemination of these collections over the Web, but also...