This is a web-based application that allows users to upload PDF files and convert them into structured XML format. The application is built using Flask, SQLAlchemy, and other modern web technologies.
This Clowder extractor converts pdf documents to text and json. It uses GROBID 0.8.0 to convert pdf to xml and then uses s2orc-doc2json to convert xml to json. The doc2txt/json2txt is used to convert ...