I need help converting a PDF file. The conversion can go either from PDF to TXT or from PDF to JSON. The PDF files will always follow a fixed pattern of text location and content.
Converting PDF to text is not trivial. Even though pdf2txt exists, it relies on textual information being embedded in the PDF.
In the worst case, you might need OCR software to extract the text from pixel data.
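
As a rough sketch of how this could fit together, the Python below first tries to read the embedded text, checks whether anything came back (if not, OCR is probably needed), and then turns the fixed layout into JSON. It assumes the third-party pdfminer.six package is installed for the extraction step. The "Label: value" line layout, the function names and the sample field names are all made up for illustration, since the real layout of your files isn't shown; adapt the pattern to what your PDFs actually contain.

```python
import json
import re


def extract_embedded_text(pdf_path):
    """Return the text embedded in the PDF (may be empty for scanned files)."""
    # Assumption: pdfminer.six is installed (pip install pdfminer.six).
    # Imported lazily so the parsing helpers below work without it.
    from pdfminer.high_level import extract_text
    return extract_text(pdf_path)


def needs_ocr(text):
    """Heuristic: no non-whitespace characters means no usable text layer."""
    return not text.strip()


# Hypothetical layout: each field sits on its own line as "Label: value".
FIELD_PATTERN = re.compile(r"^\s*(?P<key>[^:]+?)\s*:\s*(?P<value>.*\S)\s*$")


def text_to_record(text):
    """Collect every "Label: value" line into a dict; other lines are skipped."""
    record = {}
    for line in text.splitlines():
        match = FIELD_PATTERN.match(line)
        if match:
            record[match.group("key")] = match.group("value")
    return record


def text_to_json(text):
    """Serialize the parsed fields as a JSON object."""
    return json.dumps(text_to_record(text), indent=2)
```

You would call `extract_embedded_text("your_file.pdf")`, and if `needs_ocr(...)` returns True for the result, run the pages through an OCR tool first and feed its output to `text_to_json` instead. If your fields are identified by position on the page rather than by labels, the regular expression would not be enough and you would need the layout coordinates from the extraction library.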