🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
PDFs are hard to process and it's hard to extract information.
So the results of this tool may not satisfy you.
There will be more work to improve this software but altogether, it's unlikely that it …