A practical document parsing tool for converting PDF, images, DOCX, PPTX, and XLSX into Markdown and JSON