Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
Updated Jun 25, 2026 - Python
[ECCV 2026] A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
PDF to Markdown/Word: the latest version of MinerU as a no-install, one-click-launch all-in-one package.
🛠️ Dump Unreal Engine data from Android games, generate SDK and function scripts, and automate address scanning for efficient game analysis.
🎨 Enhance cinematic image quality with ComfyUI-None-upup. This AI engine offers nodes for clarity, brightness, and video processing to elevate your visuals.
`pdf2struct` extracts structured JSON from PDF documents.
Generate ComfyUI custom nodes for Foundation-1 to create structured text-to-sample music with fast, local audio diffusion workflows
🤖 Process SCAIL-pose data with ComfyUI nodes, utilizing VitPose for accurate face and hand detection in an efficient, streamlined setup.
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
🚀 Enable true multi-GPU processing in ComfyUI with independent model replicas for efficient, simultaneous batch execution across multiple devices.
Implements Unreal Engine 5 network protocol in Python to connect, authenticate, and replicate actors with UE5 Lyra Starter Game servers.
Build a minimalist, stylish brand mall template using uni-app, Vue2, and Tuniao UI for quick e-commerce and fashion store front development.