A Repo For Document AI
-
Updated
Jan 14, 2025 - Python
A Repo For Document AI
Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pipelines (GenAI, LLM, VLLM) into your applications, supporting various tasks such as document cleanup, optical character recognition (OCR), classification, splitting, named entity recognition, and form processing
Add a description, image, and links to the pubtabnet topic page so that developers can more easily learn about it.
To associate your repository with the pubtabnet topic, visit your repo's landing page and select "manage topics."