๐ Smart Document Parser
A powerful document parsing application that automatically extracts structured information from various document formats. Upload a document or provide a URL (PDF, DOCX, TXT, HTML, Markdown) and get structured information automatically.
Document Metadata
Property | Value |
|---|---|
๐ Supported Formats
- PDF Documents (*.pdf)
- Word Documents (*.docx)
- Text Files (*.txt)
- HTML Files (*.html)
- Markdown Files (*.md)
๐ Example URLs
- ArXiv PDFs: https://arxiv.org/pdf/2408.08921.pdf
- Research Papers
- Documentation
๐ Features
- Multiple Format Support: PDF, DOCX, TXT, HTML, and Markdown
- Support for File Upload and URLs
- Rich Information Extraction
- Smart Processing with Confidence Scoring
- Automatic Format Detection
Made with โค๏ธ using Docling and Gradio