Loading spreadsheet data...
Login with your password to access the supplier data portal.
Access Levels:
Click "Export Excel" to download an updated spreadsheet with all your changes.
The system uses a multi-layered extraction pipeline to capture 100% of data from PDFs:
| Layer | Engine | Best For |
|---|---|---|
| L1 | pdftotext | Native digital PDFs (fastest) |
| L2 | pdfplumber | Tables with structure |
| L3 | PyMuPDF | Complex vector layouts |
| L4 | Image-OCR | Embedded stamps/logos |
| L5 | Tesseract | Full page OCR (300 DPI) |
| L6 | EasyOCR | 80+ languages, varied fonts |
| L7 | PaddleOCR | Korean/Chinese/Japanese |
| L8 | Camelot Lattice | Bordered tables |
| L9 | Camelot Stream | Borderless tables |
✨ Layers 6-9 are NEW - providing enhanced multi-language and table support!
Enhance extraction by pasting copied text from the original document.
Copy text from layers, cells, or any part of the document and paste below. You can do multiple passes - the data will be combined.
or click to select files