Troubleshooting
All users7 min read
Common Conversion Issues
Most conversion issues are caused by source quality, malformed PDFs, embedded font differences, or scanned content that needs preprocessing.
Common causes
- Scanned or image-only PDFs without OCR.
- Malformed or damaged source PDFs.
- Large files with embedded assets or unusual fonts.
- Skewed scans or low-resolution images.
Recommended fix sequence
- 1
Repair the PDF if the source behaves inconsistently.
- 2
Deskew scanned pages if text alignment is poor.
- 3
Run OCR before text extraction or AI retrieval.
- 4
Compress only after the file content is final.
How to interpret the symptom
- Missing text usually points to OCR or source readability issues.
- Broken layout often points to fonts, embedded objects, or malformed input.
- Oversized outputs often come from image-heavy content or merging too early without cleanup.
FAQ
Why is extracted text missing or incomplete?
The file is often image-based or low quality. OCR and image enhancement usually improve extraction results.
Why is my output file too large?
Run compression after merge or edit steps and remove unnecessary image-heavy source pages where possible.