FlexFiles Documentation
Product guides, tutorials, and operational reference
Troubleshooting
All users7 min read

Common Conversion Issues

Most conversion issues are caused by source quality, malformed PDFs, embedded font differences, or scanned content that needs preprocessing.

Common causes

  • Scanned or image-only PDFs without OCR.
  • Malformed or damaged source PDFs.
  • Large files with embedded assets or unusual fonts.
  • Skewed scans or low-resolution images.

Recommended fix sequence

  1. 1

    Repair the PDF if the source behaves inconsistently.

  2. 2

    Deskew scanned pages if text alignment is poor.

  3. 3

    Run OCR before text extraction or AI retrieval.

  4. 4

    Compress only after the file content is final.

How to interpret the symptom

  • Missing text usually points to OCR or source readability issues.
  • Broken layout often points to fonts, embedded objects, or malformed input.
  • Oversized outputs often come from image-heavy content or merging too early without cleanup.

FAQ

Why is extracted text missing or incomplete?

The file is often image-based or low quality. OCR and image enhancement usually improve extraction results.

Why is my output file too large?

Run compression after merge or edit steps and remove unnecessary image-heavy source pages where possible.