Using Our Tools · Guide · File & Format Converters
File Converter Troubleshooting Guide
Systematic troubleshooting for failed conversions: 6 common failure modes, why conversions take so long, handling very large files (over 500 MB), speed-up tips, and recovery options when a converter fails entirely.
Converters fail. Browsers run out of memory mid-conversion, output files are corrupted, OCR returns gibberish, and the “30 seconds” turns into 30 minutes of waiting. This guide is the systematic troubleshooting playbook — what to check, in what order, and the recovery options when things go truly wrong.
Advertisement
Common failure modes (in order of frequency)
1. Output looks corrupted or garbled
Almost always one of: wrong source format detected, encoding mismatch (UTF-8 vs Latin-1), or the file was a different format than its extension claimed. Diagnostic:
- Open the source in a hex editor (HxD on Windows, xxd on Mac/Linux). The first few bytes are usually a magic number identifying the actual format.
- Check the file extension matches the magic number. .pdf with no “%PDF” in the first 4 bytes is a renamed file.
- For text: check encoding. iconv (Mac/Linux) or Notepad++ “Encoding” menu (Windows) lets you re-save with explicit UTF-8.
2. Browser tab crashes or hangs
Memory exhaustion. Browser-side conversion holds the entire file in RAM, sometimes multiple times (during decoding, during processing, during encoding). For very large files, browser tabs are limited to around 2-4 GB of memory.
Fixes:
- Close other tabs to free memory.
- Try Chrome instead of Safari (Chrome handles big tabs better in our experience).
- Split the file before converting — split a 1 GB PDF into 4 chunks of 250 MB.
- Switch to a desktop tool (FFmpeg, ImageMagick, Pandoc) for files > 500 MB.
3. OCR returns gibberish
Tesseract gibberish usually means: low-resolution input, language mismatch (asking English OCR to parse Spanish text), or non-text content (a photograph mistaken for a scan). Fixes:
- Verify language selection matches the source.
- If the source is a scanned photo with poor lighting, retake or scan with better lighting before OCR.
- For mixed-language documents, run OCR per page or section with the right language each time.
- For low-resolution scans (under 200 DPI), upscale first using image resizer with bicubic interpolation, then OCR.
4. PDF won’t open in target app
The PDF is corrupted, password-protected, or uses a feature the target app doesn’t support (XFA forms, layered PDFs). Diagnostic:
- Try opening in a different app (Adobe Reader, Preview, Chrome). If it opens elsewhere, the original target has a feature gap.
- Check for password protection — converters often fail silently on encrypted PDFs.
- For corrupted PDFs: our PDF organizer or pdf-lib-based tools can sometimes salvage the readable pages.
5. Conversion completes but output is empty
Usually the source had no extractable content for the requested target. Examples: scanned PDF (no extractable text — needs OCR), image with transparency lost in flat conversion, encrypted document.
6. Special characters or accents missing
Encoding problem. Re-save the source as UTF-8, retry. If the source was Windows-1252 (common for Excel exports on Western European machines), the conversion needs to know to re-encode.
Why some file conversions take so long
Speed depends on:
- Input file size and complexity. A 500 MB PDF takes longer than a 5 MB one. Multi-column or image-heavy PDFs are slower than text-only.
- Conversion type. Lossless reformats (PNG to BMP) are fast. Reflow + restructure (PDF to DOCX preserving layout) are slow.
- Browser-side vs cloud. Cloud servers have GPUs and more RAM. Browser-side relies on your CPU. Slower but private.
- OCR specifically. Always slow. 5–15 seconds per page for browser-side Tesseract. 1–2 seconds per page for cloud GPU OCR.
- Background tabs. Browsers throttle CPU on background tabs. Keep the conversion tab in foreground.
Converting large files: does size matter?
Yes, in three ways:
- Memory pressure. Browser tabs cap around 2-4 GB. Files over 500 MB are at risk of running out of memory mid-conversion. Symptoms: hang, crash, “page unresponsive” warnings.
- Linear-or-worse time complexity. Some conversions scale linearly with file size (bytes processed). Others scale super-linearly (image processing, video encoding). 10× the file size is rarely 10× the time.
- Upload bandwidth (if cloud-based). A 1 GB upload to cloud OCR over a 10 Mbps connection takes 13 minutes just to upload — before any processing.
For files > 500 MB, browser-only converters become unreliable. Switch to a desktop tool: FFmpeg for video, ImageMagick for images, Pandoc or LibreOffice headless for documents. They handle gigabytes without breaking a sweat.
Tips to convert files faster
- Keep the conversion tab in foreground. Browser background tabs are throttled.
- Close other tabs. Free up memory and CPU for the conversion.
- Use lower-quality settings if you don’t need pristine output. 90% JPG is visually identical to 100% but converts faster. 720p video instead of 1080p halves encoding time.
- Process in batches. Don’t convert 100 files sequentially in 100 separate operations. Use a batch tool (FFmpeg, Pandoc) that re-uses the loaded engine.
- Skip unnecessary intermediate steps. PDF → DOCX → PDF is wasteful if you just need a different PDF. Find the direct path.
- Pre-process if possible. Strip pages you don’t need before OCR. Resize huge images before format conversion. Don’t OCR pages that are already text-extractable.
What happens when a file converter fails: recovery options
Failure recovery in priority order:
- Retry with a different tool. If browser-only fails, try desktop. If one cloud service fails, try another. Different engines have different failure modes — what one chokes on, another handles fine.
- Verify the source isn’t corrupted. Open the source file in its native app. If it opens fine there, the converter is the problem. If it doesn’t open, the source is the problem.
- Try converting via an intermediate format. Direct PDF → DOCX failing? Try PDF → ODT → DOCX, or PDF → Markdown → DOCX. The intermediate often dodges whatever the direct path is choking on.
- Salvage what you can. If the conversion partially succeeds (50 pages converted out of 100), accept the partial result and re-process only the failed pages.
- Recover from backup or original. Always keep the original. If the conversion eats the file, you have the master.
- If genuinely stuck: post on r/SoftwareRecs or SuperUser with the source file format, target format, and exact error. Specific edge cases often have community-known solutions.
Use these while you read
Tools that pair with this guide
- Color ConverterConvert any color between HEX, RGB, and HSL. Paste a hex code or tweak channels and see every format update live.File & Format Converters
- Unix Timestamp ConverterConvert Unix epoch seconds or milliseconds to a human date, or a date to an epoch. Timezone-aware, runs offline.File & Format Converters
- Number Base ConverterConvert any number between binary, octal, decimal, and hexadecimal at the same time. Clean UI, no ads.File & Format Converters
- CSV to JSONPaste CSV rows and get clean JSON. Handles quoted fields, custom delimiters, and header-row toggling. No upload needed.File & Format Converters
Frequently asked questions
My file converter isn't working — what do I do?
Systematic checks: (1) verify source file isn't corrupted (opens in native app?), (2) check magic-number vs file extension match, (3) for browser hangs, close other tabs / try Chrome / split the file, (4) for gibberish output, check encoding (UTF-8 vs Latin-1) and OCR language. If still stuck, try a different converter or convert via an intermediate format.
What happens when a file converter fails?
Recovery options: retry with a different tool, verify source isn't corrupted, try an intermediate format (PDF → ODT → DOCX often works when direct PDF → DOCX fails), salvage partial output, recover from original (always keep masters). Last resort: post on r/SoftwareRecs with format details — specific edge cases have community-known fixes.
Why do file conversions take so long?
Five factors: file size and complexity, conversion type (lossless reformats are fast, reflow + restructure are slow), browser-side vs cloud (cloud has GPUs), OCR is always slow, browser background tabs get CPU-throttled. To speed up: keep tab in foreground, close other tabs, use lower quality where acceptable, batch with desktop tools.
Does file size affect conversion speed?
Yes. Memory pressure caps browser tabs around 2-4 GB. Time complexity is linear-or-worse with size. Upload bandwidth matters for cloud tools. Files over 500 MB push browser tools to their limits — switch to FFmpeg / ImageMagick / Pandoc / LibreOffice headless desktop tools for gigabyte-scale files.
Advertisement
Continue reading
- Using Our ToolsEbook and 3D File Conversion GuideConvert PDF to EPUB for e-readers, scientific PDFs with math symbols, STL to OBJ for 3D printing, and 3MF without losing details. Includes Calibre and Blender workflows for niche conversion needs.
- Using Our ToolsFile Formats You Actually Need to KnowPractical taxonomy of the ~20 formats covering 95% of everyday use — documents, images, data, audio/video, archives, and the rare ones. Real differences between PNG/JPG/SVG, EPUB/MOBI/AZW3, CSV/JSON/XML/YAML.
- Using Our ToolsConverting Files Without Losing QualityLossy vs lossless conversions, image format tradeoffs (PNG, JPG, WebP, AVIF, SVG), preserving metadata, compression effects, multi-stage conversion damage, and keeping structure (tables, lists) across format changes.
- Using Our ToolsCSV and Spreadsheet Data Conversion GuideConvert between CSV, JSON, XML, YAML, and Excel cleanly. The CSV-to-Excel pitfalls (auto-corruption of ZIP codes), JSON-to-CSV with nested data, XML-to-CSV row detection, and how to keep structure across formats.
- Using Our ToolsPDF to Word and Editable Text Conversion GuideHow to convert PDF to Word in 30 seconds (text-based PDFs), how to OCR scanned and handwritten PDFs, and which tools handle complex layouts (tables, multi-column, scientific) — with free vs paid recommendations.
- Using Our ToolsBest Free File Converters for Mac and WindowsCross-platform breakdown of free file converters — what's built into macOS and Windows, browser-only converters that need no install, and the cross-platform desktop tools (Pandoc, FFmpeg, ImageMagick, Calibre, LibreOffice).