Using Our Tools · Guide · File & Format Converters
File Formats You Actually Need to Know
Practical taxonomy of the ~20 formats covering 95% of everyday use — documents, images, data, audio/video, archives, and the rare ones. Real differences between PNG/JPG/SVG, EPUB/MOBI/AZW3, CSV/JSON/XML/YAML.
“What file formats do I actually need to know about?” gets asked weekly on r/learnprogramming and r/computerscience. The honest answer: maybe 20 formats cover 95% of everyday usage. The rest are historical curiosities or domain-specific.
This guide is the practical taxonomy: the formats you’ll actually encounter, what they’re for, and the real differences between similar-looking ones (PNG vs JPG vs SVG, for example).
Advertisement
Document formats
- PDF (Portable Document Format): visual-fidelity-first. Looks the same on every device. Hard to edit. Default for sharing finished documents.
- DOCX (Microsoft Word): editable rich text + layout. Works in Word, Google Docs, LibreOffice, Pages. The default for collaborative drafting.
- ODT (OpenDocument Text): open-source DOCX equivalent. Used by LibreOffice. Round-trips with DOCX with minor formatting drift.
- RTF (Rich Text Format): simpler than DOCX, plain-ish text with basic formatting. Mostly historical; useful for compatibility with old or specialized software.
- TXT (plain text): just characters, no formatting. The universal lowest-common-denominator format.
- MD (Markdown): plain text with structural conventions (# for headings, * for bold, etc.). The format engineers and technical writers prefer.
- EPUB: reflowable ebook format. Used by Kindle (via conversion to AZW3), iBooks, Adobe Digital Editions. Better than PDF for reading on small screens.
Image formats: PNG, JPG, SVG, WebP, AVIF (the differences explained)
Each image format optimizes for something different:
- JPG / JPEG: lossy compression for photos. 30+ years old; universal compatibility. Use for: photos, photo-realistic images, anywhere you need maximum compatibility.
- PNG: lossless. Supports transparency. Use for: screenshots, line art, logos with transparency, anything with sharp edges.
- GIF: animated images, lossless palette compression (256 colors max). Mostly replaced by video formats and WebP/AVIF for animation. Still ubiquitous because of compatibility.
- SVG (Scalable Vector Graphics): vector format. Defined by math, not pixels — infinite resolution. Use for logos, icons, anything that needs to scale.
- WebP: Google’s modern format. Both lossy + lossless modes. 25–35% smaller than JPG/PNG at similar quality. Wide browser support. Use for web images.
- AVIF: newer than WebP, even better compression. Browser support reaching parity. Use for cutting-edge web; fallback to WebP for older browsers.
- HEIC / HEIF: Apple’s default for iPhone photos. Better compression than JPG. Compatibility outside Apple ecosystem is spotty — convert to JPG before sharing widely. Use our HEIC to JPG tool.
- BMP: uncompressed bitmap. Huge files. Mostly historical / Windows-specific.
- TIFF: high-quality lossless image. Used in publishing, scanning, professional photography. Big files.
Quick conversions on this site: PNG/JPG/WebP converter, SVG to PNG, HEIC to JPG, WebP to JPG.
Data formats: CSV, JSON, XML, YAML
- CSV (comma-separated values): oldest, simplest, spreadsheet-native. Each row is a line; columns separated by commas. Works with Excel, R, Python pandas, anything. Limitation: no nested data.
- JSON (JavaScript Object Notation): nested data structure. Web API default. Hierarchical objects + arrays. Most modern data exchange.
- XML (eXtensible Markup Language): hierarchical with attributes. Older than JSON; common in enterprise systems, SOAP APIs, document formats (DOCX, ODT internally are zipped XML). More verbose than JSON but more expressive (attributes, namespaces).
- YAML (YAML Ain’t Markup Language): human-friendly nested data. Used for configs (Kubernetes, Ansible, GitHub Actions). Indentation-sensitive — easy to break with whitespace.
- TOML: simpler than YAML for configs. Used by Cargo (Rust), Poetry (Python).
- Parquet, Avro, ORC: columnar binary formats for big data. Smaller and faster than CSV/JSON for analytics workloads. Used with Spark, BigQuery, Snowflake. Not human-readable.
Conversions on this site: JSON to CSV, CSV to JSON, XML to CSV, YAML ↔ JSON, CSV to Excel.
Audio + video formats
Quick reference:
- Audio lossless: WAV (uncompressed), FLAC (compressed lossless), ALAC (Apple lossless).
- Audio lossy: MP3 (universal), AAC (newer, better compression), OGG (open-source), Opus (best compression).
- Video lossy: H.264/MP4 (universal default), H.265/HEVC (better compression but patent-encumbered), VP9 (Google’s royalty-free), AV1 (newest royalty-free, best compression).
- Video containers: MP4, MOV, MKV, WebM, AVI. The container holds the codec — same H.264 video can be in MP4, MOV, or MKV.
Audio/video conversion on this site is intentionally limited — proper transcoding needs FFmpeg, which is too heavy for browser-only.
Archive + compression formats
- ZIP: universal, every OS supports. Combines lossless compression + multiple files in one archive.
- 7z: better compression than ZIP. Requires 7-Zip or compatible.
- RAR: proprietary. Was popular for piracy; declining.
- TAR: Unix archive (just bundles, no compression). Combined with gzip → .tar.gz, with bzip2 → .tar.bz2, with xz → .tar.xz.
- Single-file compression: gz, bz2, xz — compress one file at a time. Combined with TAR for archives of multiple files.
Converting between rare file formats
For unusual or domain-specific formats:
- 3D printing (STL, OBJ, 3MF, STEP): see our ebook + 3D conversion guide.
- CAD (DWG, DXF): AutoCAD’s DWG is proprietary; DXF is more open. Free tools: LibreCAD, FreeCAD, ODA File Converter.
- GIS (Shapefile, GeoJSON, KML): QGIS handles them all, free + open-source.
- DAW project files (.als, .flp, .logicx): proprietary, no clean cross-DAW conversion. Export stems as WAV/AIFF.
- E-reader formats (AZW3, MOBI, KFX): Calibre is the universal converter. KFX (Kindle’s newest) is partially supported.
- Genealogy (GEDCOM): universal genealogy format. Most genealogy software imports / exports it.
For truly rare formats, the workflow is: search GitHub for an open-source converter (often exists), or extract the underlying data via a library (most rare formats are zipped XML or binary with a published spec).
Use these while you read
Tools that pair with this guide
- Image Format ConverterConvert between JPG, PNG, and WebP in your browser. Nothing is uploaded. Ideal for quick format swaps before sharing.File & Format Converters
- Color ConverterConvert any color between HEX, RGB, and HSL. Paste a hex code or tweak channels and see every format update live.File & Format Converters
- Unix Timestamp ConverterConvert Unix epoch seconds or milliseconds to a human date, or a date to an epoch. Timezone-aware, runs offline.File & Format Converters
- Number Base ConverterConvert any number between binary, octal, decimal, and hexadecimal at the same time. Clean UI, no ads.File & Format Converters
Frequently asked questions
What file formats do I actually need to know about?
About 20 formats cover 95% of everyday use: PDF/DOCX/ODT/RTF/TXT/MD/EPUB for documents, JPG/PNG/GIF/SVG/WebP/AVIF/HEIC/TIFF for images, CSV/JSON/XML/YAML for data, MP3/AAC/MP4/H.264 for audio/video, ZIP/TAR.GZ for archives. Everything else is domain-specific.
What's the difference between PNG, JPG, and SVG?
JPG: lossy compression for photos. PNG: lossless with transparency, for screenshots and line art. SVG: vector format, infinite resolution, for logos and icons. JPG/PNG are pixel-based; SVG is math-based. Use JPG for photos, PNG for sharp-edged content, SVG for anything that needs to scale.
Can I convert between rare file formats?
Yes for most cases. 3D printing (STL/OBJ/3MF) — see our 3D conversion guide. CAD (DWG/DXF) — LibreCAD/FreeCAD. GIS (Shapefile/GeoJSON/KML) — QGIS. E-reader (AZW3/MOBI) — Calibre. Genealogy (GEDCOM) — most genealogy software. For truly obscure formats, search GitHub — open-source converters often exist.
What's the best file converter for graphics?
For PNG/JPG/WebP/AVIF: our image format converter handles all of them browser-side. For HEIC: HEIC to JPG. For SVG to raster: SVG to PNG with resolution slider. For batch image conversion: macOS Quick Actions, Windows ImageMagick, or cross-platform GIMP for advanced work.
Advertisement
Continue reading
- Using Our ToolsCommon File Conversion Questions AnsweredQuick answers to recurring file conversion questions — Office to plain text, metadata privacy, graphics format choices, speed and size, encrypted files, rare formats. Each links to a deeper guide.
- Using Our ToolsHow File Converters Actually WorkThe decode-transform-encode pipeline every converter uses, why some information is permanently lost, whether you can convert encrypted files, how to handle rare formats, and the converter myths you should ignore.
- Using Our ToolsFile Converter Troubleshooting GuideSystematic troubleshooting for failed conversions: 6 common failure modes, why conversions take so long, handling very large files (over 500 MB), speed-up tips, and recovery options when a converter fails entirely.
- Using Our ToolsConverting Files Without Losing QualityLossy vs lossless conversions, image format tradeoffs (PNG, JPG, WebP, AVIF, SVG), preserving metadata, compression effects, multi-stage conversion damage, and keeping structure (tables, lists) across format changes.
- Using Our ToolsBulk and Batch File Conversion GuideConvert hundreds of files at once. Three approaches (one-time non-sensitive, one-time sensitive, recurring batch automation), automation snippets for macOS / Windows / cross-platform, why batch is sometimes slow, tools by scale.
- Using Our ToolsPDF to Word and Editable Text Conversion GuideHow to convert PDF to Word in 30 seconds (text-based PDFs), how to OCR scanned and handwritten PDFs, and which tools handle complex layouts (tables, multi-column, scientific) — with free vs paid recommendations.