Document File Formats

Formats: 32
Most common: .azw3, .bib, .cbr
About document files

Document formats are where proprietary lock-in meets daily life. You need Word to properly open a .docx. You need Adobe to do anything useful with a .pdf form. Open standards exist (ODT, ODS) but the world runs on Microsoft's formats whether it likes it or not. The good news: most document formats are more interoperable than their creators would prefer.

The big split is editable versus final. DOCX, XLSX, and PPTX are working documents — you create them, collaborate in them, track changes. PDF is the finished product — layout-locked, device-independent, impossible to casually edit. Everything else is a variation on this theme: ODT for the open-source crowd, RTF for maximum compatibility at the cost of features, TXT for people who don't trust formatting.

E-books sit in this category too, and they have their own compatibility wars. EPUB is the open standard. Amazon spent years insisting on proprietary formats. Kindle now accepts EPUB through Send to Kindle, which converts the file on Amazon's side rather than opening it natively, so the scars remain. If you're sharing a document, PDF is the safe choice. If you're collaborating, DOCX is the expected one.

All document formats
.azw3 AZW3 is Amazon's modern e-book format (also called KF8) - su...
.bib BIB is a BibTeX bibliography database - a structured list of...
.cbr CBR is a RAR archive containing sequential images - an older...
.cbz CBZ is a ZIP archive containing sequential images - the stan...
.djvu DjVu is a document format optimised for scanned documents, p...
.doc DOC is the legacy Microsoft Word format - a proprietary bina...
.docx DOCX is Microsoft Word's default document format - an XML-ba...
.eml EML is the standard email file format - a single message in ...
.epub EPUB is the open standard for e-books - reflowable text that...
.fb2 FB2 is an XML-based e-book format popular in Russia and East...
.hwp HWP is the standard document format in South Korea, used by ...
.indd InDesign Document is Adobe's native format for page layout...
.key KEY is Apple's presentation format - known for beautiful tem...
.mbox MBOX is a file format that stores multiple email messages co...
.md Markdown is a lightweight text formatting syntax - headings ...
.mobi MOBI is Amazon's legacy e-book format - historically used by...
.msg MSG is Microsoft Outlook's proprietary format for saving ind...
.numbers Numbers is Apple's spreadsheet format - free on Mac, iPad, a...
.odp ODP is the open-standard presentation format - a free altern...
.ods ODS is the open-standard spreadsheet format - a free alterna...
.odt ODT is the open-standard word processing format - a free, IS...
.pages Pages is Apple's native word processing format - editable on...
.pdf PDF is a universal document format that preserves exact layo...
.ppt PPT is the legacy Microsoft PowerPoint format - binary, supe...
.pptx PPTX is Microsoft PowerPoint's default presentation format...
.rtf RTF is a cross-platform document format - basic formatting w...
.tex TEX is a LaTeX source file - the typesetting language academ...
.txt TXT is the simplest file format - raw text with no formattin...
.wps WPS is the word processor format from Microsoft Works - a bu...
.xls XLS is the legacy Microsoft Excel format - binary, limited t...
.xlsx XLSX is Microsoft Excel's default spreadsheet format - formu...
.xps XPS is Microsoft's fixed-layout document format - a PDF alte...
Safety notes
.eml Use caution

Email files can contain phishing links and malicious attachments. Exercise caution with EML from unknown sources.

.mbox Use caution

Contains email messages which may include malicious attachments or phishing content.

.msg Use caution

Email messages can contain malicious attachments and phishing links. Treat MSG files from unknown senders with caution.
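
One way to act on the caution above is to look inside an EML file as data before opening it in a mail client. The sketch below uses Python's standard `email` package to list the sender, subject, and attachment names without rendering or executing anything. The function name `summarize_eml` and the shape of the returned dictionary are illustrative choices, not part of any standard, and this is a triage aid rather than a malware scanner.

```python
from email import message_from_bytes, policy


def summarize_eml(raw: bytes) -> dict:
    """Return basic headers and attachment metadata for one EML message.

    Hypothetical helper for illustration: it only parses the message,
    it never opens or runs any attachment.
    """
    # policy.default yields an EmailMessage, which provides iter_attachments().
    msg = message_from_bytes(raw, policy=policy.default)
    attachments = [
        (part.get_filename(), part.get_content_type())
        for part in msg.iter_attachments()
    ]
    return {
        "from": str(msg.get("From", "")),
        "subject": str(msg.get("Subject", "")),
        "attachments": attachments,
    }
```

An unexpected executable or archive attachment, or a sender that does not match the displayed name, is a reason to stop there. For MBOX files, the standard library's `mailbox.mbox` class can iterate over the stored messages so each one can be checked the same way. MSG is Outlook's proprietary format and is not handled by this sketch.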

FAQ
What's the difference between DOC and DOCX?
DOC is the old binary format from before Word 2007 — proprietary and hard for non-Microsoft software to parse. DOCX is XML inside a ZIP archive, which is why Google Docs, LibreOffice, and other tools can read it reliably. Always use DOCX for new documents.
How do I open a DOCX without Word?
Google Docs (free, web-based) handles most DOCX files well. LibreOffice Writer (free, desktop) is a full word processor. Apple Pages (free on Mac) also opens DOCX. Complex formatting may shift slightly in non-Word applications.
What's the most universal document format?
PDF for sharing (layout-locked, opens everywhere). DOCX for editable documents (the de facto standard). TXT for absolute universality at the cost of all formatting.
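
The DOC versus DOCX answer above can be checked directly, because the two containers start with different bytes. The sketch below is a minimal illustration in Python: it treats the OLE2 compound-file signature as "legacy binary" and a ZIP archive containing `word/document.xml` as DOCX. The function name `sniff_word_format` and its return labels are made up for this example, and looking for `word/document.xml` is a common convention rather than a guarantee, so treat the result as a hint.

```python
import io
import zipfile

# Signature of the OLE2 compound file container used by legacy Office files.
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")


def sniff_word_format(data: bytes) -> str:
    """Guess whether the bytes look like legacy DOC or ZIP-based DOCX.

    Hypothetical helper: "legacy-binary" also matches other OLE2 files
    such as XLS, PPT, and MSG, since they share the same container.
    """
    if data.startswith(OLE2_MAGIC):
        return "legacy-binary"
    buffer = io.BytesIO(data)
    if zipfile.is_zipfile(buffer):
        with zipfile.ZipFile(buffer) as archive:
            # Assumption: the main document part sits at its usual path.
            if "word/document.xml" in archive.namelist():
                return "docx"
    return "unknown"
```

Because a DOCX is an ordinary ZIP archive, the same `zipfile` module can list or extract its XML parts, which is what makes the format readable by tools other than Word.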