.avro

What is a .avro file?

Avro is a row-based data serialisation format with built-in schema, widely used in Kafka and Hadoop ecosystems.

Safe format
Type Data
By Apache Software Foundation (Doug Cutting)
MIME application/avro

Drop any file to identify it

No upload. No signup. No sending your file halfway across the internet.
We tell you what it is, right here in your browser.

What is it

Avro is the data format that knows what it contains. Every Avro file embeds its schema — the structure definition that describes the data's fields, types, and organisation. This self-describing nature means any tool can read an Avro file without external documentation. Send someone an Avro file and they know exactly what's inside.

The format is row-based (unlike Parquet's columnar approach), which makes it ideal for write-heavy workloads: event streaming, message queues, and data ingestion. Apache Kafka's ecosystem is deeply integrated with Avro — the Schema Registry, Kafka Connect, and most Kafka producers/consumers use Avro serialisation by default.

Avro's schema evolution is its killer feature. You can add fields, remove fields, and change defaults without breaking existing consumers. Old readers skip unknown fields; new readers use defaults for missing fields. This forward and backward compatibility makes Avro essential in systems where producers and consumers evolve independently.

Technical details
Full Name
Apache Avro
MIME Type
application/avro
Developer
Apache Software Foundation (Doug Cutting)
Magic Bytes
4F 62 6A 01
Safety
.avro is a known, safe format. Binary data format. No executable content.
What opens it
avro-tools (CLI)
FREE Windows / Mac / Linux
Python (fastavro)
FREE Windows / Mac / Linux
Apache Spark
FREE Windows / Mac / Linux
FAQ
What's the difference between Avro and Parquet?
Avro is row-based — fast for writing and streaming (Kafka, event ingestion). Parquet is columnar — fast for reading and analytics (queries, aggregations). Use Avro for data pipelines, Parquet for data warehouses.
Why is Avro used with Kafka?
Avro's compact binary encoding is efficient for high-throughput messaging. Its schema evolution support lets producers and consumers update independently. The Kafka Schema Registry manages Avro schemas centrally.
Related formats