Last Tuesday at 2 AM, I was staring at a 47MB JSON file from a client's API. They needed it in CSV for their analytics team, who only speak Excel. The JSON had nested objects three levels deep, arrays inside arrays, and inconsistent field names. My deadline was 8 AM.

If you've ever had to convert data between formats, you know this feeling. It should be simple. Copy, paste, done. Except it never is. The nested objects explode into weird column names. The special characters break. The encoding goes sideways. The 50,000-row file makes your browser tab crash.

This guide is everything I've learned about converting between JSON, CSV, XML, and YAML without losing your mind or your data.

Why There Are So Many Data Formats#

Every format was invented to solve a specific problem. CSV was designed for tabular data that spreadsheets could read. XML was designed for document markup that machines and humans could both parse. JSON was designed for lightweight data interchange on the web. YAML was designed for configuration files that humans actually want to read.

The problem is that data doesn't stay in one place. Your API returns JSON, but your business team wants Excel. Your microservice config is in YAML, but your deployment tool needs JSON. Your legacy system exports XML, but your modern frontend expects JSON.

Format	Best For	Weakness
JSON	APIs, web apps, NoSQL databases	No native date type, no comments
CSV	Spreadsheets, flat tabular data	No nested data, no types
XML	Enterprise systems, SOAP, documents	Verbose, complex parsing
YAML	Config files, DevOps pipelines	Indentation-sensitive, type coercion

JSON: The Lingua Franca of Modern Data#

JSON won the data format wars for web development. It's what your REST API returns, what your NoSQL database stores, and what your frontend consumes.

json

{
  "employees": [
    {
      "id": 1,
      "name": "Sarah Chen",
      "department": "Engineering",
      "skills": ["Python", "Rust", "SQL"],
      "address": {
        "city": "Portland",
        "state": "OR"
      }
    }
  ]
}

JSON has five data types: strings, numbers, booleans, arrays, and objects. This simplicity is both its strength and weakness. It's lightweight (30-50% smaller than equivalent XML) and every language has native JSON support.

The lack of a date type means dates get stored as strings — "2026-03-23", "2026-03-23T14:30:00Z", or even Unix timestamps. When converting between formats, date handling is consistently the number one source of bugs. No comments means you can't annotate your JSON files, which is why configuration often prefers YAML.

CSV: The Universal Spreadsheet Format#

CSV is the oldest format here and the most misunderstood. Everyone thinks CSV is simple. It's not.

csv

id,name,department,skills,city,state
1,Sarah Chen,Engineering,"Python, Rust, SQL",Portland,OR

Here's something most people don't know: there is no single CSV standard. RFC 4180 exists, but it's a memo, not a binding specification.

Excel uses commas in English locales but semicolons in European locales
Quoting rules vary — some tools escape quotes by doubling (""), others use backslashes
Line endings might be \n, \r\n, or \r
Encoding could be UTF-8, UTF-16, Latin-1, or Windows-1252

CSV is perfect when your data is genuinely flat: product catalogs, user lists, transaction logs. Every database and spreadsheet on earth can import CSV. But the moment your data has nesting, CSV struggles. How do you represent an array of skills in a cell? A nested address object? There's no standard answer.

XML: The Enterprise Heavyweight#

XML was supposed to be the universal data format. In the early 2000s, everything was XML. It's still critical in healthcare (HL7), finance (SWIFT/ISO 20022), and government systems.

xml

<employee id="1" active="true">
  <name>Sarah Chen</name>
  <department>Engineering</department>
  <skills>
    <skill>Python</skill>
    <skill>Rust</skill>
  </skills>
</employee>

XML has capabilities JSON doesn't: attributes on elements, namespaces for preventing naming collisions, XSD schema validation, and XSLT transformations. The tradeoff is verbosity — the XML version of any dataset is roughly 3x the size of JSON.

YAML: The Human-Friendly Format#

YAML is what JSON would look like if designed for humans. It's the default for Kubernetes manifests, Docker Compose, GitHub Actions, and Ansible.

yaml

employees:
  - id: 1
    name: Sarah Chen
    department: Engineering
    skills:
      - Python
      - Rust
      - SQL
    address:
      city: Portland
      state: OR

No curly braces, no commas, comments with #, multi-line strings. But YAML has infamous parsing surprises: NO (the country code) becomes boolean false. 1.0 becomes a float, not the string "1.0". Zip code 97201 becomes an integer unless quoted. Tabs are forbidden.

JSON to CSV Conversion: Where Things Get Real#

This is the conversion most people search for, and the one with the most pitfalls.

The Simple Case#

Flat JSON arrays convert cleanly:

json

[
  { "id": 1, "name": "Widget A", "price": 9.99, "stock": 150 },
  { "id": 2, "name": "Widget B", "price": 14.99, "stock": 75 }
]

csv

id,name,price,stock
1,Widget A,9.99,150
2,Widget B,14.99,75

Keys become headers, values become cells. Life is good.

The Nested Object Problem#

Real-world JSON is rarely flat. Here's where a json to csv converter needs to be smart:

json

[
  {
    "orderId": "ORD-001",
    "customer": { "name": "Alice Park", "email": "alice@example.com" },
    "items": [
      { "product": "Laptop", "qty": 1, "price": 999 },
      { "product": "Mouse", "qty": 2, "price": 29 }
    ],
    "shipping": { "city": "Seattle", "method": "express" }
  }
]

Three common flattening strategies:

Strategy 1 — Dot Notation: Flatten nested objects by joining keys with dots. customer.name, customer.email, shipping.city. Works for objects, but doesn't handle the items array.

Strategy 2 — Array Expansion: Create a row for each array element, duplicating parent data. One order with 5 items becomes 5 CSV rows. Accurate but increases file size.

Strategy 3 — Array Indexing: Use numbered columns: items.0.product, items.1.product. Keeps one row per order but creates sparse columns when array sizes vary.

Which to choose? Array expansion for database imports, dot notation for business reporting, array indexing for quick inspection. On akousa.net's JSON to CSV converter, you can choose your strategy before converting.

CSV to JSON: Rebuilding Structure from Flat Data#

Going the other direction has its own challenges.

The Type Inference Problem#

CSV has no types. Every cell is a string. Is "42" a string or number? Is "true" a string or boolean?

csv

name,age,active,score
Alice,28,true,95.5
Bob,35,false,87.0

Naive conversion keeps everything as strings. Smart conversion detects numbers and booleans:

json

[
  { "name": "Alice", "age": 28, "active": true, "score": 95.5 },
  { "name": "Bob", "age": 35, "active": false, "score": 87.0 }
]

But type inference can be wrong. If age is a string ID like "007", inference strips leading zeros. Good csv to json converters let you override detection per column.

Nested Header Reconstruction#

If your CSV headers use dot notation, some converters can rebuild the structure:

csv

name,address.street,address.city
Alice,123 Oak Ave,Portland

Becomes nested JSON with address as an object. The CSV to JSON converter on akousa.net supports this, saving hours of manual restructuring.

XML to JSON: The Format Mismatch#

This conversion is trickier than it looks because of fundamental structural differences.

The Attribute Problem#

XML elements can have attributes. JSON objects just have properties.

xml

<price currency="USD">79.99</price>

Where does currency go in JSON? Convention 1 uses @ prefixes ("@currency": "USD"). Convention 2 flattens it ("priceCurrency": "USD"). Convention 1 is lossless — you can round-trip back to XML. Convention 2 is cleaner but destructive.

Single Element vs. Array Ambiguity#

This is one of XML-to-JSON conversion's nastiest edge cases:

xml

<!-- One tag: converter produces a string -->
<tags><tag>audio</tag></tags>
 
<!-- Two tags: converter produces an array -->
<tags><tag>audio</tag><tag>bluetooth</tag></tags>

The tag field's type changes based on element count. Your code that does data.tags.tag.forEach(...) works for two tags but crashes for one. Good xml to json converters let you force certain elements to always be arrays.

YAML to JSON and Back#

Since YAML is a superset of JSON, this conversion is the most straightforward — but type coercion creates surprises.

yaml

name: Project Config
version: 1.0
debug: yes
ports:
  - 8080
  - 443

json

{
  "name": "Project Config",
  "version": 1.0,
  "debug": true,
  "ports": [8080, 443]
}

yes became true (YAML boolean coercion). 1.0 became a float. Going JSON to YAML is typically motivated by readability — the same CI/CD pipeline definition becomes dramatically more readable in YAML. The YAML tools on akousa.net handle this while warning about type coercion issues.

Note: YAML to JSON loses comments (JSON doesn't support them). Plan accordingly.

Edge Cases That Break Converters#

Special Characters and Quoting#

json

{ "message": "He said \"hello\" and she said 'goodbye'" }

In CSV, double quotes inside a quoted field get doubled: "He said ""hello"" and she said 'goodbye'". Most tools handle this, but some don't, silently corrupting data.

Newlines in Values#

A literal newline inside a CSV cell requires the cell to be quoted. Many simple parsers split on newlines first, breaking multiline values entirely. Test with a sample before processing full datasets.

Unicode and Excel#

JSON is always UTF-8. CSV might be anything. Convert JSON to CSV, open in Excel, and you might see Ã© instead of é because Excel assumed Windows-1252. Fix: add a UTF-8 BOM at the start of the CSV file.

Null vs. Missing Keys#

json

[{ "name": "Alice", "phone": null }, { "name": "Bob" }, { "name": "Carol", "phone": "555-0123" }]

In CSV, both null and missing keys become empty cells. The distinction is lost. If downstream systems treat these differently, you need a convention.

Deeply Nested Arrays#

Three levels of nesting (arrays inside arrays inside arrays) has no clean CSV representation. The pragmatic approach: split into multiple CSV files with foreign key relationships, essentially denormalizing the data.

Large File Handling#

For a 100MB JSON file, browser-based converters need 400-800MB of memory (raw text + parsed objects + output). Modern browser tabs get 1-4GB, so files up to ~200MB usually work. The converters on akousa.net use streaming parsing to keep memory proportional to the current chunk.

Command-Line Alternatives for Giant Files#

jq:

bash

jq -r '(.[0] | keys_unsorted) as $keys | $keys, (.[] | [.[$keys[]]]) | @csv' data.json > output.csv

Miller (mlr):

bash

mlr --json --ocsv cat data.json > output.csv
mlr --csv --ojson cat data.csv > output.json

Python:

python

import json, csv
with open('data.json') as f:
    data = json.load(f)
with open('output.csv', 'w', newline='') as f:
    writer = csv.DictWriter(f, fieldnames=data[0].keys())
    writer.writeheader()
    writer.writerows(data)

For files under 200MB, browser tools are faster. For anything bigger, jq and mlr are your friends.

API Response Conversion#

One of the most common scenarios: taking API JSON and getting it into Excel.

Collect all paginated API responses
Merge into a single JSON array
Flatten nested objects with dot notation
Convert to CSV using a json to csv converter
Open in Excel

The critical step is flattening. If the API returns nested attributes.pricing.amount, you want that as a column header, not a JSON blob in a cell.

Convert JSON to Excel: The Direct Path#

People searching for "convert json to excel" usually go JSON to CSV to Excel. The gotcha: Excel guesses types during CSV import, turning zip codes into numbers and stripping leading zeros.

Better approach: use Excel's "Import Data" feature instead of "Open File" — it lets you specify column types during import. Or use a converter that handles quoting and type preservation automatically.

Data Migration Use Cases#

MongoDB to PostgreSQL#

MongoDB exports JSON. PostgreSQL imports CSV. The path: export JSON, flatten nested documents to match your relational schema, convert to CSV, import with \copy.

Configuration Format Migration#

Moving Spring Boot XML config to YAML, or converting JSON Kubernetes manifests to YAML. Small files where accuracy matters more than speed. Always validate the output — an incorrect Kubernetes manifest means a failed production deployment.

Spreadsheet to API Pipeline#

Your product team maintains inventory in Google Sheets. Your API accepts JSON. The workflow: export the Sheet as CSV, run it through a csv to json online converter with type inference enabled, review the output for type correctness, and use it as your API payload. What used to require a custom Python script now takes 30 seconds.

Database Export and Reimport#

You need to export a PostgreSQL table, transform some fields, and reimport. Export as CSV, convert to JSON for easier manipulation with jq or JavaScript, make your transformations, convert back to CSV, reimport. The JSON intermediate format makes complex transformations much easier than trying to manipulate CSV directly.

Common Conversion Mistakes#

Not checking output. Always spot-check: first 10 rows, last 10 rows, 10 random rows from the middle. Verify row count matches. Two minutes of checking saves two days of debugging.

Losing number precision. A 64-bit integer like 9007199254740993 exceeds JavaScript's safe integer limit. Excel silently rounds it. Treat large integers as strings.

Assuming consistent structure. Three JSON objects with different key sets? A naive converter uses only the first object's keys, silently dropping fields from later records. Good converters scan all objects for the complete column set.

Encoding mismatch. JSON is UTF-8. CSV target is unclear. Your colleague in Japan opens it and every character is garbled. Always use UTF-8 with BOM for Excel compatibility.

Wrong root path. Not all JSON is an array. Some wraps data in {"metadata": {...}, "results": [...]}. Point the converter at results, not the root object.

Real-World Conversion Workflows#

Let me share complete workflows for the most common scenarios.

Workflow 1: API Data to Excel Report#

Your monitoring API returns JSON. Management wants a weekly Excel report.

Fetch the JSON data from your API endpoint
Open the JSON to CSV converter on akousa.net
Paste or upload the JSON
Select dot notation flattening for nested objects
Preview the output — check column names and data integrity
Download as CSV
Open in Excel, format as a table, add charts as needed

Total time: under 2 minutes for files up to 50MB.

Workflow 2: Legacy XML Feed to Modern API#

A partner sends XML data feeds daily. Your system needs JSON.

Open the XML to JSON converter
Upload the XML
Choose attribute handling (prefix with @ for lossless conversion)
Specify which elements should always be arrays
Convert and validate the structure matches your API schema
Integrate into your data pipeline

Workflow 3: DevOps Config Cleanup#

Your CI/CD config is in JSON but your team wants readable YAML.

Open the JSON to YAML converter
Paste your JSON configuration
Convert to YAML
Add comments explaining non-obvious settings
Validate YAML syntax
Commit to your repository

Batch Conversion#

When you have dozens or hundreds of files, doing them one by one isn't practical.

Directory Batch Processing#

bash

# Convert all JSON files in a directory to CSV
for f in *.json; do
  mlr --json --ocsv cat "$f" > "${f%.json}.csv"
done

Merging Multiple Files#

Sometimes you need to combine multiple JSON files into one CSV:

bash

jq -s 'add' file1.json file2.json file3.json | \
  jq -r '(.[0] | keys_unsorted) as $k | $k, (.[] | [.[$k[]]]) | @csv' > merged.csv

Or multiple CSVs into a single JSON:

bash

# Keep header from first file, data from all
head -1 file1.csv > merged.csv
for f in file*.csv; do tail -n +2 "$f" >> merged.csv; done
# Then convert merged.csv to JSON with your preferred tool

Python Batch Script#

python

import json, csv, glob
 
for filepath in glob.glob('data/*.json'):
    with open(filepath) as f:
        data = json.load(f)
 
    csv_path = filepath.replace('.json', '.csv')
    with open(csv_path, 'w', newline='') as f:
        # Collect all keys across all records
        all_keys = set()
        for record in data:
            all_keys.update(record.keys())
 
        writer = csv.DictWriter(f, fieldnames=sorted(all_keys))
        writer.writeheader()
        writer.writerows(data)

Note how this script collects all keys across all records — avoiding the "inconsistent structure" mistake mentioned earlier.

Validating After Conversion#

Conversion without validation is gambling.

Row count check:

bash

jq length input.json       # JSON array length
wc -l output.csv           # CSV rows (subtract 1 for header)

Round-trip testing — the gold standard: convert A to B to A and diff with the original. If the diff is empty, your conversion is lossless.

bash

diff <(jq -S . original.json) <(jq -S . roundtripped.json)

Schema validation for JSON output: use JSON Schema to enforce required fields, types, and formats.

json

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "type": "array",
  "items": {
    "type": "object",
    "required": ["id", "name", "email"],
    "properties": {
      "id": { "type": "integer" },
      "name": { "type": "string" },
      "email": { "type": "string", "format": "email" }
    }
  }
}

Spot-check protocol: Look at the first 10 rows, the last 10 rows, and 10 random rows from the middle. Check that special characters survived. Verify numeric precision. This takes two minutes and catches problems that automated checks miss.

Column/field completeness: For CSV output, verify every row has the same number of fields as the header. For JSON output, confirm all expected keys are present. Missing columns usually mean the converter encountered an unexpected structure and silently dropped data.

Conversion Performance Benchmarks#

To set realistic expectations, here are rough timelines for browser-based data format conversion:

File Size	Records	JSON to CSV	CSV to JSON	XML to JSON
1 MB	~5,000	< 1 second	< 1 second	1-2 seconds
10 MB	~50,000	2-5 seconds	2-5 seconds	5-10 seconds
50 MB	~250,000	10-20 seconds	10-20 seconds	20-40 seconds
100 MB	~500,000	20-45 seconds	20-45 seconds	40-90 seconds
200 MB+	1M+	Use CLI tools	Use CLI tools	Use CLI tools

These are approximate and depend on data complexity (nesting depth, field count) and your machine's RAM. Flat data converts faster than deeply nested structures.

Format Selection Guide#

Use JSON for REST APIs, document databases, web service data interchange, variable-structure data.

Use CSV for flat tabular data, Excel/Sheets compatibility, SQL database imports, maximum legacy interoperability.

Use XML for enterprise/government integration, SOAP APIs, schema-validated documents, XSLT pipelines.

Use YAML for configuration files humans maintain, Kubernetes/Docker/CI/CD, anything needing comments and readability.

Security Note#

Many online converters upload your data to a server. If your JSON contains customer emails, API keys, or financial data, you're sending it to a third party. The converters on akousa.net process everything in your browser — your data never leaves your machine. For highly confidential data, command-line tools are the safest option.

Frequently Asked Questions#

Can I convert JSON with nested arrays to a flat CSV?#

Yes, but you need to choose a flattening strategy. Dot notation works for nested objects. For arrays, you either expand rows (one row per array element), use indexed columns (items.0, items.1), or join array values with a delimiter. The best choice depends on what you're doing with the CSV.

Will I lose data converting between formats?#

It depends on the conversion direction. JSON to YAML and back is essentially lossless (except comments). JSON to CSV loses nesting structure and type information. XML to JSON can lose attribute/element distinction and namespace information. Always do a round-trip test if data integrity is critical.

How do I handle dates during conversion?#

JSON has no date type, so dates are always strings. When converting CSV to JSON, dates should remain as strings in ISO 8601 format ("2026-03-23" or "2026-03-23T14:30:00Z"). When converting to CSV, dates pass through unchanged. The danger zone is opening CSV in Excel, which may reformat dates based on your locale settings.

What's the maximum file size I can convert in a browser?#

Typically 100-200MB before you risk the browser tab running out of memory. It depends on data complexity — flat arrays of simple objects handle better than deeply nested structures. For anything larger, command-line tools like jq, mlr, or Python scripts are more reliable.

How do I convert JSON to Excel without losing leading zeros?#

Use a converter that quotes string fields, then use Excel's "Import Data" feature (Data tab > From Text/CSV) instead of double-clicking the file. The import wizard lets you set each column's data type, preventing Excel from treating zip codes or IDs as numbers.

Wrapping Up#

Data format conversion seems simple until it isn't. The happy path is easy: flat JSON to CSV, YAML to JSON. The real world gives you nested objects, inconsistent schemas, special characters, massive files, and encoding mismatches.

The principles I keep coming back to:

Understand your data structure before converting — nesting depth, array patterns, edge cases
Choose the right flattening strategy — no universal answer, only the right one for your use case
Always validate after conversion — row counts, spot checks, round-trip testing
Preserve types intentionally — don't let the converter guess when you know
Handle encoding explicitly — UTF-8 with BOM for Excel

That 47MB JSON file I mentioned at the beginning? Converted in under 30 seconds, nested objects flattened correctly, all 50,000 records intact, ready for the analytics team by 2:15 AM. Sometimes the right tool really does make all the difference.

This guide is everything I've learned about converting between JSON, CSV, XML, and YAML without losing your mind or your data.

Why There Are So Many Data Formats#

Format	Best For	Weakness
JSON	APIs, web apps, NoSQL databases	No native date type, no comments
CSV	Spreadsheets, flat tabular data	No nested data, no types
XML	Enterprise systems, SOAP, documents	Verbose, complex parsing
YAML	Config files, DevOps pipelines	Indentation-sensitive, type coercion

JSON: The Lingua Franca of Modern Data#

JSON won the data format wars for web development. It's what your REST API returns, what your NoSQL database stores, and what your frontend consumes.

json

{
  "employees": [
    {
      "id": 1,
      "name": "Sarah Chen",
      "department": "Engineering",
      "skills": ["Python", "Rust", "SQL"],
      "address": {
        "city": "Portland",
        "state": "OR"
      }
    }
  ]
}

CSV: The Universal Spreadsheet Format#

CSV is the oldest format here and the most misunderstood. Everyone thinks CSV is simple. It's not.

csv

id,name,department,skills,city,state
1,Sarah Chen,Engineering,"Python, Rust, SQL",Portland,OR

Here's something most people don't know: there is no single CSV standard. RFC 4180 exists, but it's a memo, not a binding specification.

Excel uses commas in English locales but semicolons in European locales
Quoting rules vary — some tools escape quotes by doubling (""), others use backslashes
Line endings might be \n, \r\n, or \r
Encoding could be UTF-8, UTF-16, Latin-1, or Windows-1252

XML: The Enterprise Heavyweight#

XML was supposed to be the universal data format. In the early 2000s, everything was XML. It's still critical in healthcare (HL7), finance (SWIFT/ISO 20022), and government systems.

xml

<employee id="1" active="true">
  <name>Sarah Chen</name>
  <department>Engineering</department>
  <skills>
    <skill>Python</skill>
    <skill>Rust</skill>
  </skills>
</employee>

YAML: The Human-Friendly Format#

YAML is what JSON would look like if designed for humans. It's the default for Kubernetes manifests, Docker Compose, GitHub Actions, and Ansible.

yaml

employees:
  - id: 1
    name: Sarah Chen
    department: Engineering
    skills:
      - Python
      - Rust
      - SQL
    address:
      city: Portland
      state: OR

JSON to CSV Conversion: Where Things Get Real#

This is the conversion most people search for, and the one with the most pitfalls.

The Simple Case#

Flat JSON arrays convert cleanly:

json

[
  { "id": 1, "name": "Widget A", "price": 9.99, "stock": 150 },
  { "id": 2, "name": "Widget B", "price": 14.99, "stock": 75 }
]

csv

id,name,price,stock
1,Widget A,9.99,150
2,Widget B,14.99,75

Keys become headers, values become cells. Life is good.

The Nested Object Problem#

Real-world JSON is rarely flat. Here's where a json to csv converter needs to be smart:

json

[
  {
    "orderId": "ORD-001",
    "customer": { "name": "Alice Park", "email": "alice@example.com" },
    "items": [
      { "product": "Laptop", "qty": 1, "price": 999 },
      { "product": "Mouse", "qty": 2, "price": 29 }
    ],
    "shipping": { "city": "Seattle", "method": "express" }
  }
]

Three common flattening strategies:

Strategy 1 — Dot Notation: Flatten nested objects by joining keys with dots. customer.name, customer.email, shipping.city. Works for objects, but doesn't handle the items array.

Strategy 2 — Array Expansion: Create a row for each array element, duplicating parent data. One order with 5 items becomes 5 CSV rows. Accurate but increases file size.

Strategy 3 — Array Indexing: Use numbered columns: items.0.product, items.1.product. Keeps one row per order but creates sparse columns when array sizes vary.

CSV to JSON: Rebuilding Structure from Flat Data#

Going the other direction has its own challenges.

The Type Inference Problem#

CSV has no types. Every cell is a string. Is "42" a string or number? Is "true" a string or boolean?

csv

name,age,active,score
Alice,28,true,95.5
Bob,35,false,87.0

Naive conversion keeps everything as strings. Smart conversion detects numbers and booleans:

json

[
  { "name": "Alice", "age": 28, "active": true, "score": 95.5 },
  { "name": "Bob", "age": 35, "active": false, "score": 87.0 }
]

But type inference can be wrong. If age is a string ID like "007", inference strips leading zeros. Good csv to json converters let you override detection per column.

Nested Header Reconstruction#

If your CSV headers use dot notation, some converters can rebuild the structure:

csv

name,address.street,address.city
Alice,123 Oak Ave,Portland

Becomes nested JSON with address as an object. The CSV to JSON converter on akousa.net supports this, saving hours of manual restructuring.

XML to JSON: The Format Mismatch#

This conversion is trickier than it looks because of fundamental structural differences.

The Attribute Problem#

XML elements can have attributes. JSON objects just have properties.

xml

<price currency="USD">79.99</price>

Single Element vs. Array Ambiguity#

This is one of XML-to-JSON conversion's nastiest edge cases:

xml

<!-- One tag: converter produces a string -->
<tags><tag>audio</tag></tags>
 
<!-- Two tags: converter produces an array -->
<tags><tag>audio</tag><tag>bluetooth</tag></tags>

YAML to JSON and Back#

Since YAML is a superset of JSON, this conversion is the most straightforward — but type coercion creates surprises.

yaml

name: Project Config
version: 1.0
debug: yes
ports:
  - 8080
  - 443

json

{
  "name": "Project Config",
  "version": 1.0,
  "debug": true,
  "ports": [8080, 443]
}

Note: YAML to JSON loses comments (JSON doesn't support them). Plan accordingly.

Edge Cases That Break Converters#

Special Characters and Quoting#

json

{ "message": "He said \"hello\" and she said 'goodbye'" }

In CSV, double quotes inside a quoted field get doubled: "He said ""hello"" and she said 'goodbye'". Most tools handle this, but some don't, silently corrupting data.

Newlines in Values#

Unicode and Excel#

Null vs. Missing Keys#

json

[{ "name": "Alice", "phone": null }, { "name": "Bob" }, { "name": "Carol", "phone": "555-0123" }]

In CSV, both null and missing keys become empty cells. The distinction is lost. If downstream systems treat these differently, you need a convention.

Deeply Nested Arrays#

Large File Handling#

Command-Line Alternatives for Giant Files#

jq:

bash

jq -r '(.[0] | keys_unsorted) as $keys | $keys, (.[] | [.[$keys[]]]) | @csv' data.json > output.csv

Miller (mlr):

bash

mlr --json --ocsv cat data.json > output.csv
mlr --csv --ojson cat data.csv > output.json

Python:

python

import json, csv
with open('data.json') as f:
    data = json.load(f)
with open('output.csv', 'w', newline='') as f:
    writer = csv.DictWriter(f, fieldnames=data[0].keys())
    writer.writeheader()
    writer.writerows(data)

For files under 200MB, browser tools are faster. For anything bigger, jq and mlr are your friends.

API Response Conversion#

One of the most common scenarios: taking API JSON and getting it into Excel.

Collect all paginated API responses
Merge into a single JSON array
Flatten nested objects with dot notation
Convert to CSV using a json to csv converter
Open in Excel

The critical step is flattening. If the API returns nested attributes.pricing.amount, you want that as a column header, not a JSON blob in a cell.

Convert JSON to Excel: The Direct Path#

People searching for "convert json to excel" usually go JSON to CSV to Excel. The gotcha: Excel guesses types during CSV import, turning zip codes into numbers and stripping leading zeros.

Data Migration Use Cases#

MongoDB to PostgreSQL#

MongoDB exports JSON. PostgreSQL imports CSV. The path: export JSON, flatten nested documents to match your relational schema, convert to CSV, import with \copy.

Configuration Format Migration#

Spreadsheet to API Pipeline#

Database Export and Reimport#

Common Conversion Mistakes#

Not checking output. Always spot-check: first 10 rows, last 10 rows, 10 random rows from the middle. Verify row count matches. Two minutes of checking saves two days of debugging.

Losing number precision. A 64-bit integer like 9007199254740993 exceeds JavaScript's safe integer limit. Excel silently rounds it. Treat large integers as strings.

Encoding mismatch. JSON is UTF-8. CSV target is unclear. Your colleague in Japan opens it and every character is garbled. Always use UTF-8 with BOM for Excel compatibility.

Wrong root path. Not all JSON is an array. Some wraps data in {"metadata": {...}, "results": [...]}. Point the converter at results, not the root object.

Real-World Conversion Workflows#

Let me share complete workflows for the most common scenarios.

Workflow 1: API Data to Excel Report#

Your monitoring API returns JSON. Management wants a weekly Excel report.

Fetch the JSON data from your API endpoint
Open the JSON to CSV converter on akousa.net
Paste or upload the JSON
Select dot notation flattening for nested objects
Preview the output — check column names and data integrity
Download as CSV
Open in Excel, format as a table, add charts as needed

Total time: under 2 minutes for files up to 50MB.

Workflow 2: Legacy XML Feed to Modern API#

A partner sends XML data feeds daily. Your system needs JSON.

Open the XML to JSON converter
Upload the XML
Choose attribute handling (prefix with @ for lossless conversion)
Specify which elements should always be arrays
Convert and validate the structure matches your API schema
Integrate into your data pipeline

Workflow 3: DevOps Config Cleanup#

Your CI/CD config is in JSON but your team wants readable YAML.

Open the JSON to YAML converter
Paste your JSON configuration
Convert to YAML
Add comments explaining non-obvious settings
Validate YAML syntax
Commit to your repository

Batch Conversion#

When you have dozens or hundreds of files, doing them one by one isn't practical.

Directory Batch Processing#

bash

# Convert all JSON files in a directory to CSV
for f in *.json; do
  mlr --json --ocsv cat "$f" > "${f%.json}.csv"
done

Merging Multiple Files#

Sometimes you need to combine multiple JSON files into one CSV:

bash

jq -s 'add' file1.json file2.json file3.json | \
  jq -r '(.[0] | keys_unsorted) as $k | $k, (.[] | [.[$k[]]]) | @csv' > merged.csv

Or multiple CSVs into a single JSON:

bash

# Keep header from first file, data from all
head -1 file1.csv > merged.csv
for f in file*.csv; do tail -n +2 "$f" >> merged.csv; done
# Then convert merged.csv to JSON with your preferred tool

Python Batch Script#

python

import json, csv, glob
 
for filepath in glob.glob('data/*.json'):
    with open(filepath) as f:
        data = json.load(f)
 
    csv_path = filepath.replace('.json', '.csv')
    with open(csv_path, 'w', newline='') as f:
        # Collect all keys across all records
        all_keys = set()
        for record in data:
            all_keys.update(record.keys())
 
        writer = csv.DictWriter(f, fieldnames=sorted(all_keys))
        writer.writeheader()
        writer.writerows(data)

Note how this script collects all keys across all records — avoiding the "inconsistent structure" mistake mentioned earlier.

Validating After Conversion#

Conversion without validation is gambling.

Row count check:

bash

jq length input.json       # JSON array length
wc -l output.csv           # CSV rows (subtract 1 for header)

Round-trip testing — the gold standard: convert A to B to A and diff with the original. If the diff is empty, your conversion is lossless.

bash

diff <(jq -S . original.json) <(jq -S . roundtripped.json)

Schema validation for JSON output: use JSON Schema to enforce required fields, types, and formats.

json

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "type": "array",
  "items": {
    "type": "object",
    "required": ["id", "name", "email"],
    "properties": {
      "id": { "type": "integer" },
      "name": { "type": "string" },
      "email": { "type": "string", "format": "email" }
    }
  }
}

Conversion Performance Benchmarks#

To set realistic expectations, here are rough timelines for browser-based data format conversion:

File Size	Records	JSON to CSV	CSV to JSON	XML to JSON
1 MB	~5,000	< 1 second	< 1 second	1-2 seconds
10 MB	~50,000	2-5 seconds	2-5 seconds	5-10 seconds
50 MB	~250,000	10-20 seconds	10-20 seconds	20-40 seconds
100 MB	~500,000	20-45 seconds	20-45 seconds	40-90 seconds
200 MB+	1M+	Use CLI tools	Use CLI tools	Use CLI tools

These are approximate and depend on data complexity (nesting depth, field count) and your machine's RAM. Flat data converts faster than deeply nested structures.

Format Selection Guide#

Use JSON for REST APIs, document databases, web service data interchange, variable-structure data.

Use CSV for flat tabular data, Excel/Sheets compatibility, SQL database imports, maximum legacy interoperability.

Use XML for enterprise/government integration, SOAP APIs, schema-validated documents, XSLT pipelines.

Use YAML for configuration files humans maintain, Kubernetes/Docker/CI/CD, anything needing comments and readability.

Security Note#

Frequently Asked Questions#

Can I convert JSON with nested arrays to a flat CSV?#

Will I lose data converting between formats?#

How do I handle dates during conversion?#

What's the maximum file size I can convert in a browser?#

How do I convert JSON to Excel without losing leading zeros?#

Wrapping Up#

The principles I keep coming back to:

Understand your data structure before converting — nesting depth, array patterns, edge cases
Choose the right flattening strategy — no universal answer, only the right one for your use case
Always validate after conversion — row counts, spot checks, round-trip testing
Preserve types intentionally — don't let the converter guess when you know
Handle encoding explicitly — UTF-8 with BOM for Excel

Why There Are So Many Data Formats#

JSON: The Lingua Franca of Modern Data#

CSV: The Universal Spreadsheet Format#

XML: The Enterprise Heavyweight#

YAML: The Human-Friendly Format#

JSON to CSV Conversion: Where Things Get Real#

The Simple Case#

The Nested Object Problem#

CSV to JSON: Rebuilding Structure from Flat Data#

The Type Inference Problem#

Nested Header Reconstruction#

XML to JSON: The Format Mismatch#

The Attribute Problem#

Single Element vs. Array Ambiguity#

YAML to JSON and Back#

Edge Cases That Break Converters#

Special Characters and Quoting#

Newlines in Values#

Unicode and Excel#

Null vs. Missing Keys#

Deeply Nested Arrays#

Large File Handling#

Command-Line Alternatives for Giant Files#

API Response Conversion#

Convert JSON to Excel: The Direct Path#

Data Migration Use Cases#

MongoDB to PostgreSQL#

Configuration Format Migration#

Spreadsheet to API Pipeline#

Database Export and Reimport#

Common Conversion Mistakes#

Real-World Conversion Workflows#

Workflow 1: API Data to Excel Report#

Workflow 2: Legacy XML Feed to Modern API#

Workflow 3: DevOps Config Cleanup#

Batch Conversion#

Directory Batch Processing#

Merging Multiple Files#

Python Batch Script#

Validating After Conversion#

Conversion Performance Benchmarks#

Format Selection Guide#

Security Note#

Frequently Asked Questions#

Can I convert JSON with nested arrays to a flat CSV?#

Will I lose data converting between formats?#

How do I handle dates during conversion?#

What's the maximum file size I can convert in a browser?#

How do I convert JSON to Excel without losing leading zeros?#

Wrapping Up#

İlgili Yazılar

Regex Cheat Sheet 2026 — Regular Expressions Made Simple

HEX to RGB Converter — Color Code Guide for Web Developers

Why There Are So Many Data Formats#

JSON: The Lingua Franca of Modern Data#

CSV: The Universal Spreadsheet Format#

XML: The Enterprise Heavyweight#

YAML: The Human-Friendly Format#

JSON to CSV Conversion: Where Things Get Real#

The Simple Case#

The Nested Object Problem#

CSV to JSON: Rebuilding Structure from Flat Data#

The Type Inference Problem#

Nested Header Reconstruction#

XML to JSON: The Format Mismatch#

The Attribute Problem#

Single Element vs. Array Ambiguity#

YAML to JSON and Back#

Edge Cases That Break Converters#

Special Characters and Quoting#

Newlines in Values#

Unicode and Excel#

Null vs. Missing Keys#

Deeply Nested Arrays#

Large File Handling#

Command-Line Alternatives for Giant Files#

API Response Conversion#

Convert JSON to Excel: The Direct Path#

Data Migration Use Cases#

MongoDB to PostgreSQL#