CleanSchema detects PII and sensitive values in your .csv and .xlsx files and replaces them with realistic synthetic data. Same column names, same data types, same row count — zero real values. Runs entirely on your Mac.

Before:

| first_name | email | salary | ssn | dept |
|---|---|---|---|---|
| Jennifer | j.martinez@acme.com | $87,500 | 412-38-9201 | Engineering |
| Marcus | m.chen@acme.com | $112,000 | 203-77-4501 | Engineering |
| Priya | p.shah@acme.com | $95,800 | 596-12-8430 | Design |
| David | d.wright@acme.com | $78,200 | 719-04-3322 | Operations |
| Aisha | a.patel@acme.com | $103,400 | 831-66-9011 | Engineering |

After:

| first_name | email | salary | ssn | dept |
|---|---|---|---|---|
| Denise | d.holloway@example.org | $91,200 | 738-52-1094 | Engineering |
| Owen | o.bennett@example.org | $108,300 | 419-22-6783 | Engineering |
| Camila | c.ortiz@example.org | $98,600 | 662-08-5142 | Design |
| Nathan | n.briggs@example.org | $81,900 | 277-91-3805 | Operations |
| Imani | i.kelley@example.org | $106,750 | 514-49-2278 | Engineering |

Categorical fields like dept pass through untouched. Statistical distributions match the source. Joins still work.
No upload. No API call. No cloud processing. The file enters memory, leaves as a clean copy.
Works with Wi-Fi off. Suitable for restricted environments where data can't leave the network.
Column names, data types, row counts, joins, distributions — all preserved. Only the values change.
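
One way to check the preserved-structure claim yourself is to compare the original file with its clean copy. The following is a minimal sketch using only the Python standard library. The `same_shape` helper is hypothetical and not part of CleanSchema, and it checks only the header row and the row count, not data types or distributions.

```python
import csv

def same_shape(original_path, clean_path):
    """Return True if both CSV files share the same header and row count."""
    with open(original_path, newline="") as f1, open(clean_path, newline="") as f2:
        r1, r2 = csv.reader(f1), csv.reader(f2)
        # The first row of each file is treated as the header.
        header1, header2 = next(r1, None), next(r2, None)
        # Count the remaining data rows in each file.
        rows1 = sum(1 for _ in r1)
        rows2 = sum(1 for _ in r2)
    return header1 == header2 and rows1 == rows2
```

Calling `same_shape("employees.csv", "employees_clean.csv")` (both file names are placeholders) should return `True` for any pair that keeps the same columns and row count.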
Every classification rule and replacement function is in the repo. Read it. Fork it. Trust it on inspection, not on faith.
CSV or Excel. Any size, any structure. CleanSchema reads it locally — nothing transmitted, nothing logged.
13 sensitive-data classifiers run against column names and value patterns. You see every detection — and can override any of them — before anything is replaced.
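
As a rough illustration of how detection by column name and value pattern can work, here is a minimal sketch. It is not CleanSchema's actual rule set: the `classify_column` function, the two rules shown, and the 80% match threshold are all assumptions made for this example. The real classifiers are in the repo.

```python
import re

# Hypothetical rules for illustration only; not CleanSchema's actual classifiers.
# Each rule pairs a column-name hint with a pattern the values should match.
RULES = {
    "ssn": (re.compile(r"ssn|social", re.I), re.compile(r"\d{3}-\d{2}-\d{4}")),
    "email": (re.compile(r"e-?mail", re.I), re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")),
}

def classify_column(name, values, threshold=0.8, overrides=None):
    """Return a label for the column, or None if it looks non-sensitive."""
    # A user override always wins over automatic detection.
    if overrides and name in overrides:
        return overrides[name]
    non_empty = [v for v in values if v]
    for label, (name_hint, value_pattern) in RULES.items():
        # First signal: the column name itself.
        if name_hint.search(name):
            return label
        # Second signal: the share of non-empty values that fit the pattern.
        if non_empty:
            hits = sum(1 for v in non_empty if value_pattern.fullmatch(v))
            if hits / len(non_empty) >= threshold:
                return label
    return None
```

In this sketch an override is a plain dictionary from column name to label, so passing `overrides={"ssn": None}` would tell the function to leave that column alone.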
Sensitive values become realistic synthetics. Names look like names. Salaries fall in the same range. Joins still work because IDs replace consistently.
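
Consistent replacement is what keeps joins intact: the same source value has to map to the same synthetic value everywhere it appears. Here is a minimal sketch of that idea, assuming an in-memory lookup table that lasts for a single run. The `ConsistentReplacer` class is hypothetical, and CleanSchema's own replacement functions may work differently.

```python
import random

class ConsistentReplacer:
    """Hands out one synthetic SSN-shaped value per distinct real value."""

    def __init__(self, seed=None):
        self._rng = random.Random(seed)
        self._mapping = {}   # real value -> synthetic value
        self._used = set()   # synthetic values already handed out

    def _candidate(self):
        rng = self._rng
        return f"{rng.randint(100, 899)}-{rng.randint(1, 99):02d}-{rng.randint(1, 9999):04d}"

    def replace(self, real_value):
        # Reuse the earlier answer so repeated values stay linked.
        if real_value in self._mapping:
            return self._mapping[real_value]
        candidate = self._candidate()
        # Retry on a collision or on an accidental match with the real value.
        while candidate in self._used or candidate == real_value:
            candidate = self._candidate()
        self._mapping[real_value] = candidate
        self._used.add(candidate)
        return candidate

# Two tables that share an SSN key (sample values from this page).
employees = ["412-38-9201", "203-77-4501"]
payroll = ["203-77-4501", "412-38-9201"]

replacer = ConsistentReplacer(seed=42)
clean_employees = [replacer.replace(v) for v in employees]
clean_payroll = [replacer.replace(v) for v in payroll]
```

Because both lists go through the same replacer, a row in `clean_employees` still matches its counterpart in `clean_payroll`, even though neither contains a real value.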
CleanSchema runs as a local Python app — no servers, no API keys, no telemetry. The clean file lands in the same folder. The original is untouched. There is nothing for us to see, because we never built the pipe to see it.
Open source. Runs on Mac. Requires Python 3.9+. Setup takes about 30 seconds.
After cloning the repo, you can also double-click run.command on macOS; the app opens in your browser.