v1.0

About Data Scramble

Transform sensitive production data into high-quality synthetic data while preserving referential integrity and maintaining realistic data relationships.

What is Data Scramble?

Data Scramble & Synthetic Generator is a professional-grade tool for anonymizing sensitive database information. It allows development, testing, and analytics teams to work with realistic data without exposing PII (Personally Identifiable Information), financial data, or other confidential content.

Rather than using dummy data like "test123", Data Scramble generates authentic-looking data that respects your database schema, maintains foreign key relationships, and preserves data distribution patterns — enabling truly representative testing and development.

Key Features

Multi-Source Import

Upload CSV files or SQL dumps, or connect directly to PostgreSQL databases

Smart Schema Detection

Auto-detects primary keys, foreign keys, and data types from your schema

Referential Integrity

Preserves FK relationships using identity mapping — no broken references

18 Scramble Types

Names, emails, phones, addresses, dates, UUIDs, and more powered by Faker.js

Intelligent Suggestions

Auto-suggests scramble types based on column names (email → fake email, etc.)

Configurable Output

Control row limits, include/exclude tables, add TRUNCATE statements

How It Works

1

Input Your Data

Upload CSV files, SQL dumps, or connect to a PostgreSQL database directly

2

Configure Scrambling Rules

Review auto-detected schema, choose scramble types per column, set row limits

3

Generate Synthetic Data

Engine applies topological sort to respect FK order, generates realistic values

4

Download SQL Output

Export as a .sql file with INSERT statements ready to load into any database

Technical Architecture

Frontend

Next.js 16 + React 19 with Shadcn UI and Tailwind CSS for a polished, responsive interface

Backend

Next.js Server Actions & Route Handlers with TypeScript for type-safe API endpoints

Schema Parsing

Custom parsers for CSV (csv-parse) and SQL dumps (regex-based CREATE TABLE + INSERT analysis)

Scramble Engine

Topological sort for FK-respecting order, Identity Map for PK↔FK consistency, Faker.js for realistic values

Database Support

Direct PostgreSQL connections via node-postgres (pg) with connection pooling and error handling

Data Flow

React Context for state management, streaming-ready architecture for large datasets

Available Scramble Types

Choose from 18 different anonymization strategies:

Keep OriginalFull NameFirst NameLast NameEmailPhoneAddressCityCountryZip CodeCompanyLorem TextRandom NumberUUIDDateBooleanRandom StringNULL

Ideal For

Development Teams

Test features with realistic data without exposing production PII

QA & Testing

Generate large datasets that match production schema for comprehensive testing

Analytics & BI

Share datasets with non-technical teams while maintaining data privacy

Compliance

Meet GDPR, CCPA, HIPAA, and other regulatory requirements for data handling

Third-Party Sharing

Provide vendors with realistic data samples without exposing sensitive information

Training & Demos

Create safe datasets for training junior developers or customer demonstrations

Security & Privacy

Client-Side Processing

Files are processed entirely in-memory — no data stored on servers

Encrypted Connections

Database connections use SSL/TLS encryption for secure data transmission

No Data Retention

Output SQL is generated on-demand and deleted after download

Open Source Ready

Self-hostable architecture — deploy on your own infrastructure

About the Creator

Data Scramble & Synthetic Generator was created by Hoan Do, a full-stack engineer and database specialist focused on building tools that solve real-world data challenges.

Tech Stack

Frontend

Next.js 16, React 19, TypeScript

UI

Shadcn UI, Tailwind CSS, Lucide Icons

Data

Faker.js, csv-parse, sql-parser

Database

node-postgres (pg), PostgreSQL

State

React Context, Sonner (Toast)

Deployment

Vercel (Next.js optimized)

Ready to Scramble?

Start anonymizing your data in seconds. Upload files or connect your database.