Clinical-grade synthetic data — with proof

Gesalp AI turns sensitive clinical tables into synthesist real data: realistic, privacy-safe datasets audited for utility and compliance. Built for sponsors, CROs, and AI teams.

HIPAA/GDPR-ready. On-prem or SaaS.

AUROC0.87·
C-Index0.74·
MIA AUC0.56·
DP (ε)configurable
Trial planning
Cohort exploration
Real-world evidence
Model development
Vendor data sharing

Agent data scientist

Our agent selects the right generator (GC/CTGAN/TVAE/diffusion), tunes hyper-params, and optimizes for your objective (privacy-first, utility-first, or balanced).

Reproducible seeds & run cards.

Privacy with proof

Every dataset ships with an audit: DCR p01, MIA advantage, k-anonymity, and optional DP (ε, δ). Green-light gates block release until thresholds pass.

Exportable PDF & JSON.

Trial-aware synthesis

Survival/censoring aware. Supports longitudinal vitals/labs and irregular time-series.

Tabular today; diffusion & transformers next.

Deploy anywhere

Use our SaaS or run on-prem. The on-prem engine only publishes when all privacy/utility gates are green.

Governance & lineage

Dataset cards capture source URI+hash, cohort ε, metrics, and post-processing steps for audit trails.

Marketplace ready

Publish to your catalog: consumers can use the data without owning the originals—Snowflake-style access controls.

How it works

1

Upload or connect

Securely upload CSV/Parquet or connect your table.

2

Click Start

Agent chooses model + tuning; optional advanced settings.

3

Evaluate

See privacy & utility metrics update in real time.

4

Deliver

Download CSV/Parquet + audit PDF/JSON, or publish to marketplace.

How we'll maintain and extend our product advantage

1

Assess data sources

Inventory OMOP/FHIR/CSV and define cohorts.

2

Prepare sample cohort

De-risk schema, units, code sets (ICD/ATC/LOINC).

3

Train DP engine

Auto-select model (TabDDPM/CTGAN/TVAE) with ε/δ budget.

4

Run privacy audits

DCR/kNN, MIA advantage, k-anon/ℓ-div; gate on pass.

5

Validate utility

TSTR/RTS, AUROC/AUPRC vs real; drift (KS/AD, PSI).

6

Generate dataset card

Lineage, ε/δ, audits, hash; PDF + JSON.

7

Pilot with partners

Hospital/CRO runs on-prem/VPC; collect feedback.

8

Publish & monitor

Marketplace listing, usage telemetry, retrain loop.

Why Gesalp

Measurable privacy

Distance-to-closest-record, membership inference AUC, k-anon/ℓ-diversity, optional DP accounting.

Real utility

TSTR/RTS, AUROC/AUPRC, drift (KS/AD/PSI), and correlation/MI delta—benchmarked vs. real baseline.

Enterprise-ready

On-prem mode, SSO, role-based access, air-gapped deployments, and regulator-friendly reports.

Pricing

GESALP AI supports teams of all sizes, with pricing that scales.

Research

Free

Perfect for academic research

  • 1 project
  • ≤5k rows
  • GC/CTGAN
  • Basic privacy–utility report
Get started
Popular

Starter

$199/mo

For growing teams

  • 5 projects
  • GC/CTGAN/TVAE
  • Full utility suite
  • Email support
Choose plan

Pharma/Enterprise

Contact

For enterprise needs

  • Unlimited projects
  • Diffusion + DP
  • Full privacy suite + audit PDF
  • SSO & on-prem
Contact sales

Frequently asked questions

We generate new rows from learned distributions and audit with DCR/MIA/k-anon; optional DP records effective ε, δ.
Yes—configurable. When enabled, runs include cohort ε summaries and stop-gates.
Yes. Air-gapped or VPC. On-prem only publishes when all gates pass.
Tabular (categorical, numeric, dates), with support for longitudinal/time-series trial data.
If labels exist in the source, we preserve distributions; TSTR reports quantify downstream performance.