Introducing SCARCE: Our First Public Technical Report

We're publishing SCARCE, our first public technical report - a placebo-controlled, cross-domain benchmark measuring exactly where and how much synthetic data helps on the hardest, most data-starved class in each dataset.

Proof, not promises

Synthetic data is easy to claim and hard to prove. We wanted to answer one question under conditions we could not fool ourselves with: does adding Synthgen synthetic data to a real dataset actually make a model better on the cases that matter most - the rare, hard, data-starved class where real examples are scarce and models fail?

SCARCE - Synthetic-data Cross-domain Assessment of Rare-Class Efficiency - is our answer, and today we are publishing it in full.

What we measured

On the single hardest, most starved class in each dataset, we compared a model trained on real data alone against the same model trained on the same real data plus Synthgen synthetic data. Nothing else changed. We report per-class accuracy only, never an averaged or whole-dataset number.

The benchmark spans 8 industries and three imaging types, from everyday color photos to grayscale surface scans to X-ray.

The result

Adding Synthgen synthetic data lifted accuracy on the hardest class by +9 to +35 percentage points, on every dataset we tested. A few of the results from the report:

Pharmaceutical, faulty imprint: +34.8 pp
Electronics, blowhole: +29.1 pp
Agriculture, late blight: +13.3 pp
Building materials, hole: +9.0 pp

Every one of these is paired against a real-only baseline and statistically significant.

Fewer labels, sooner

The less real data you have, the more synthetic data is worth. On MVTec Pill, two labelled examples per class with Synthgen reached the same accuracy as five real labelled examples alone - the same result on 60% fewer real labels. The gap is widest exactly when your real data is scarcest.

What SCARCE does not claim

We think honesty is the point of a benchmark, so the report is explicit about its scope:

Every number is per-class on the hardest class - no averaged or whole-dataset accuracy.
Every gain is paired against a real-only baseline, over 7 to 15 random seeds, on a frozen test set we never touched during training.
Every result is placebo-controlled, with a z-score (standard deviations above the paired baseline) reported for each class.

We make no claims about easy classes, overall dataset accuracy, or any setting we did not measure. The gains say what they say, and nothing more.

Read the report

Read the full SCARCE technical report (PDF)

The headline numbers and the full per-class breakdown also live on our research page.

Proof, not promises

SCARCE - Synthetic-data Cross-domain Assessment of Rare-Class Efficiency - is our answer, and today we are publishing it in full.

What we measured

The benchmark spans 8 industries and three imaging types, from everyday color photos to grayscale surface scans to X-ray.

The result

Adding Synthgen synthetic data lifted accuracy on the hardest class by +9 to +35 percentage points, on every dataset we tested. A few of the results from the report:

Pharmaceutical, faulty imprint: +34.8 pp
Electronics, blowhole: +29.1 pp
Agriculture, late blight: +13.3 pp
Building materials, hole: +9.0 pp

Every one of these is paired against a real-only baseline and statistically significant.

Fewer labels, sooner

What SCARCE does not claim

We think honesty is the point of a benchmark, so the report is explicit about its scope:

Every number is per-class on the hardest class - no averaged or whole-dataset accuracy.
Every gain is paired against a real-only baseline, over 7 to 15 random seeds, on a frozen test set we never touched during training.
Every result is placebo-controlled, with a z-score (standard deviations above the paired baseline) reported for each class.

We make no claims about easy classes, overall dataset accuracy, or any setting we did not measure. The gains say what they say, and nothing more.

Read the report

Read the full SCARCE technical report (PDF)

The headline numbers and the full per-class breakdown also live on our research page.

Introducing SCARCE: Our First Public Technical Report

Proof, not promises

What we measured

The result

Fewer labels, sooner

What SCARCE does not claim

Read the report

Tags

Related Articles

Synthgen Becomes Official Fontys Partner in Innovation and SPARC Member

Synthgen Participates in Shenzhen Innovation Competition at High Tech Campus Eindhoven

Synthgen Joins SPARC Incubator Program

Introducing SCARCE: Our First Public Technical Report

Proof, not promises

What we measured

The result

Fewer labels, sooner

What SCARCE does not claim

Read the report

Tags

Related Articles

Synthgen Becomes Official Fontys Partner in Innovation and SPARC Member

Synthgen Participates in Shenzhen Innovation Competition at High Tech Campus Eindhoven

Synthgen Joins SPARC Incubator Program