Skip to main content
provence.ai
Document fraud detection

Document fraud,caught at the source.

A platform dedicated to document authenticity: PDF forensics, visual AI, cryptographic verification of official codes, data cross-checks.

5 analysis branches42 automated checks19 document types28 data types extracted100% of official codes verifiedeIDAS compatibleOn-premise deployment5 analysis branches42 automated checks19 document types28 data types extracted100% of official codes verifiedeIDAS compatibleOn-premise deployment

Regulated sectors

Where document fraud weighs heavy.

Process cases, don't just stack them.

Local authorities, social agencies, central government. Document checks on benefit cases and public procurement, with a chain-of-integrity compatible with legal proceedings.

Use cases

  • 01Processing social benefit cases
  • 02Document verification in public procurement
  • 03Consistency checks on tax declarations

eIDAS-compatible for legal evidence

Full pipeline

From detection to action.

Four parallel analysis angles to surface fraud, then four steps to qualify it, document it and handle it.

Spots the pixels that changed.

A dedicated AI model looks at the document at pixel resolution and maps suspect zones, on scans, photos or native PDFs.

  • Outperforms public academic benchmarks on French administrative documents
  • Independent of document source and quality
  • Explicit heat maps for your analysts
  • On-premise inference, no data ever leaves your perimeter
  • Sensitivity thresholds tunable per document type
~70%detection (false positives < 5%)

Document coverage

Every format. Every origin.

Not just the metadata of native PDFs.

Scanned

Paper scans, photos, fax.

CNIPasseportPermisTitre de séjourCasier judiciaireDiplômeJustificatif de domicileAvis d'impositionBulletin de salaireFactureDevisRIBRelevé bancaireQuittance

Handwritten

Handwriting and annotations.

CERFAOrdonnance médicaleAttestationReçuFormulaire complétéAnnotation

Digital native

PDFs, e-invoices, signed documents.

Avis d'impositionFacture / DevisRIBRelevé bancaireBulletin de salaireKbisContratAttestation France TravailJustificatif de domicile

Forgery techniques

Five techniques. Five detection rates.

·Stable

Local edits

Targeted edit on a single zone: name, amount, date, IBAN. Often invisible to the naked eye.

77%detection rate

e.g. Photoshop, Acrobat, pixel masking

Rising

Mobile apps & editors

Edits via consumer scan apps and PDF editors. Scan and edit in a single gesture.

76%detection rate

e.g. CamScanner, Sejda, PDF24, online tools

Surging

AI deepfakes

AI-generated composites, democratized by the latest generative models. The fastest-growing threat.

71%detection rate

e.g. Nano Banana, Midjourney, Stable Diffusion, etc.

·Stable

Full reconstruction

Document built entirely from scratch, with no original. Spotted via data inconsistencies and missing history.

51%detection rate

e.g. Document built from scratch, fake templates

Declining

Collage and copy-move

Patches copy-pasted from one document to another, or zone to zone. Classic technique, losing ground to AI tools.

43%detection rate

e.g. Manual copy-paste

Measured performance

Numbers that hold in production.

From reproducible benchmarks on real corpora, not marketing slides. To be tuned to your infrastructure and your risk tolerance.

01 · Metric

1–6

seconds per page

The full pipeline, on a dedicated GPU, processes a page in 1 to 6 seconds depending on resolution and document type. No cloud queue, no quotas.

  • Runs locally or on a private cloud
  • Async CPU inference possible. Dedicated GPU recommended.
  • Hardened protection of sensitive data

02 · Metric

42

automated checks

Twenty-four structural checks on PDFs. Fourteen on JPEG images. Four on PNGs. Each ranked by severity and explained in plain language.

  • 24 PDF signals (XMP, signatures, edits, overlays)
  • 14 JPEG signals + 4 PNG signals (re-compression, ELA, metadata)
  • 3 severity levels, translated into business language

03 · Metric

≥95%

accuracy on zones

Extraction of sensitive suspect data is measured document by document. Social security number, first name, IBAN: accuracy above 95%. The foundation of reliable qualification.

  • 28 data classes detected, including handwritten
  • Sovereign AI, calibrated on French administrative corpora
  • Visual localization for audit and masking

04 · Metric

100%

of official codes read are verified

Native reading of French 2D-Doc seals and official QR codes. Every readable 2D-Doc seal is cryptographically validated (ECDSA signature against the official list of French authorities). For QR codes pointing to an authentic copy, the document is securely retrieved and compared to the original - any inconsistency on critical fields (name, amount, tax number…) is flagged.

  • 874 embedded official keys - full coverage of French state 2D-Doc (tax authority, statistics agency, utilities, telecom and banking operators)
  • 241 structured fields recognized across all official scopes (tax notice, payslip, ID, residence permit, death certificate, vehicle registration, diploma…)
  • Fuzzy cross-document comparison with inclusion rules (truncated numbers, compound names, aggregated addresses) and plain-language user messages
  • Secure retrieval of the authentic copy with 8 defense layers (whitelist, anti-SSRF, sandbox, antivirus, PDF/image sanitization)

Human in the loop

The pipeline proposes. The analyst decides.

A dedicated interface to qualify, escalate, archive.

01/ 05

Automated triage

Three queues, calibrated by document type: authentic, to review, forged.

02/ 05

Visual review

Suspect zones mapped onto the document, signals explained in plain language.

03/ 05

Reasoned qualification

The analyst decides and justifies. Timestamped audit trail on every decision.

04/ 05

Legal evidence

Electronically signed bundle, compatible with court bailiffs and eIDAS.

05/ 05

Escalation & notifications

Built-in notifications to third parties (banks, insurers, authorities). Configurable escalation workflow.

Frequently asked questions

Documents administratifs analyses par l'IA de detection de fraude

Evaluation

Test it on
your own documents.

See the application in action on a representative sample of your cases.