About Us — Docs4U AI

Why We Built This

Every Indian exporter knows the routine: ICEGATE generates a Shipping Bill PDF, and someone in the team spends 45 minutes manually entering IEC codes, port codes, HS codes, and FOB values into Excel — row by row, document by document.

Textile manufacturers do the same with Grey Despatch Challans. CA firms do it with tax invoices. CHA firms do it with Bills of Entry. The same tedious, error-prone process, happening thousands of times a day across India's trade and compliance ecosystem.

"A mid-sized export house processes 200+ Shipping Bills a month. At 45 minutes per document, that's over 150 hours of pure data entry — every single month."

Docs4U AI was built to change that. Purpose-trained for India's trade document formats — ICEGATE, GST, DGFT — it extracts every field accurately in under 15 seconds.

99.5%

Extraction Accuracy

15s

Avg Per Document

5

Live Document Types

10×

Faster Than Manual

OCR

Scanned PDF Support

Batch

100s of PDFs at Once

What We Extract

5 Critical Document Types. Live Today.

Every document is purpose-built for the specific format and field requirements of India's trade and tax ecosystem.

Shipping Bill

ICEGATE Export
● Live

Bill of Entry

ICEGATE Import
● Live

BRC

Bank Certificate
● Live

GST Tax Invoice

ITC / Purchase Register
● Live

Grey Despatch Challan

GST §143 Job Work
● Live

Our Values

What Drives Every Decision

The principles behind every feature we build.

Accuracy Over Speed

A wrong IEC or GSTIN in an export register causes real compliance problems. We prioritise extraction accuracy above everything — including processing time.

Domain Depth

We don't build generic document tools. Every extraction model is trained on the specific format, field names, and compliance requirements of Indian trade and GST documents.

Data Security

Your Shipping Bills and invoices contain confidential business information. All files are stored in your private account, encrypted in transit, and never shared or used to train models.

Technology

The AI Stack Behind the Product

Industry-leading AI models combined with India-specific domain knowledge and validation logic.

Contextual Document Intelligence

Our primary extraction engine reads document structure contextually — understanding that "FOB VALUE" on one Shipping Bill and "F.O.B. AMOUNT" on another are the same field. Handles multi-page, multi-item documents natively without rigid templates.

Streaming Extraction

Vision OCR

For scanned PDFs — mobile camera photos of printed documents, low-resolution fax scans, old ICEGATE printouts — our vision pipeline reads the image directly without traditional OCR pre-processing, handling complex table layouts accurately.

PDF + Image Support

Identifier Validation

GSTIN (15-char), IEC (10-char), CIN (21-char), PAN — extracted values are validated against format rules and corrected for common OCR errors (O→0, I→1). State code is cross-checked against the company address to catch transposition errors.

GSTIN · IEC · CIN · PAN

Smart Excel Export

Output isn't just dumped into cells. IEC, GSTIN, and Challan numbers are stored as text (no leading-zero loss), dates normalised to DD-MMM-YYYY, columns auto-fitted, header rows frozen — ready for direct import into your ERP or Tally.

openpyxl · Auto-Format

Async Batch Processing

Upload 100 PDFs at once. Processing runs in a Celery task queue, so you're not waiting for each document. Come back to a single consolidated Excel download when the batch completes.

Celery · Redis

Prompt Caching

Document-level extraction results are cached in Redis (1-hour TTL). Column re-selection reuses cached data — no re-calling the AI. JSON repair logic recovers from any truncated responses, ensuring no failed extractions go unrecovered.

Redis · json-repair

Built for India's
Trade & Tax Teams

Why We Built This

5 Critical Document Types. Live Today.

Shipping Bill

Bill of Entry

BRC

GST Tax Invoice

Grey Despatch Challan

What Drives Every Decision

Accuracy Over Speed

Domain Depth

Data Security

The AI Stack Behind the Product

Contextual Document Intelligence

Vision OCR

Identifier Validation

Smart Excel Export

Async Batch Processing

Prompt Caching

Docs4U AI by the Numbers

Ready to Stop Typing
and Start Extracting?

Built for India's Trade & Tax Teams

Why We Built This

5 Critical Document Types. Live Today.

Shipping Bill

Bill of Entry

BRC

GST Tax Invoice

Grey Despatch Challan

What Drives Every Decision

Accuracy Over Speed

Domain Depth

Data Security

The AI Stack Behind the Product

Contextual Document Intelligence

Vision OCR

Identifier Validation

Smart Excel Export

Async Batch Processing

Prompt Caching

Docs4U AI by the Numbers

Ready to Stop Typingand Start Extracting?

Built for India's
Trade & Tax Teams

Ready to Stop Typing
and Start Extracting?