GDPR compliant · Processing on own VPS · Germany

Extract data from thousands of documents.
No setup required.

Upload your PDFs and Word files. Get an Excel file with exactly the fields you need. Processed on our server, never on a third-party cloud.

See demo
PDF DOCX DOC ✕ Excel
Used by teams at
Accounting firms · Legal practices · Data consultancies · BI teams
2.4M pages processed
98.7% accuracy on native docs
< 2s per page

Data analysts lose 3 hours a day on this

Copy-pasting fields from PDF to Excel. One by one. With a risk of error on every row.

Before
  • Manual copy-paste of each field
  • Python scripts that break with every new layout
  • Parseur templates that need reconfiguring
  • Adobe Acrobat Pro for 3 PDFs a month
  • Human errors that show up in the final report
With idpura
  • Upload the full batch at once
  • Define the fields you need (or use the financial template)
  • Download Excel ready for pivot tables
  • No templates, no scripts, no surprises
  • Traceability: which file originated each row

Available modules

Each module solves a specific data extraction problem.

Available now

Document Extractor

Extract text and tables from native PDF and DOCX. No AI, no hallucinations: the same file always produces the same output. 1 credit/page.

Available now

Data Sectioning

Upload 3 or more documents that share the same layout and automatically extract the values that change between them (dates, amounts, percentages and IDs) into columns. 1 credit/page.
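One way to picture this, as a rough sketch rather than idpura's actual method: when several documents share a layout, the tokens that stay constant are the template and the tokens that change are the data. The function name `variable_columns` and the sample lines below are hypothetical and only illustrate the idea.

```python
def variable_columns(docs: list[str]) -> list[list[str]]:
    """Return one row per document holding only the tokens that differ.

    Assumes every document splits into the same number of whitespace
    tokens, which is a strong simplification of "same layout".
    """
    tokenized = [doc.split() for doc in docs]
    length = len(tokenized[0])
    if any(len(tokens) != length for tokens in tokenized):
        raise ValueError("documents do not share the same token layout")

    # A position is "variable" when at least two documents disagree on it.
    varying = [
        i for i in range(length)
        if len({tokens[i] for tokens in tokenized}) > 1
    ]
    return [[tokens[i] for i in varying] for tokens in tokenized]


# Made-up sample lines standing in for three same-layout invoices.
docs = [
    "Invoice INV-001 dated 15/01/2024 total 1243.88 EUR",
    "Invoice INV-002 dated 22/01/2024 total 890.00 EUR",
    "Invoice INV-003 dated 03/02/2024 total 2601.50 EUR",
]
rows = variable_columns(docs)
```

Each row keeps only the invoice number, date and amount, because the other words are the same in every document.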

Coming soon

AI OCR

Process scanned documents, photos and images with Gemini. Text + structured data in a single call. 3 credits/page.

Coming soon

AI Extractor

Extract specific fields from invoices, payslips, contracts and bank statements with AI. Structured JSON output. 3 credits/page.

From document to data in 3 steps

Upload your files

Drag & drop or full folder selection. PDF, DOCX, in batch. Up to 500 files per job.

Choose the tool

Text and table extraction, data sectioning between documents, or AI extraction. The system calculates the cost before processing.

Download the result

Excel, JSON or CSV ready to use. With source column per row and full traceability.
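To illustrate what a source column per row can look like, here is a minimal sketch using Python's standard `csv` module. The column name `source_file`, the field names and the sample file names are assumptions for illustration, not idpura's actual output schema.

```python
import csv
import io

# Hypothetical extraction results, keyed by the file each row came from.
extracted = {
    "invoice_001.pdf": [{"date": "15/01/2024", "total": "1243.88"}],
    "invoice_002.pdf": [{"date": "22/01/2024", "total": "890.00"}],
}


def to_csv_with_source(results: dict[str, list[dict[str, str]]]) -> str:
    """Flatten per-file rows into one CSV, tagging each row with its origin."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["source_file", "date", "total"],
        lineterminator="\n",
    )
    writer.writeheader()
    for source, rows in results.items():
        for row in rows:
            # The source file name travels with every extracted row.
            writer.writerow({"source_file": source, **row})
    return buffer.getvalue()


csv_text = to_csv_with_source(extracted)
```

With the origin stored on each row, any value in a pivot table can be traced back to the document it came from.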

The system calculates the exact cost before processing. You only pay for what you use. Credits renew monthly.

See it in action

Simulated extraction flow with sample invoices.

facturas_q1_2024/ · 47 files
Uploading files...
Result ready · resultado.xlsx · 4 rows
VAT ID     Date        Supplier                   Base       Tax  Total
B12345678  15/01/2024  Suministros Iberia SL      €1,028.00  21%  €1,243.88
A98765432  22/01/2024  Tech Solutions Spain SA    €735.50    21%  €890.00
B55544433  03/02/2024  Distribuciones Levante SL  €2,150.00  21%  €2,601.50
A12398765  10/02/2024  Servicios Digitales SL     €480.00    21%  €580.80

Credits per page, not per document

A 200-page contract doesn't cost the same as a 2-page invoice. Here you pay for what you actually process.

Launching in 2026

Paid plans are under development. In the meantime, usage is completely free.

Currently in free beta · No usage limits · No credit card

Free

Try idpura. Free forever.

€0 / month
100 credits / month + 300 welcome bonus
Sign up free

Starter

For freelancers and small businesses.

€39 / month
1,200 credits / month
€0.0325 / credit

Business

For high-volume companies.

€199 / month
10,000 credits / month
€0.0199 / credit

Enterprise

For large organizations. SLA and dedicated support.

Custom
custom credits

Credits renew monthly based on your plan
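The per-credit figures follow from dividing the monthly price by the included credits. The sketch below reproduces that arithmetic, assuming the plan prices are in euros as the per-credit figures suggest; the `PLANS` dictionary and function names are hypothetical.

```python
# Monthly price (assumed to be in euros) and included credits per paid plan.
PLANS = {
    "Starter": (39, 1_200),
    "Business": (199, 10_000),
}


def price_per_credit(plan: str) -> float:
    """Monthly price divided by the monthly credit allowance."""
    price, credits = PLANS[plan]
    return round(price / credits, 4)


def price_per_page(plan: str, credits_per_page: int) -> float:
    """Effective cost of one page for a tool charging `credits_per_page`."""
    return round(price_per_credit(plan) * credits_per_page, 4)


starter_credit = price_per_credit("Starter")    # 0.0325
business_credit = price_per_credit("Business")  # 0.0199
starter_ai_page = price_per_page("Starter", 3)  # 0.0975
```

On these numbers, a page processed with a 3-credit tool works out to just under €0.10 on Starter.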

How much does each tool cost?

Tool                                   Credits per page
Document Extractor (text + tables)     1 cr / page
Data Sectioning (multi-doc variance)   1 cr / page
AI OCR (scanned docs & images)         3 cr / page
AI Extractor (structured fields)       3 cr / page
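As a rough sketch of how a job's cost could be estimated from these rates (the tool keys and the function name are hypothetical, not part of idpura's interface):

```python
# Credits per page for each tool, using the rates listed in the table.
CREDITS_PER_PAGE = {
    "document_extractor": 1,
    "data_sectioning": 1,
    "ai_ocr": 3,
    "ai_extractor": 3,
}


def estimate_credits(tool: str, page_counts: list[int]) -> int:
    """Total credits for a batch: all pages in the batch times the tool's rate."""
    return CREDITS_PER_PAGE[tool] * sum(page_counts)


# A 200-page contract and a 2-page invoice in the same batch.
batch = [200, 2]
basic = estimate_credits("document_extractor", batch)  # 202 credits
ai = estimate_credits("ai_extractor", batch)           # 606 credits
```

Because the charge is per page, the 200-page contract accounts for almost all of the cost in both cases.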

Your documents never leave our server

No AWS. No GCP. No Azure. Dedicated server in Germany.

Own VPS in Germany

Hetzner data center in Falkenstein, Germany. Your files never pass through third-party cloud services. All processing runs on dedicated hardware under German jurisdiction.

Your documents are yours

idpura processes your files and immediately deletes them from our servers. We never store your original documents under any circumstances. Extraction results are available for 24 hours for you to download, then automatically deleted. We only keep your usage history (credits used, dates, and tools) so you can review it in your dashboard.

GDPR compliant · Enterprise teams

Architecture designed to comply with GDPR. Coming soon: Clerk Organizations for team management with organization-level access control.

HTTPS / TLS 1.3 · Hetzner DE · GDPR Art. 5 · Auto-delete 24h · No third-party cloud

Frequently asked questions

What document types does idpura support?

Currently: native (digital) PDF and DOCX (Word 2007 onwards). The .doc format (Word 97-2003) is not supported. Coming soon: AI OCR for scanned documents and images, and AI Extractor for structured fields.

What is a credit and how is it calculated?

A credit equals one unit of processing. Basic tools (Document Extractor, Data Sectioning) consume 1 credit per page. AI tools (OCR, AI Extractor) consume 3 credits per page. The system shows you the exact cost before confirming processing.

Are my documents secure?

Yes. All processing happens on a dedicated VPS in Germany (Hetzner). Your files are never sent to third-party cloud services. Original files are deleted immediately after processing, and extraction results are deleted automatically after 24 hours. All traffic is encrypted over HTTPS (TLS 1.3).

Can I use idpura with my company team?

Currently it is an individual tool. Multi-user team support is on the roadmap for Q4 2026, with roles, shared credits and organization-level access control.

Does it work with scanned documents?

Coming soon. AI OCR (3 credits/page) will process scanned documents, photos and images using Gemini. Currently only native (digital) PDFs and DOCX are supported.

Is there an API to integrate into my pipeline?

The public REST API is on the roadmap for Q3-Q4 2026, available from the Business plan. It will include API keys, OpenAPI documentation and webhooks. If you have an urgent use case, contact us directly.

What's next

What's ready and what's coming.

Shipped · In progress · Up next · Planned
Q1 2026 Shipped

Document Extractor

Text + tables from PDF and DOCX to Excel, JSON and CSV. Up to 500 files per job.

Q1 2026 Shipped

Data Sectioning

Automatic extraction of variable fields between identical documents.

Q1 2026 Shipped

ES + EN interface

Full navigation in Spanish and English.

Q2 2026 In progress

Payment plans (Stripe)

Monthly and annual subscriptions with Stripe. Starter, Pro, Business plans.

Q2 2026 Up next

Pricing page

Detailed plan comparison, tools and credit table.

Q2 2026 Planned

Legal pages

Terms of service, privacy policy and GDPR compliance.

Q2-Q3 2026 Planned

AI OCR

Scanned documents and images to structured data with Gemini. 3 cr/page.

Q2-Q3 2026 Planned

AI Extractor

Specific fields from invoices, payslips and contracts with AI. 3 cr/page.

Q3-Q4 2026 Planned

Public API

REST API with API keys, OpenAPI docs and webhooks. Business plan+.

Q4 2026 Planned

Multi-user and teams

Admin/member roles, shared credits, access control. Pro plan+.

Start today. Free, no credit card.

Go to the tool

No minimum subscription. No templates. No setup. Open beta.