An on-prem intelligent automation engine that transforms legacy legal scans into a searchable, audit-grade archive with sub-second full-text discovery and language-adaptive OCR.
About the client
European legal-tech provider supporting banks, insurers, and utilities. Their on-prem DMS holds ≈ 2 million legacy pages in three working languages — English, German, and French and ingests 8,000–10,000 new documents per month (scanned contracts, legal notices, compliance certificates). All data must remain on premises, and the service-level target for full-text search is < 1 second.
About the Product
and Introduction:
Devox’s partnership with the U.S. industrial group began after a board-level referral: earlier document-automation results convinced the client to invite us in to unlock its decade-old scan archive. The mandate was clear — deliver full-text search and audit-grade traceability without altering the company’s heavily customised on-prem ERP.
We designed a sidecar Document Intelligence Layer: a Dockerised FastAPI service that tails the ERP’s export queue, cleans images, auto-detects English or Spanish, and feeds a self-trained Tesseract OCR. A 48-hour pilot on 5,000 pages hit 94% accuracy and sub-second Elasticsearch search, securing go-live approval.
Now every new scan runs a single pass — enhancement, language detection, confidence-scored OCR, live QA on low-certainty tokens, PDF/A sealing, and instant indexing — and writes straight back through the ERP’s HeadersRepository/RowsRepository. The once-static archive is fully searchable, audit-ready, and remains entirely on-prem.
Project Team
Composition:
Challenges:
Tech
Stack:
Solution:
Devox Software delivered a fully on-prem Searchable Archive IA Engine that integrates high-precision OCR into the client’s document management system, combining microservice orchestration with intelligent automation.
Results:
BUSINESS OUTCOMES
TECHNICAL OUTCOMES
Sum Up:
Static scans are now a living, searchable knowledge base — AI-OCR applies language-adaptive models on premises, confidence scores guide instant QA, and every page appears in full-text results in under a second. The engine scales on existing hardware, meets the strictest data-residency rules, and pays for itself in months.
Need the same on-prem accuracy, speed, and audit-proof traceability for your legal or compliance archive? Let’s map the first pilot and put intelligent automation to work for you.
Tell us where your system needs help — we’ll show you how to move forward with clarity and speed. From architecture to launch — we’re your engineering partner.
Book your free consultation. We’ll help you move faster, and smarter.
Share the details of your project – like scope or business challenges. Our team will carefully study them and then we’ll figure out the next move together.
We appreciate you reaching out. Your message has been received, and a member of our team will get back to you within 24 hours.
In the meantime, feel free to follow our social.
Welcome to the Devox Software community! We're excited to have you on board. You'll now receive the latest industry insights, company news, and exclusive updates straight to your inbox.