Extract text from PDF files

We can extract text from PDF files using tailor-made scripts and build up content into a database that way. This is a more powerful and more flexible alternative than using an OCR program with advanced features. It is also faster!

Extraction areas can be set using exact coordinates, or using coordinates relative to keywords, or relative to "regular expressions" (search patterns). There is also the option to search for keywords with fuzzy matching (i.e. one character may be wrong)!

Common tasks may be flexible capture of company registration numbers and/or personal identity numbers, dates, order numbers or the like.

This may, for example, concern official documents such as court rulings, account statements and so on, documents from Bolagsverket, Skatteverket and so forth.

District court judgments - extraction of defendants and injured parties from the district courts' judgments

We have extensive experience of working with judgments in particular from all district courts! We can extract the defendant and the injured party, their personal identity numbers where present, the case number and the court name and so on from all judgments published as PDF files from the country's courts. Fast and at a reasonable price!

With a 100% custom-adapted script it is usually possible to get out exactly what you want!