We are the scanning company that helps you scan books, publications, journals and documents of all kinds quickly and to a high standard through our comprehensive scanning service.
A few examples: scanning and OCR conversion of student addresses printed from the LADOK system, UHR / UHÄ (the Swedish Council for Higher Education), CSN, shareholder addresses from Euroclear, land survey data printed on paper, data from Skatteverket printed on paper, addresses of forest owners from the Swedish Forest Agency, doctors, nurses and midwives from the registers of Socialstyrelsen, politicians from the Swedish Election Authority, the dog register and sheep register at the Swedish Board of Agriculture, and so on.
We also offer survey scanning, form scanning, scanning of documents to PDF (with OCR interpretation that makes them searchable), text extraction from PDFs and more. We develop a method to capture exactly the information you want to extract from your documents, and we use every quality-assurance method available to us.
OCR scanning of address lists
We scan and OCR-interpret address lists. Our scanning service for address registers includes, among other things:
- Checking names against a personal-name database based on the Swedish population register, with over 900,000 unique name words!
- Checking street names against a street-name database from the Swedish postal service
- Checking the field length and characters in the postcode field
- Checking the town against a town database from the Swedish postal service
- Validation and correction of personal identity numbers and organisation numbers
- Removal of duplicates and/or comparison against your own register as an option!
Delivery as an Excel sheet, an Access database or a tab-separated text file.
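As an illustration of the kind of field checks listed above, here is a minimal sketch in Python. It is not our production code. It assumes the standard Luhn checksum used for Swedish personal identity numbers and a five-digit postcode with an optional space after the third digit. The function names are hypothetical, and the sketch only validates, it does not attempt correction.

```python
import re


def luhn_check_digit(nine_digits: str) -> int:
    """Compute the Luhn check digit for the first nine digits (YYMMDDNNN)."""
    total = 0
    for i, ch in enumerate(nine_digits):
        # Multiply alternately by 2 and 1, then add the digits of each product.
        product = int(ch) * (2 if i % 2 == 0 else 1)
        total += product // 10 + product % 10
    return (10 - total % 10) % 10


def is_valid_identity_number(value: str) -> bool:
    """Check length, characters and check digit of a personal identity number."""
    digits = re.sub(r"[-+\s]", "", value)
    if len(digits) == 12:
        # A 12-digit form includes the century, which is not part of the checksum.
        digits = digits[2:]
    if not re.fullmatch(r"\d{10}", digits):
        return False
    return luhn_check_digit(digits[:9]) == int(digits[9])


def is_valid_postcode(value: str) -> bool:
    """Check that the postcode field holds five digits, e.g. '114 55' or '11455'."""
    return re.fullmatch(r"\d{3} ?\d{2}", value.strip()) is not None
```

A field that fails such a check, for example a postcode where the OCR has read "5" as "S", would be flagged for manual review against the scanned image.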
OCR scanning of large volumes of data
We scan lists and tables with large volumes of numeric data. We scan at high quality and read the data with two or even three different programs in parallel. We then compare the readings to identify misreadings, verify the field content with regular expressions, and correct any errors manually against the image of the scanned data row.
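The comparison step can be sketched roughly as follows in Python. This is a simplified illustration, not our actual tooling: the function name and the rule "flag anything that is not unanimous and well formed" are assumptions made for the example.

```python
import re
from collections import Counter


def compare_readings(readings, pattern):
    """Return (most common value, needs_manual_review) for one field.

    readings: the same field as read by two or three OCR programs.
    pattern:  a regular expression the whole field must match.
    """
    counts = Counter(readings)
    best, votes = counts.most_common(1)[0]
    unanimous = votes == len(readings)
    well_formed = re.fullmatch(pattern, best) is not None
    # Any disagreement or format violation is sent to manual verification.
    return best, not (unanimous and well_formed)
```

With three readings of a numeric field, `compare_readings(["1024", "1O24", "1024"], r"\d+")` returns the majority value together with a flag saying the field should be checked against the image, because one program read the letter O instead of a zero.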
OCR scanning of structured and semi-structured lists
We convert multi-line records into single-line, field-divided records. This might, for example, involve a membership register from a printed directory. It could also be a housing register, a register of prosecutions or any other data printout with a reasonably regular appearance.
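A minimal Python sketch of this conversion is shown below. It assumes, purely for illustration, that each record in the OCR text occupies consecutive lines with one field per line and that records are separated by blank lines. Real printouts usually need rules adapted to their layout, and the function name is hypothetical.

```python
import re


def records_to_rows(text, field_count):
    """Turn blank-line-separated multi-line records into tab-separated rows."""
    rows = []
    for block in re.split(r"\n\s*\n", text.strip()):
        fields = [line.strip() for line in block.splitlines() if line.strip()]
        if not fields:
            continue
        # Pad short records so every row has at least field_count columns.
        fields += [""] * (field_count - len(fields))
        rows.append("\t".join(fields))
    return rows
```

For a made-up input of two three-line address records, the result is two rows with three tab-separated fields each, ready for delivery as a text file or import into Excel.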
OCR scanning of email addresses
This service includes extensive post-processing and quality checks to obtain email addresses that are as accurate as possible!
After OCR conversion, all email addresses are checked through a service that simulates sending to the addresses. We then examine the addresses that fail this test especially carefully for OCR errors.
Scanning and interpretation of surveys
We and our printing partner take care of all the work involved in surveys. We help you scan your survey! Skriptoriet's comprehensive scanning service for, among other things, data capture from surveys includes the following:
- A prepress department at our printing partner that adapts the survey's appearance to our rules
- Printing, enveloping, dispatch, return-mail handling, cutting off stapled spines and more through our printing partner
- Free updating of the address register ahead of reminder mailings
- Quality-assured scanning: dynamic thresholding, sequence and orientation checks
- Extremely low loss due to scanning errors!
- A3 scanning is also possible. For example, 4 A4 pages printed on 1 folded A3 sheet
- Careful verification of hand-filled data (ICR fields) such as date, age, height, etc.
- The option of extended quality control, verifying all tick boxes against the image.
- Data files in Excel and SPSS formats are always included. A TXT file is available as a free option.
- The data file in Excel format has a clickable direct link to the corresponding scanned PDF in every row
- Reports: error codes per variable, unfilled surveys, values per variable
- Open-ended answers can also be delivered merged into one long PDF file per page number, searchable by survey ID
- One of our major customer benefits is that complete image sets in both TIFF and PDF format are always included at no extra cost!
- Encrypted delivery via Dropbox, or on an encrypted USB stick
- The lowest prices in Sweden
- Very clear quotes and no surprises on the invoice!
- Very fast deliveries
- Dedicated support by phone or at your computer with TeamViewer
Interpretation with flexible data-capture technology
OCR interpretation of certain data is carried out using flexible rules instead of fixed form technology. Among other things, we use the ABBYY FlexiCapture software. A database with the corresponding data fields is filled with one data record per form or document. The interpretation can also be performed on an ordinary text PDF (one that does not contain an image), which the software then processes as an image during interpretation.
Programmatic text extraction from PDFs
We can extract text areas from PDFs programmatically using a bespoke script in VBA, and build a database that way. This might, for example, apply to public authority documents such as court judgments. Extraction areas can be set using exact coordinates, or using coordinates relative to keywords, or relative to "regular expressions" (search patterns).
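To illustrate extraction relative to a keyword or search pattern, here is a small sketch. It is written in Python for readability rather than in VBA, and it is not the script we use. It assumes the words of a page have already been obtained with their coordinates from some PDF library. The `Word` type, the coordinate convention and the function name are hypothetical.

```python
import re
from typing import NamedTuple


class Word(NamedTuple):
    text: str
    x: float  # left edge of the word (assumed unit: points)
    y: float  # vertical position of the word's line


def extract_right_of(words, anchor_pattern, max_dx, y_tolerance=2.0):
    """Return the text on the same line, up to max_dx to the right of the anchor.

    The anchor is the first word that fully matches anchor_pattern.
    """
    anchor = next((w for w in words if re.fullmatch(anchor_pattern, w.text)), None)
    if anchor is None:
        return ""
    hits = [
        w for w in words
        if abs(w.y - anchor.y) <= y_tolerance
        and anchor.x < w.x <= anchor.x + max_dx
    ]
    return " ".join(w.text for w in sorted(hits, key=lambda w: w.x))
```

An extraction area given by exact coordinates would use the same filter with a fixed rectangle in place of the anchor position.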
Renaming of PDFs using OCR-interpreted or extracted text data
Typically, a few pieces of data are OCR-interpreted and captured with flexible data-capture technology, usually from the first page of a multi-page PDF. Alternatively, a small text area is extracted programmatically from a text PDF. After checks and processing, the captured data is used to rename the original PDF file.
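The renaming step itself could look something like the Python sketch below. It assumes the data has already been captured and checked. The function names, the underscore separator and the character clean-up rule are illustrative assumptions, not a description of our actual naming rules, which follow each customer's requirements.

```python
import re
from pathlib import Path


def build_filename(fields, extension=".pdf"):
    """Join captured field values into a file name that is safe on disk."""
    parts = []
    for value in fields:
        # Replace runs of characters other than letters, digits, "_" and "-".
        cleaned = re.sub(r"[^\w-]+", "_", value.strip()).strip("_")
        if cleaned:
            parts.append(cleaned)
    if not parts:
        raise ValueError("no usable data captured for the file name")
    return "_".join(parts) + extension


def rename_pdf(path: Path, fields) -> Path:
    """Rename the PDF next to its original location, refusing to overwrite."""
    target = path.with_name(build_filename(fields, path.suffix))
    if target.exists():
        raise FileExistsError(target)
    return path.rename(target)
```

With the made-up values `["B 1234-21", "2021-03-15"]`, the resulting name is `B_1234-21_2021-03-15.pdf`.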
Scanning to PDF with sequential numbering
This service means scanning without OCR. The end product consists of images stored in either PDF or TIFF format, numbered sequentially in scanning order: 0001.PDF, 0002.PDF and so on. A common variant is to scan each multi-page document to one multi-page PDF, using separator sheets to mark where each document ends. Scanning to multi-page PDFs with automatic document splitting is well suited, for example, to due diligence, audit documentation and more.
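The numbering and the splitting on separator sheets can be sketched as follows in Python. How a separator sheet is recognised (for example by a barcode or a patch code) is left to a caller-supplied function here, which is an assumption of the sketch, as are the function names.

```python
def sequential_name(index, extension="PDF", width=4):
    """Build a zero-padded sequential file name such as 0001.PDF."""
    return f"{index:0{width}d}.{extension}"


def split_on_separators(pages, is_separator):
    """Group a stream of scanned pages into documents.

    Separator sheets end the current document and are not kept in the output.
    """
    documents, current = [], []
    for page in pages:
        if is_separator(page):
            if current:
                documents.append(current)
            current = []
        else:
            current.append(page)
    if current:
        documents.append(current)
    return documents
```

Each resulting group of pages would then be written to one multi-page PDF and named with `sequential_name(1)`, `sequential_name(2)` and so on.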
Scanning with manual indexing
This service means scanning and manual indexing of a number of predefined fields. It is often best suited to older documents with hard-to-read typewritten text or with some handwritten information that must be captured.
Scanning of minutes — historical minutes and archives
We offer a complete service for the digitisation of meeting minutes, board minutes, annual-meeting minutes and historical collections of minutes for associations, congregations, city archives, museums and companies. We can carry out both non-destructive overhead scanning of bound volumes of minutes (with our FADGI-calibrated Zeutschel OS 12002 and Bookeye 5 V1A Archive) and fast scanning of material that can be cut apart (Inotec 6x1). The service includes splitting of book spreads, division of the scanned image stream into separate PDF documents per set of minutes, naming of the files to your requirements, and uncorrected batch OCR for searchability.
OCR to PDF
Scanning and/or OCR interpretation to PDFs with invisible interpreted text beneath the image. Well suited, for example, to OCR interpretation of TIFF images or PDF images in order to create an archive that is searchable in full text.
OCR to Word
OCR of books and document texts into an editable Word file. Character verification is included.
Digitisation of letters, press cuttings, postcards, photographs and more
We help you with overhead scanning of letters, press cuttings, postcards, photographs and similar historical material!