Internet Content Conversion/ Data Entry
 Data Entry/Data Processing
 Data Cleansing/Enrichment
 
Database Development & Migration
 Online Data Entry
 HTML/XML Conversion
 Adobe PDF Conversion
 E-Book Publishing
 Scanning/ OCR/ Indexing
 Document Management
 
Our Clients
 
 
Application Development
 
 
Business Process Outsourcing
 
 
Search Engine Optimization
 
 
Multimedia & Graphics
 
   

PDF Image + Searchable Text Conversion

PDF Image + Searchable Text Conversion: (formerly known as PDF plus hidden text) contains a bitmapped image of the original, and a hidden layer of searchable text. The conversion process involves: scanning the hardcopy original, performing OCR (Optical Character Recognition) to capture the text of the document, and distilling the two layers into a PDF searchable image file. Though text can be searched, hyperlinks and bookmarks are not fully functional in this format. As with PDF image only, PDF searchable image files are only as legible as the original. And PDF searchable image files have the largest file size of the three types - this can be a big issue if the PDF document is bound for the Internet.

Pages will be displayed as image resulting in accuracy which is inherently high based on image displayed.

Text resulting from an OCR (Optical Character Recognition) process may be “bonded” to the originating image to create a PDF/Searchable Image file. When you search for words or phrases, they will be highlighted in the image. 

This background text allows searchability, but the accuracy is dependent on the quality of your originals and other factors. Based on this background text, you have two options:

  • PDF Image + Text (Raw or uncorrected OCR text)
  • PDF Image + Text (Corrected or proof-read)

For many applications, the raw conversion with uncorrected text is accurate enough. For clients needing higher accuracy rates, Suntec will correct and proofread the OCR output. This process is often vital for documents containing italicized characters and small text, or for poor-quality original documents.

PDF/Searchable Image files may be indexed for full-text retrieval by any search engine capable of indexing PDF files.

Typical applications include: 

  • business records
  • academic journals
  • advertising and promotional materials
  • historical materials and 
  • handwritten materials including color or grayscale images.

PDF/Searchable Image is used globally by governments and businesses for electronic storage and retrieval of:

  • Business Records
  • CD-ROM publishing 
  • Electronic Publishing
  • Manufacturing and design documentation
  • On-line content / Intranet content
  • Records Retention / Legacy Data Conversion
  • Delivery Challans, Shipping notes, and Invoices

PDF File Type Comparison

  Image Image + Searchable Text PDF Normal (Formatted Text & Graphics)
Accuracy Very high 
(Page is retained as image)
Very high
(Page is retained as image)
High
(in effect, re-authoring the document)
Text searchability No Yes  Yes
File size Large 
(Typically, 40-50 KB at 300 dpi without grayscale or color images)
Large 
(Typically, 50-60 KB at 300 dpi without grayscale or color images)
Small size
(Typically, 4–6 kb per page for simple documents)
Typical Application Budget friendly archiving Full-text search for bitonal files Tiny but rich files - great for the web
Cost  Low Medium High


 

We look forward to working for you. Getting in touch is easy — please contact us at:

Tel: 91-9810162968
Online: Request Information
Email: info@suntecindia.com

 
Why Adobe Acrobat PDF?
Print-on Demand (POD PDF)

 
PDF Conversion Options
PDF Image
PDF Image+ Searchable Text
PDF Multi-Resolution Image
PDF Formatted Text & Graphics

 
Issues and Considerations when choosing the type of PDF
  • Bandwidth
  • Text Searchability
  • Color or half-tone images
  • Document size

 

We sent a complex PDF conversion job in two languages and the results were impeccable.

We will definitely use Suntec for any future works of this kind.

Josh Merrow
Barcelona, Spain

Thanks for the great job your company did for us in converting our text document and turning it into a first class .pdf eBook.

Your fast turn around time and efficiency, plus your willingness to make all the changes we requested made our working with you a very pleasurable experience.

Without your expertise in document management we would not have had our project finished in such a timely and superb manner.

Laurette and Marilyn

 

Litigation Support Services
Accounting/Financial Services
Medical Records Processing
Search Engine Optimization
Data Cleansing/Enrichment

 
Copyright © 2005 Suntec. All rights reserved.