Archive for the ‘Paperless Office’ Category

Create Searchable PDF Images Using OCR Software

Sunday, June 17th, 2007

You know you want to get rid of the paper and the file cabinets and the endless hours of searching for lost records, so you’ve decided to go paperless. Why stop there? Scanning papers just guarantees that you’re going to have a bunch of image files on your network, but you can search a picture. To run searches, you need text. That’s where OCR software comes in.

OCR stands for Optical Character Recognition. It’s the function that interprets graphical marks as letters forming words. Of course, that recognition is seldom 100% perfect, but it’s getting better. What you will want is an application that will run the OCR function in the background and then append the recognized text to the PDF image to create one searchable file.

Several OCR softwares are available, including OmniPage, Readiris, Presto, and Microsoft Office Document Imaging. What you will want to do is implement a solution that you can use in conjunction with your high-volume copier/scanners.

I’ll be writing more on this topic soon. Really, the point of this entry is to alert you to the fact that not all PDF files are the same: some are searchable and some aren’t. If you want the searchable kind (recommended for a paperless office), you’ll need OCR software to get it.

Scan to FTP Folders

Thursday, November 2nd, 2006

Many network scanning devices are configured to send scan jobs to users’ email inboxes. Since most companies run Microsoft networks and use Outlook, they don’t run into compatibility problems with this solution, and there’s a smaller setup cost. But what if you aren’t using Outlook? Or what if you simply don’t want a solution that’s going to bog down your mail server even more? After all, scans are image files, and those can get pretty huge. A better solution is to scan to a network folder using FTP (File Transfer Protocol). Here’s how it works.

Your network engineer enables an FTP server on your network and designates a mapped, shared network drive and folder as the target FTP folder (e.g., F:\FTP) with a login for authentication. Since your network is behind a firewall, all you need to know is the internal IP address of the FTP server. Anything sent to that address via FTP will go directly into the designated folder (F:\FTP, in this example).

Most high-volume copiers also have scanning capabilities and will scan to email or to an FTP address. If they also have a web-interface for administrative use, you can configure these settings via a web browser, assuming you’ve assigned static IP addresses to your network copier/scanners.

On Sharp copiers, for example, scanning is an “Image Send” function, and the user simply has to pick an “address,” which could be an FTP folder or subfolder. In most environments, you’ll want a separate scanning “inbox” for each of your users. With Sharp, this is easy to set up, although it has to be done on each machine and for each user.

Access the copier’s web configuration page by typing the copier’s static IP address in a browser window. Enter an administrative login (which your Sharp vendor should give you). Then go to the Image Send / Destination page. Select options to create a new FTP destination and then enter user ID (which will appear as a folder tab on the copier), the internal IP address for the FTP server, login information, and subfolder information (which is how you create separate inboxes for each of your users).

Say you want to give both Harry and Sally separate inboxes. From a workstation, just browse to F:\FTP and create subfolders called “Harry” and “Sally.” The path to Harry’s inbox would then be F:\FTP\Harry, and that’s how he would access his inbox for scans. Sally would browse to F:\FTP\Sally.

Remember that your network engineer has already configured your FTP server so that all FTP jobs go directly to F:\FTP. That means that when configuring each user’s destination folder on the copier’s web configuration page, you only need to enter their subfolder. For destination “Harry,” for example, you would simply have to enter “Harry\” in the subfolder information field. For Sally’s destination, you would just enter “Sally\” in the subfolder field.

Once those subfolders are created on the network and the Destinations are configured for the copier/scanner, your users would simply press the Image Send function at the Sharp copier, select Address, and then select their named tab (e.g., Harry). Then the Go button to run the scan job.

That’s basically it. Once they return to their desks, they can access their respective scanning inboxes by simply browsing to them. From there, they can rename the files and move them to whatever location they need to be saved, which involves basic file management techniques.

Do I Need a Document Management System?

Saturday, September 2nd, 2006

First things first: a Document Management System (DMS) is just a glorified database that can help your office file, categorize, summarize, search for, and find the tens of thousands of computer files you’re likely to have once you really go paperless. Furthermore, because all file storage takes place on the back end, users can’t as easily move or delete entire directory folders. That’s the up side.

The downside? Well, several disadvantages come to mind.

  • High Cost: DMS solutions can be very expensive.
  • No Interoperability: Once you lock into a DMS solution you’re stuck with it and its updates since the database produced by one DMS can’t be imported into or read by another DMS.
  • Time-Consuming: Although DMS solutions are meant to save you time in searching for documents, you and your staff need to profile each document you put in the system, and that can take more time than you think.
  • High Learning Curve: Everyone will have to learn the new system in order to profile new documents and bring up existing ones.

The real advantages are in being able to protect the directory structure and in being able to search documents. For the former, adequate training and a backup protocol are required to, respectively, minimize mistakes and recover from them. As for making documents easier to find: you can actually implement this without a DMS. In fact, some DMS solutions depend on other softwares to enable indexing features.

So bottom line: no, you do not need a DMS. You may want one, regardless, for the advantages noted above, but you definitely don’t need one, especially if cost is a consideration.