DocumentVision

DocumentVision is a Node.js library for processing and understanding scanned documents.

Features

Image loading using jpgd, LodePNG and pixel buffers
Image manipulation using Leptonica (Version 1.69)
OCR using Tesseract (Version 3.02)
OMR for Barcodes using ZXing (Version 2.10 with PDF417 patches applied)

Installation

[sudo] npm install [-g] dv
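To confirm that the install worked, a small check like the one below can help. This is only a sketch: `isInstalled` is a hypothetical helper name, not part of dv, and it relies solely on Node's built-in `require.resolve`. Note that a global install (`-g`) is usually not resolvable from a project directory unless `NODE_PATH` points at the global module folder.

```javascript
// Hypothetical helper (not part of dv): returns true when `name`
// can be resolved by Node's module loader from this location.
function isInstalled(name) {
  try {
    require.resolve(name);
    return true;
  } catch (err) {
    // require.resolve throws when the module cannot be found.
    return false;
  }
}

console.log(isInstalled('dv') ? 'dv is installed' : 'dv not found; try: npm install dv');
```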

Quick Start

Once you've installed dv, download that image and save it as textpage300.png, the file name the code below expects. Any other image containing simple text at 300 dpi or higher works too. Now run the following code snippet to recognize the text in your image:

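var dv = require('dv');
var fs = require('fs');

// Load the PNG file into a dv image.
var image = new dv.Image('png', fs.readFileSync('textpage300.png'));
// Run Tesseract with the English ('eng') language data.
var tesseract = new dv.Tesseract('eng', image);
// Print the recognized text as plain text.
console.log(tesseract.findText('plain'));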
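If you want to reuse these steps for several images, one option is to wrap them in a function. The sketch below is an assumption, not part of the dv API: `recognizeText` and its parameters are hypothetical names, and the only dv calls it makes are the three shown in the Quick Start (`new dv.Image('png', buffer)`, `new dv.Tesseract(language, image)` and `findText('plain')`). The dv module is passed in as an argument so the function can be exercised without the native bindings.

```javascript
// Hypothetical wrapper around the Quick Start steps (not part of dv).
// dv:        the module returned by require('dv')
// pngBuffer: a Buffer holding PNG data
// language:  Tesseract language code; defaults to 'eng' as in the Quick Start
function recognizeText(dv, pngBuffer, language) {
  if (!Buffer.isBuffer(pngBuffer) || pngBuffer.length === 0) {
    throw new TypeError('expected a non-empty Buffer of PNG data');
  }
  var image = new dv.Image('png', pngBuffer);
  var tesseract = new dv.Tesseract(language || 'eng', image);
  return tesseract.findText('plain');
}

// Usage, assuming dv is installed and textpage300.png exists:
//   var dv = require('dv');
//   var fs = require('fs');
//   console.log(recognizeText(dv, fs.readFileSync('textpage300.png')));
```

Checking the buffer up front gives a readable error for a missing or empty file read, instead of whatever the native image loader would report.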

What's next?

Here are some quick links to help you get started:
