DocumentVision

DocumentVision is a node.js library for processing and understanding scanned documents.

Features

  • Image loading using jpgd, LodePNG and pixel buffers
  • Image manipulation using Leptonica (Version 1.69)
  • OCR using Tesseract (Version 3.02)
  • OMR for Barcodes using ZXing (Version 2.10 with PDF417 patches applied)

Installation

[sudo] npm install [-g] dv

Quick Start

Once you've installed dv, download a sample image; the code below expects it to be saved as textpage300.png. Any other image containing simple text at 300 dpi or higher works as well. Now run the following code snippet to recognize text from your image:

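var dv = require('dv');
var fs = require('fs');
// Load the PNG file into a dv image.
var image = new dv.Image('png', fs.readFileSync('textpage300.png'));
// Run English OCR on the image and print the recognized plain text.
var tesseract = new dv.Tesseract('eng', image);
console.log(tesseract.findText('plain'));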
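
If you want to try the snippet on files other than textpage300.png, it can be wrapped in a small function. The following is only a sketch: imageFormatFor and recognizeText are hypothetical helper names, not part of dv, and the 'jpg' format string is an assumption based on the JPEG loading listed under Features.

```javascript
var fs = require('fs');
var path = require('path');

// Hypothetical helper (not part of dv): choose the format string passed to
// dv.Image from the file extension. 'png' is the value used in the Quick
// Start; 'jpg' is an assumption.
function imageFormatFor(filename) {
  var ext = path.extname(filename).toLowerCase();
  if (ext === '.png') {
    return 'png';
  }
  if (ext === '.jpg' || ext === '.jpeg') {
    return 'jpg';
  }
  throw new Error('Unsupported image type: ' + ext);
}

// Hypothetical helper (not part of dv): run the Quick Start steps on any file.
function recognizeText(filename, language) {
  // dv is required here rather than at the top so that imageFormatFor
  // can be used on its own.
  var dv = require('dv');
  var image = new dv.Image(imageFormatFor(filename), fs.readFileSync(filename));
  var tesseract = new dv.Tesseract(language || 'eng', image);
  return tesseract.findText('plain');
}
```

With dv installed, console.log(recognizeText('textpage300.png')) should behave like the Quick Start snippet above.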

What's next?

Here are some quick links to help you get started:

License

Licensed under the incredibly permissive MIT License. Copyright © 2012 Christoph Schulz.
