Skip to content

Commit

Permalink
Merge pull request #1 from edsu/master
Browse files Browse the repository at this point in the history
README.md
  • Loading branch information
tmbdev committed Nov 1, 2014
2 parents 8de0c4f + 2e1a128 commit 9c1e317
Showing 1 changed file with 1 addition and 3 deletions.
4 changes: 1 addition & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

hOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information co-exist in the same file and survives editing and manipulation. hOCR markup is independent of the presentation.

There is a [http://docs.google.com/View?docid=dfxcv4vc_67g844kf Public Specification for the hOCR Format].
There is a [Public Specification](http://docs.google.com/View?docid=dfxcv4vc_67g844kf) for the hOCR Format.

# Available Programs

Expand All @@ -16,5 +16,3 @@ Included command line programs:
* hocr-split -- split an hOCR file into individual pages
* hocr-merge-dc -- merge Dublin Core meta data into the hOCR HTML header

See the [CommandLine] Wiki page for more information.

0 comments on commit 9c1e317

Please sign in to comment.