-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathFoLiA-hocr.1
89 lines (70 loc) · 1.29 KB
/
FoLiA-hocr.1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
.TH FoLiA-hocr 1 "2020 jan 19"
.SH NAME
FoLiA-hocr - Convert HOCR files into FoLiA.
.SH SYNOPSIS
FoLiA-hocr [options] FILE
FoLiA-hocr [options] DIR
.SH DESCRIPTION
When a DIR is provided,
.B FoLiA-hocr
will process all files in DIR and store the result in DIR, or in
the directory provided with
.B -O
When a FILE is provided,
.B FoLiA-hocr
will process that file and store its result in the directory where FILE is
found or in the directory provided with
.B -O
All files should be either HTML (.html), XHTML (.xhtml) or BZIP2 or GZ versions
of (X)HTML.
When the input file(s) are zipped, the output will be too.
.SH OPTIONS
.B --compress
kind
.RS
Use BZIP2 compression (kind=b) or GZIP compression (kind=g)
.RE
.B --setname
set
.RS
The FoLiA setname of the <str> nodes that are created. (default FoLiA-hocr-set).
.RE
.B --class
classname
.RS
The FoLiA classname of the <t> nodes that are created. (default OCR).
.RE
.B -O
prefix
.RS
use 'prefix' as a directory name to store the hocr-ed files in.
.RE
.B -t
or
.B --threads
number
.RS
Number of concurrent threads to be used by the program.
.RE
.B -v
.RS
be more verbose.
.RE
.B -V
or
.B --version
.RS
Show VERSION
.RE
.B -h
or
.B --help
.RS
Show some help
.RE
.SH BUGS
possible
.SH AUTHORS
Ko van der Sloot
Martin Reynaert
email: [email protected]