-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathFoLiA-abby.1
114 lines (92 loc) · 1.91 KB
/
FoLiA-abby.1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
.TH FoLiA-abby 1 "2023 may 09"
.SH NAME
FoLiA-abby - Convert ABBY files into FoLiA.
.SH SYNOPSIS
FoLiA-abby [options] FILE
FoLiA-abby [options] DIR
.SH DESCRIPTION
When a DIR is provided,
.B FoLiA-abby
will process all files in DIR and store the result in DIR, or in
the directory provided with
.B -O
When a FILE is provided,
.B FoLiA-abby
will process that file and store its result in the directory where FILE is
found or in the directory provided with
.B -O
All files should be in ABBY XML format or BZIP2 or GZ versions thereof.
When the input file(s) are zipped, the output will be too.
.SH OPTIONS
.B --compress
kind
.RS
Use BZIP2 compression (kind=b) or GZIP compression (kind=g)
.RE
.B -O
outdir
.RS
use 'outdir' as a directory name to store the abby-ed files in.
.RE
.B --prefix
prefix
.RS
use 'prefix' as a prefix for ALL xml:id's created. Default is FA-. Use 'none'
for no prefix. A prefix may be needed for files with non NCname characters in
their names.
.RE
.B -S
or
.B --setname
set
.RS
The FoLiA set of the <str> nodes that are created. (default FoLiA-abby-set).
.RE
.B -C
or
.B --class
classname
.RS
The FoLiA classname of the <str> nodes that are created. (default OCR).
.RE
.B --addbreaks
.RS
Normally we make as much 'running' text as possible, concatenating line-parts
into longer lines, when no change in style triggers a new part.
With this option, we insert FoLiA <br/> nodes when newlines are encountered.
.RE
.B --addmetrics
.RS
add metric information to every part. These are the posiions on the page of the parts. The information is collected from the first and last characters in that
part.
.RE
.B -t
or
.B --threads
number
.RS
Number of concurrent threads to be used by the program.
.RE
.B -v
.RS
be more verbose.
.RE
.B -V
or
.B
--version
.RS
Show VERSION
.RE
.B -h
or
.B --help
.RS
Show some help
.RE
.SH BUGS
possible
.SH AUTHORS
Ko van der Sloot
Martin Reynaert
e\-mail: [email protected]