source: trunk/gsdl/src/mgpp/text/mg_stem_idx.1@ 879

Last change on this file since 879 was 856, checked in by sjboddie, 24 years ago

Rodgers new C++ mg

  • Property svn:executable set to *
  • Property svn:keywords set to Author Date Id Revision
File size: 2.2 KB
Line 
1.\"------------------------------------------------------------
2.\" Id - set Rv,revision, and Dt, Date using rcs-Id tag.
3.de Id
4.ds Rv \\$3
5.ds Dt \\$4
6..
7.Id $Id: mg_stem_idx.1 856 2000-01-14 02:26:25Z sjboddie $
8.\"------------------------------------------------------------
9.TH mg_stem_idx 1 \*(Dt CITRI
10.SH NAME
11mg_stem_idx \- builds a stem index file
12.SH SYNOPSIS
13.B mg_stem_idx
14[
15.B \-h
16]
17[
18.BI \-b " entries-per-block"
19]
20.if n .ti +12n
21[
22.BI \-a " stemmer"
23]
24[
25.BI \-d " directory"
26]
27.if n .ti +12n
28.B \-s 1|2|3
29.BI \-f " name"
30.SH DESCRIPTION
31.B mg_stem_idx
32generates a stem index file for a collection.
33This program should be called three times: once for each
34.B -s
35parameter. It uses the stemmed dictionary to create the stem index which
36contains pointers into the stemmed dictionary.
37.SH OPTIONS
38Options may appear in any order.
39.TP "\w'\fB\-d\fP \fIdirectoryyyyyyyy\fP'u+2n"
40.B \-h
41This displays a usage line on
42.IR stderr .
43.TP
44.BI \-b " entries-per-block"
45The dictionary is stored in blocks on disk; this option is used to set
46the number of entries per block. The default is 16.
47.TP
48.BI \-a " stemmer"
49The name of the stemmer to use, the default is the Lovin stemmer.
50.TP
51.B -s 1|2|3
52The stem method to apply for the stem index.
53.br
541 = casefolded and non-stemmed
55.br
562 = non-casefolded and stemmed
57.br
583 = casefolded and stemmed
59.TP
60.BI \-d " directory"
61This specifies the directory where the document collection can be found.
62.TP
63.BI \-f " name"
64This specifies the base name of the document collection.
65.SH ENVIRONMENT
66.TP "\w'\fBMGDATA\fP'u+2n"
67.SB MGDATA
68If this environment variable exists, then its value is used as the
69default directory where the mg
70collection files are. If this variable does not exist, then the
71directory \*(lq\fB.\fP\*(rq is used by default. The command line
72option
73.BI \-d " directory"
74overrides the directory in
75.BR MGDATA .
76.SH FILES
77.TP 22
78.B *.invf.dict
79Compressed stemmed dictionary.
80.TP
81.B *.invf.dict.blocked.1
82Stem index with stem index method 1.
83.TP
84.B *.invf.dict.blocked.2
85Stem index with stem index method 2.
86.TP
87.B *.invf.dict.blocked.3
88Stem index with stem index method 3.
89.SH "SEE ALSO"
90.na
91.BR mg_compression_dict (1),
92.BR mg_fast_comp_dict (1),
93.BR mg_invf_dict (1),
94.BR mg_passes (1),
95.BR mg_perf_hash_build (1),
96.BR mg_weights_build (1)
Note: See TracBrowser for help on using the repository browser.