source: trunk/gsdl/src/mgpp/text/mg_invf_dict.1@ 856

Last change on this file since 856 was 856, checked in by sjboddie, 24 years ago

Rodgers new C++ mg

  • Property svn:executable set to *
  • Property svn:keywords set to Author Date Id Revision
File size: 2.0 KB
Line 
1.\"------------------------------------------------------------
2.\" Id - set Rv,revision, and Dt, Date using rcs-Id tag.
3.de Id
4.ds Rv \\$3
5.ds Dt \\$4
6..
7.Id $Id: mg_invf_dict.1 856 2000-01-14 02:26:25Z sjboddie $
8.\"------------------------------------------------------------
9.TH mg_invf_dict 1 \*(Dt CITRI
10.SH NAME
11mg_invf_dict \- build an `on-disk' stemmed dictionary
12.SH SYNOPSIS
13.B mg_invf_dict
14[
15.B \-h
16]
17[
18.BI \-b " entries-per-block"
19]
20.if n .ti +13n
21[
22.BI \-d " directory"
23]
24.BI \-f " name"
25.SH DESCRIPTION
26.B mg_invf_dict
27builds an `on-disk' stemmed dictionary from the
28compressed stemmed dictionary that can be used during querying.
29Each entry in the dictionary contains the disk
30pointer to the associated inverted file entry, and various other pieces
31of information about the stemmed word. The dictionary is blocked with
32each block containing the same number of entries (not the same size).
33.SH OPTIONS
34Options may appear in any order.
35.TP "\w'\fB\-d\fP \fIdirectoryyyyyyyy\fP'u+2n"
36.B \-h
37This displays a usage line on
38.IR stderr .
39.TP
40.BI \-b " entries-per-block"
41The dictionary is stored in blocks on disk; this option is used to
42specify the number of entries per block. The default is 16.
43.TP
44.BI \-d " directory"
45This specifies the directory where the document collection can be found.
46.TP
47.BI \-f " name"
48This specifies the base name of the document collection.
49.SH ENVIRONMENT
50.TP "\w'\fBMGDATA\fP'u+2n"
51.SB MGDATA
52If this environment variable exists, then its value is used as the
53default directory where the mg
54collection files are. If this variable does not exist, then the
55directory \*(lq\fB.\fP\*(rq is used by default. The command line
56option
57.BI \-d " directory"
58overrides the directory in
59.BR MGDATA .
60.SH FILES
61.TP 20
62.B *.invf
63Inverted file.
64.TP
65.B *.invf.dict
66Compressed stemmed dictionary.
67.TP
68.B *.invf.dict.blocked
69The `on-disk' stemmed dictionary.
70.TP
71.B *.invf.idx
72The index into the inverted file.
73.SH "SEE ALSO"
74.na
75.BR mg_compression_dict (1),
76.BR mg_fast_comp_dict (1),
77.BR mg_passes (1),
78.BR mg_perf_hash_build (1),
79.BR mg_stem_idx (1),
80.BR mg_weights_build (1)
Note: See TracBrowser for help on using the repository browser.