.\"------------------------------------------------------------
.\" Id - set Rv,revision, and Dt, Date using rcs-Id tag.
.de Id
.ds Rv \\$3
.ds Dt \\$4
..
.\"------------------------------------------------------------
.TH mgpp_invf_dict 1 \*(Dt CITRI
.SH NAME
mgpp_invf_dict \- build an `on-disk' stemmed dictionary
.SH SYNOPSIS
.B mgpp_invf_dict
[
.B \-h
]
[
.BI \-b " entries-per-block"
]
.if n .ti +13n
[
.BI \-d " directory"
]
.BI \-f " name"
.SH DESCRIPTION
.B mgpp_invf_dict
builds an `on-disk' stemmed dictionary, for use during querying, from
the compressed stemmed dictionary.  Each entry in the dictionary
contains the disk pointer to the associated inverted file entry and
various other pieces of information about the stemmed word.  The
dictionary is divided into blocks, each containing the same number of
entries; the blocks need not be the same size.
.SH OPTIONS
Options may appear in any order.
.TP "\w'\fB\-d\fP \fIdirectoryyyyyyyy\fP'u+2n"
.B \-h
This displays a usage line on
.IR stderr .
.TP
.BI \-b " entries-per-block"
The dictionary is stored in blocks on disk; this option specifies the
number of entries per block.  The default is 16.
.TP
.BI \-d " directory"
This specifies the directory where the document collection can be found.
.TP
.BI \-f " name"
This specifies the base name of the document collection.
.SH ENVIRONMENT
.TP "\w'\fBMGDATA\fP'u+2n"
.SB MGDATA
If this environment variable exists, its value is used as the default
directory for the mgpp collection files.  If it does not exist, the
directory \*(lq\fB.\fP\*(rq is used by default.  The command line
option
.BI \-d " directory"
overrides the directory in
.BR MGDATA .
.SH FILES
.TP 20
.B *.invf
Inverted file.
.TP
.B *.invf.dict
Compressed stemmed dictionary.
.TP
.B *.invf.dict.blocked
The `on-disk' stemmed dictionary.
.TP
.B *.invf.idx
The index into the inverted file.
.SH "SEE ALSO"
.na
.BR mgpp_compression_dict (1),
.BR mgpp_fast_comp_dict (1),
.BR mgpp_passes (1),
.BR mgpp_perf_hash_build (1),
.BR mgpp_stem_idx (1),
.BR mgpp_weights_build (1)