source: tags/gsdl-2_30d-distribution/gsdl/perllib/Kea-1.1.4/cstr.desc@ 2308

Last change on this file since 2308 was 1972, checked in by jmt14, 23 years ago

* empty log message *

  • Property svn:keywords set to Author Date Id Revision
File size: 2.3 KB
Line 
1 TYA 0.7 (for J116) loaded. Copyright (c) 1997,98 The TYA Team
2Building classifier and saving it...
3Loading data...
4Counting positive examples...
5Looking for missed keyphrases in the dataset...
6Found 130 documents in the dataset.
7Total number of examples missing: 211
8Total number of positive examples: 668
9Deleting columns...
10Discretizing columns...
11Building classifier...
12
13Naive Bayes
14
15Class yes: P(C) = 0.00481649
16
17Attribute keyword_freq
18'(-inf-0]' '(0-17]' '(17-18]' '(18-inf)'
190.39479393 0.52711497 0.01952278 0.05856833
20
21Attribute tfidf
22'(-inf-0.000347]' '(0.000347-0.00237]' '(0.00237-0.00512]' '(0.00512-0.0126]' '(0.0126-0.0474]' '(0.0474-inf)'
230.00647948 0.1663067 0.22678186 0.28293737 0.24406048 0.07343413
24
25Attribute first_occurrence
26'(-inf-0.00207]' '(0.00207-0.0103]' '(0.0103-0.0343]' '(0.0343-0.142]' '(0.142-inf)'
270.21861472 0.20995671 0.24242424 0.19264069 0.13636364
28
29
30
31Class no: P(C) = 0.99518351
32
33Attribute keyword_freq
34'(-inf-0]' '(0-17]' '(17-18]' '(18-inf)'
350.86182702 0.13687325 0.00001057 0.00128916
36
37Attribute tfidf
38'(-inf-0.000347]' '(0.000347-0.00237]' '(0.00237-0.00512]' '(0.00512-0.0126]' '(0.0126-0.0474]' '(0.0474-inf)'
390.22192166 0.52464681 0.16605556 0.06533385 0.02027748 0.00176464
40
41Attribute first_occurrence
42'(-inf-0.00207]' '(0.00207-0.0103]' '(0.0103-0.0343]' '(0.0343-0.142]' '(0.142-inf)'
430.01623061 0.04124223 0.10043747 0.2244706 0.61761909
44
45
46Finding most probable keyphrases...
47Finding best cutoffs...
48Best rank cutoff: 8 Best prob cutoff: 0.25999999999999934 F-Measure: 0.22979397781299524
49Classifying phrases...
50Preparing data...
51Counting positive examples...
52Looking for missed keyphrases in the dataset...
53Found 500 documents in the dataset.
54Total number of examples missing: 809
55Total number of positive examples: 2709
56Deleting columns...
57Finding most probable keyphrases...
58Computing statistics...
59Rank cutoff: 8
60Probability cutoff: 0.25999999999999934
61F-Measure: 0.22598870056497175
62Rank Prec. Conf.
631 0.308 0.0406 &&&
642 0.287 0.0291 &&&
653 0.26 0.0237 &&&
664 0.2395 0.0196 &&&
675 0.2288 0.0174 &&&
686 0.2177 0.0159 &&&
697 0.2071 0.0143 &&&
708 0.1913 0.013 &&&
719 0.1791 0.0117 &&&
7210 0.1696 0.011 &&&
7311 0.1622 0.0102 &&&
7412 0.1567 0.0097 &&&
7513 0.1486 0.0091 &&&
7614 0.1436 0.0086 &&&
7715 0.1377 0.0081 &&&
7816 0.1325 0.0077 &&&
7917 0.128 0.0073 &&&
8018 0.1236 0.0071 &&&
8119 0.1195 0.0068 &&&
8220 0.1146 0.0065 &&&
Note: See TracBrowser for help on using the repository browser.