root/gs3-extensions/solr/trunk/src/collect/solr-jdbm-demo/etc/conf/lang/stopwords_sv.txt @ 30001

Revision 30001, 3.4 KB (checked in by ak19, 4 years ago)

Final commit (I think) to get the update to Solr's getTerms() working on a gs3 checkout. The solr-jdbm-demo collection needed to be rebuilt with the changes to the index. This time, other .xml files from the lucene/solr upgrade were added to the collection, and schema.xml and solrconfig.xml were updated. The latter is especially necessary, as it uses the new Greenstone custom SearchHandler to get getTerms() to work.

 | From svn.tartarus.org/snowball/trunk/website/algorithms/swedish/stop.txt
 | This file is distributed under the BSD License.
 | See http://snowball.tartarus.org/license.php
 | Also see http://www.opensource.org/licenses/bsd-license.html
 |  - Encoding was converted to UTF-8.
 |  - This notice was added.
 |
 | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"

 | A Swedish stop word list. Comments begin with vertical bar. Each stop
 | word is at the start of a line.

 | This is a ranked list (commonest to rarest) of stopwords derived from
 | a large text sample.

 | Swedish stop words occasionally exhibit homonym clashes. For example
 |  så = so, but also seed. These are indicated clearly below.

och            | and
det            | it, this/that
att            | to (with infinitive)
i              | in, at
en             | a
jag            | I
hon            | she
som            | who, that
han            | he
på             | on
den            | it, this/that
med            | with
var            | where, each
sig            | him(self) etc
för            | for
så             | so (also: seed)
till           | to
är             | is
men            | but
ett            | a
om             | if; around, about
hade           | had
de             | they, these/those
av             | of
icke           | not, no
mig            | me
du             | you
henne          | her
då             | then, when
sin            | his
nu             | now
har            | have
inte           | inte någon = no one
hans           | his
honom          | him
skulle         | 'sake'
hennes         | her
där            | there
min            | my
man            | one (pronoun)
ej             | nor
vid            | at, by, on (also: vast)
kunde          | could
något          | some etc
från           | from, off
ut             | out
när            | when
efter          | after, behind
upp            | up
vi             | we
dem            | them
vara           | be
vad            | what
över           | over
än             | than
dig            | you
kan            | can
sina           | his
här            | here
ha             | have
mot            | towards
alla           | all
under          | under (also: wonder)
någon          | some etc
eller          | or (else)
allt           | all
mycket         | much
sedan          | since
ju             | why
denna          | this/that
själv          | myself, yourself etc
detta          | this/that
åt             | to
utan           | without
varit          | was
hur            | how
ingen          | no
mitt           | my
ni             | you
bli            | to be, become
blev           | from bli
oss            | us
din            | thy
dessa          | these/those
några          | some etc
deras          | their
blir           | from bli
mina           | my
samma          | (the) same
vilken         | who, that
er             | you, your
sådan          | such a
vår            | our
blivit         | from bli
dess           | its
inom           | within
mellan         | between
sådant         | such a
varför         | why
varje          | each
vilka          | who, that
ditt           | thy
vem            | who
vilket         | who, that
sitta          | his
sådana         | such a
vart           | each
dina           | thy
vars           | whose
vårt           | our
våra           | our
ert            | your
era            | your
vilkas         | whose
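The file's header describes its format: comments begin with a vertical bar, and each stop word sits at the start of a line. As a rough illustration of how a reader of this format might extract the words, here is a minimal Python sketch. The function name `parse_snowball_stopwords` and the inline sample are hypothetical, and this is not the code Solr's StopFilterFactory runs; it only follows the rules stated in the header.

```python
def parse_snowball_stopwords(text):
    """Return the set of stop words found in a Snowball-format word list."""
    words = set()
    for line in text.splitlines():
        # Everything from the first vertical bar onward is a comment.
        entry = line.split("|", 1)[0]
        # What remains is whitespace-separated stop words (usually just one);
        # blank and comment-only lines contribute nothing.
        words.update(entry.split())
    return words


# Hypothetical sample: a few lines in the same layout as the file above.
sample = """\
 | A Swedish stop word list. Comments begin with vertical bar.

och            | and
på             | on
så             | so (also: seed)
"""

stopwords = parse_snowball_stopwords(sample)
print(sorted(stopwords))
```

The file must be decoded as UTF-8 before parsing (for example `open(path, encoding="utf-8")`), otherwise words such as `på` and `så` will not match the tokens they are meant to filter.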