source: gs3-extensions/solr/trunk/src/collect/solr-jdbm-demo/etc/conf/lang/stopwords_sv.txt@ 30001

Last change on this file since 30001 was 30001, checked in by ak19, 9 years ago

Final commit (I think) to get update to solr getTerms() to work on gs3 checkout. The solr-jdbm-demo collection needed to be rebuilt with the changes to the index. This time added in other .xml files from the lucene/solr upgrade to the colleciton, and updated schema.xml and solrconfig.xml. This last is especially necessary as it uses the new Greenstone custom SearchHandler to get getTerms() to work.

File size: 3.4 KB
Line 
1 | From svn.tartarus.org/snowball/trunk/website/algorithms/swedish/stop.txt
2 | This file is distributed under the BSD License.
3 | See http://snowball.tartarus.org/license.php
4 | Also see http://www.opensource.org/licenses/bsd-license.html
5 | - Encoding was converted to UTF-8.
6 | - This notice was added.
7 |
8 | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
9
10 | A Swedish stop word list. Comments begin with vertical bar. Each stop
11 | word is at the start of a line.
12
13 | This is a ranked list (commonest to rarest) of stopwords derived from
14 | a large text sample.
15
16 | Swedish stop words occasionally exhibit homonym clashes. For example
17 | så = so, but also seed. These are indicated clearly below.
18
19och | and
20det | it, this/that
21att | to (with infinitive)
22i | in, at
23en | a
24jag | I
25hon | she
26som | who, that
27han | he
28på | on
29den | it, this/that
30med | with
31var | where, each
32sig | him(self) etc
33för | for
34så | so (also: seed)
35till | to
36Àr | is
37men | but
38ett | a
39om | if; around, about
40hade | had
41de | they, these/those
42av | of
43icke | not, no
44mig | me
45du | you
46henne | her
47då | then, when
48sin | his
49nu | now
50har | have
51inte | inte någon = no one
52hans | his
53honom | him
54skulle | 'sake'
55hennes | her
56dÀr | there
57min | my
58man | one (pronoun)
59ej | nor
60vid | at, by, on (also: vast)
61kunde | could
62något | some etc
63från | from, off
64ut | out
65nÀr | when
66efter | after, behind
67upp | up
68vi | we
69dem | them
70vara | be
71vad | what
72över | over
73Àn | than
74dig | you
75kan | can
76sina | his
77hÀr | here
78ha | have
79mot | towards
80alla | all
81under | under (also: wonder)
82någon | some etc
83eller | or (else)
84allt | all
85mycket | much
86sedan | since
87ju | why
88denna | this/that
89sjÀlv | myself, yourself etc
90detta | this/that
91Ã¥t | to
92utan | without
93varit | was
94hur | how
95ingen | no
96mitt | my
97ni | you
98bli | to be, become
99blev | from bli
100oss | us
101din | thy
102dessa | these/those
103några | some etc
104deras | their
105blir | from bli
106mina | my
107samma | (the) same
108vilken | who, that
109er | you, your
110sådan | such a
111vår | our
112blivit | from bli
113dess | its
114inom | within
115mellan | between
116sådant | such a
117varför | why
118varje | each
119vilka | who, that
120ditt | thy
121vem | who
122vilket | who, that
123sitta | his
124sådana | such a
125vart | each
126dina | thy
127vars | whose
128vårt | our
129våra | our
130ert | your
131era | your
132vilkas | whose
133
Note: See TracBrowser for help on using the repository browser.