Line | |
---|
1 | | From svn.tartarus.org/snowball/trunk/website/algorithms/swedish/stop.txt
|
---|
2 | | This file is distributed under the BSD License.
|
---|
3 | | See http://snowball.tartarus.org/license.php
|
---|
4 | | Also see http://www.opensource.org/licenses/bsd-license.html
|
---|
5 | | - Encoding was converted to UTF-8.
|
---|
6 | | - This notice was added.
|
---|
7 | |
|
---|
8 | | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
|
---|
9 |
|
---|
10 | | A Swedish stop word list. Comments begin with vertical bar. Each stop
|
---|
11 | | word is at the start of a line.
|
---|
12 |
|
---|
13 | | This is a ranked list (commonest to rarest) of stopwords derived from
|
---|
14 | | a large text sample.
|
---|
15 |
|
---|
16 | | Swedish stop words occasionally exhibit homonym clashes. For example
|
---|
17 | | så = so, but also seed. These are indicated clearly below.
|
---|
18 |
|
---|
19 | och | and
|
---|
20 | det | it, this/that
|
---|
21 | att | to (with infinitive)
|
---|
22 | i | in, at
|
---|
23 | en | a
|
---|
24 | jag | I
|
---|
25 | hon | she
|
---|
26 | som | who, that
|
---|
27 | han | he
|
---|
28 | på | on
|
---|
29 | den | it, this/that
|
---|
30 | med | with
|
---|
31 | var | where, each
|
---|
32 | sig | him(self) etc
|
---|
33 | för | for
|
---|
34 | så | so (also: seed)
|
---|
35 | till | to
|
---|
36 | är | is
|
---|
37 | men | but
|
---|
38 | ett | a
|
---|
39 | om | if; around, about
|
---|
40 | hade | had
|
---|
41 | de | they, these/those
|
---|
42 | av | of
|
---|
43 | icke | not, no
|
---|
44 | mig | me
|
---|
45 | du | you
|
---|
46 | henne | her
|
---|
47 | då | then, when
|
---|
48 | sin | his
|
---|
49 | nu | now
|
---|
50 | har | have
|
---|
51 | inte | inte någon = no one
|
---|
52 | hans | his
|
---|
53 | honom | him
|
---|
54 | skulle | 'sake'
|
---|
55 | hennes | her
|
---|
56 | där | there
|
---|
57 | min | my
|
---|
58 | man | one (pronoun)
|
---|
59 | ej | nor
|
---|
60 | vid | at, by, on (also: vast)
|
---|
61 | kunde | could
|
---|
62 | något | some etc
|
---|
63 | från | from, off
|
---|
64 | ut | out
|
---|
65 | när | when
|
---|
66 | efter | after, behind
|
---|
67 | upp | up
|
---|
68 | vi | we
|
---|
69 | dem | them
|
---|
70 | vara | be
|
---|
71 | vad | what
|
---|
72 | över | over
|
---|
73 | än | than
|
---|
74 | dig | you
|
---|
75 | kan | can
|
---|
76 | sina | his
|
---|
77 | här | here
|
---|
78 | ha | have
|
---|
79 | mot | towards
|
---|
80 | alla | all
|
---|
81 | under | under (also: wonder)
|
---|
82 | någon | some etc
|
---|
83 | eller | or (else)
|
---|
84 | allt | all
|
---|
85 | mycket | much
|
---|
86 | sedan | since
|
---|
87 | ju | why
|
---|
88 | denna | this/that
|
---|
89 | själv | myself, yourself etc
|
---|
90 | detta | this/that
|
---|
91 | åt | to
|
---|
92 | utan | without
|
---|
93 | varit | was
|
---|
94 | hur | how
|
---|
95 | ingen | no
|
---|
96 | mitt | my
|
---|
97 | ni | you
|
---|
98 | bli | to be, become
|
---|
99 | blev | from bli
|
---|
100 | oss | us
|
---|
101 | din | thy
|
---|
102 | dessa | these/those
|
---|
103 | några | some etc
|
---|
104 | deras | their
|
---|
105 | blir | from bli
|
---|
106 | mina | my
|
---|
107 | samma | (the) same
|
---|
108 | vilken | who, that
|
---|
109 | er | you, your
|
---|
110 | sådan | such a
|
---|
111 | vår | our
|
---|
112 | blivit | from bli
|
---|
113 | dess | its
|
---|
114 | inom | within
|
---|
115 | mellan | between
|
---|
116 | sådant | such a
|
---|
117 | varför | why
|
---|
118 | varje | each
|
---|
119 | vilka | who, that
|
---|
120 | ditt | thy
|
---|
121 | vem | who
|
---|
122 | vilket | who, that
|
---|
123 | sitta | his
|
---|
124 | sådana | such a
|
---|
125 | vart | each
|
---|
126 | dina | thy
|
---|
127 | vars | whose
|
---|
128 | vårt | our
|
---|
129 | våra | our
|
---|
130 | ert | your
|
---|
131 | era | your
|
---|
132 | vilkas | whose
|
---|
133 |
|
---|
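The file header describes the format: comments begin with a vertical bar, and each stop word is at the start of a line. A minimal sketch of reading that format in Python follows. The function names `parse_snowball_stopwords` and `remove_stopwords` are hypothetical, the sample lines are a short excerpt of the list above, and splitting the non-comment part on whitespace is an assumption made to tolerate padding and more than one word per line. It is not the loader that StopFilterFactory itself uses.

```python
from typing import Iterable, List, Set


def parse_snowball_stopwords(lines: Iterable[str]) -> Set[str]:
    """Collect stop words from lines written in the format described above."""
    words: Set[str] = set()
    for line in lines:
        # Everything from the first vertical bar onward is a comment.
        entry = line.split("|", 1)[0].strip()
        # Assumption: split on whitespace so padded or multi-word lines still work.
        # Blank and comment-only lines yield an empty list and add nothing.
        words.update(entry.split())
    return words


def remove_stopwords(tokens: Iterable[str], stopwords: Set[str]) -> List[str]:
    """Drop tokens whose lowercase form is in the stop word set."""
    return [t for t in tokens if t.lower() not in stopwords]


# Short excerpt of the list, including a comment-only line and a blank line.
sample = [
    " | A Swedish stop word list. Comments begin with vertical bar.",
    "",
    "och            | and",
    "det            | it, this/that",
    "så             | so (also: seed)",
]

stopwords = parse_snowball_stopwords(sample)
filtered = remove_stopwords(["Och", "det", "regnar"], stopwords)
print(sorted(stopwords))
print(filtered)
```

When reading the real file from disk, open it with `encoding="utf-8"`, since the notice at the top states that the encoding was converted to UTF-8.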