1 | | From svn.tartarus.org/snowball/trunk/website/algorithms/dutch/stop.txt
|
---|
2 | | This file is distributed under the BSD License.
|
---|
3 | | See http://snowball.tartarus.org/license.php
|
---|
4 | | Also see http://www.opensource.org/licenses/bsd-license.html
|
---|
5 | | - Encoding was converted to UTF-8.
|
---|
6 | | - This notice was added.
|
---|
7 | |
|
---|
8 | | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
|
---|
9 |
|
---|
10 | | A Dutch stop word list. Comments begin with vertical bar. Each stop
|
---|
11 | | word is at the start of a line.
|
---|
12 |
|
---|
13 | | This is a ranked list (commonest to rarest) of stopwords derived from
|
---|
14 | | a large sample of Dutch text.
|
---|
15 |
|
---|
16 | | Dutch stop words frequently exhibit homonym clashes. These are indicated
|
---|
17 | | clearly below.
|
---|
18 |
|
---|
19 | de | the
|
---|
20 | en | and
|
---|
21 | van | of, from
|
---|
22 | ik | I, the ego
|
---|
23 | te | (1) chez, at etc, (2) to, (3) too
|
---|
24 | dat | that, which
|
---|
25 | die | that, those, who, which
|
---|
26 | in | in, inside
|
---|
27 | een | a, an, one
|
---|
28 | hij | he
|
---|
29 | het | the, it
|
---|
30 | niet | not, nothing, naught
|
---|
31 | zijn | (1) to be, being, (2) his, one's, its
|
---|
32 | is | is
|
---|
33 | was | (1) was, past tense of all persons sing. of 'zijn' (to be) (2) wax, (3) the washing, (4) rise of river
|
---|
34 | op | on, upon, at, in, up, used up
|
---|
35 | aan | on, upon, to (as dative)
|
---|
36 | met | with, by
|
---|
37 | als | like, such as, when
|
---|
38 | voor | (1) before, in front of, (2) furrow
|
---|
39 | had | had, past tense all persons sing. of 'hebben' (have)
|
---|
40 | er | there
|
---|
41 | maar | but, only
|
---|
42 | om | round, about, for etc
|
---|
43 | hem | him
|
---|
44 | dan | then
|
---|
45 | zou | should/would, past tense all persons sing. of 'zullen'
|
---|
46 | of | or, whether, if
|
---|
47 | wat | what, something, anything
|
---|
48 | mijn | possessive and noun 'mine'
|
---|
49 | men | people, 'one'
|
---|
50 | dit | this
|
---|
51 | zo | so, thus, in this way
|
---|
52 | door | through by
|
---|
53 | over | over, across
|
---|
54 | ze | she, her, they, them
|
---|
55 | zich | oneself
|
---|
56 | bij | (1) a bee, (2) by, near, at
|
---|
57 | ook | also, too
|
---|
58 | tot | till, until
|
---|
59 | je | you
|
---|
60 | mij | me
|
---|
61 | uit | out of, from
|
---|
62 | der | Old Dutch form of 'van der' still found in surnames
|
---|
63 | daar | (1) there, (2) because
|
---|
64 | haar | (1) her, their, them, (2) hair
|
---|
65 | naar | (1) unpleasant, unwell etc, (2) towards, (3) as
|
---|
66 | heb | present first person sing. of 'to have'
|
---|
67 | hoe | how, why
|
---|
68 | heeft | present third person sing. of 'to have'
|
---|
69 | hebben | 'to have' and various parts thereof
|
---|
70 | deze | this
|
---|
71 | u | you
|
---|
72 | want | (1) for, (2) mitten, (3) rigging
|
---|
73 | nog | yet, still
|
---|
74 | zal | 'shall', first and third person sing. of verb 'zullen' (will)
|
---|
75 | me | me
|
---|
76 | zij | she, they
|
---|
77 | nu | now
|
---|
78 | ge | 'thou', still used in Belgium and south Netherlands
|
---|
79 | geen | none
|
---|
80 | omdat | because
|
---|
81 | iets | something, somewhat
|
---|
82 | worden | to become, grow, get
|
---|
83 | toch | yet, still
|
---|
84 | al | all, every, each
|
---|
85 | waren | (1) 'were' (2) to wander, (3) wares, (3)
|
---|
86 | veel | much, many
|
---|
87 | meer | (1) more, (2) lake
|
---|
88 | doen | to do, to make
|
---|
89 | toen | then, when
|
---|
90 | moet | noun 'spot/mote' and present form of 'to must'
|
---|
91 | ben | (1) am, (2) 'are' in interrogative second person singular of 'to be'
|
---|
92 | zonder | without
|
---|
93 | kan | noun 'can' and present form of 'to be able'
|
---|
94 | hun | their, them
|
---|
95 | dus | so, consequently
|
---|
96 | alles | all, everything, anything
|
---|
97 | onder | under, beneath
|
---|
98 | ja | yes, of course
|
---|
99 | eens | once, one day
|
---|
100 | hier | here
|
---|
101 | wie | who
|
---|
102 | werd | imperfect third person sing. of 'become'
|
---|
103 | altijd | always
|
---|
104 | doch | yet, but etc
|
---|
105 | wordt | present third person sing. of 'become'
|
---|
106 | wezen | (1) to be, (2) 'been' as in 'been fishing', (3) orphans
|
---|
107 | kunnen | to be able
|
---|
108 | ons | us/our
|
---|
109 | zelf | self
|
---|
110 | tegen | against, towards, at
|
---|
111 | na | after, near
|
---|
112 | reeds | already
|
---|
113 | wil | (1) present tense of 'want', (2) 'will', noun, (3) fender
|
---|
114 | kon | could; past tense of 'to be able'
|
---|
115 | niets | nothing
|
---|
116 | uw | your
|
---|
117 | iemand | somebody
|
---|
118 | geweest | been; past participle of 'be'
|
---|
119 | andere | other
|
---|