root/main/trunk/model-sites-dev/atea/collect/digital-nz/etc/conf/lang/stopwords_nl.txt @ 33166

Revision 33166, 4.5 KB (checked in by davidb, 4 months ago)

Collection config files and initial programming work for atea collections

 | From svn.tartarus.org/snowball/trunk/website/algorithms/dutch/stop.txt
 | This file is distributed under the BSD License.
 | See http://snowball.tartarus.org/license.php
 | Also see http://www.opensource.org/licenses/bsd-license.html
 |  - Encoding was converted to UTF-8.
 |  - This notice was added.
 |
 | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"

 | A Dutch stop word list. Comments begin with a vertical bar. Each stop
 | word is at the start of a line.

 | This is a ranked list (commonest to rarest) of stopwords derived from
 | a large sample of Dutch text.

 | Dutch stop words frequently exhibit homonym clashes. These are indicated
 | clearly below.

de             |  the
en             |  and
van            |  of, from
ik             |  I, the ego
te             |  (1) chez, at etc, (2) to, (3) too
dat            |  that, which
die            |  that, those, who, which
in             |  in, inside
een            |  a, an, one
hij            |  he
het            |  the, it
niet           |  not, nothing, naught
zijn           |  (1) to be, being, (2) his, one's, its
is             |  is
was            |  (1) was, past tense of all persons sing. of 'zijn' (to be), (2) wax, (3) the washing, (4) rise of river
op             |  on, upon, at, in, up, used up
aan            |  on, upon, to (as dative)
met            |  with, by
als            |  like, such as, when
voor           |  (1) before, in front of, (2) furrow
had            |  had, past tense all persons sing. of 'hebben' (have)
er             |  there
maar           |  but, only
om             |  round, about, for etc
hem            |  him
dan            |  then
zou            |  should/would, past tense all persons sing. of 'zullen'
of             |  or, whether, if
wat            |  what, something, anything
mijn           |  possessive and noun 'mine'
men            |  people, 'one'
dit            |  this
zo             |  so, thus, in this way
door           |  through, by
over           |  over, across
ze             |  she, her, they, them
zich           |  oneself
bij            |  (1) a bee, (2) by, near, at
ook            |  also, too
tot            |  till, until
je             |  you
mij            |  me
uit            |  out of, from
der            |  Old Dutch form of 'van der' still found in surnames
daar           |  (1) there, (2) because
haar           |  (1) her, their, them, (2) hair
naar           |  (1) unpleasant, unwell etc, (2) towards, (3) as
heb            |  present first person sing. of 'to have'
hoe            |  how, why
heeft          |  present third person sing. of 'to have'
hebben         |  'to have' and various parts thereof
deze           |  this
u              |  you
want           |  (1) for, (2) mitten, (3) rigging
nog            |  yet, still
zal            |  'shall', first and third person sing. of verb 'zullen' (will)
me             |  me
zij            |  she, they
nu             |  now
ge             |  'thou', still used in Belgium and south Netherlands
geen           |  none
omdat          |  because
iets           |  something, somewhat
worden         |  to become, grow, get
toch           |  yet, still
al             |  all, every, each
waren          |  (1) 'were', (2) to wander, (3) wares
veel           |  much, many
meer           |  (1) more, (2) lake
doen           |  to do, to make
toen           |  then, when
moet           |  noun 'spot/mote' and present form of 'to must'
ben            |  (1) am, (2) 'are' in interrogative second person singular of 'to be'
zonder         |  without
kan            |  noun 'can' and present form of 'to be able'
hun            |  their, them
dus            |  so, consequently
alles          |  all, everything, anything
onder          |  under, beneath
ja             |  yes, of course
eens           |  once, one day
hier           |  here
wie            |  who
werd           |  imperfect third person sing. of 'become'
altijd         |  always
doch           |  yet, but etc
wordt          |  present third person sing. of 'become'
wezen          |  (1) to be, (2) 'been' as in 'been fishing', (3) orphans
kunnen         |  to be able
ons            |  us/our
zelf           |  self
tegen          |  against, towards, at
na             |  after, near
reeds          |  already
wil            |  (1) present tense of 'want', (2) 'will', noun, (3) fender
kon            |  could; past tense of 'to be able'
niets          |  nothing
uw             |  your
iemand         |  somebody
geweest        |  been; past participle of 'be'
andere         |  other
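
The header of the file describes its format: a vertical bar starts a comment, each stop word sits at the start of a line, and the list is ranked from commonest to rarest. A minimal Python sketch of a reader for that format follows. The function name `parse_snowball_stopwords` and its `limit` parameter are hypothetical, introduced only for illustration, and accepting several whitespace-separated words on one line is an assumption (this file has one word per line). It is not the loader that StopFilterFactory itself uses.

```python
def parse_snowball_stopwords(text, limit=None):
    """Return the stop words found in Snowball-format text, in file order.

    Everything from the first '|' to the end of a line is treated as a
    comment; whatever precedes it is split on whitespace into words.
    """
    words = []
    for line in text.splitlines():
        # Keep only the part before the comment marker, if there is one.
        entry = line.split("|", 1)[0]
        # Blank and comment-only lines contribute nothing here.
        words.extend(entry.split())
    # The list is ranked commonest to rarest, so a prefix of it keeps
    # the most frequent words.
    return words if limit is None else words[:limit]


sample = """\
 | A Dutch stop word list.

de             |  the
en             |  and
van            |  of, from
"""
print(parse_snowball_stopwords(sample))           # ['de', 'en', 'van']
print(parse_snowball_stopwords(sample, limit=2))  # ['de', 'en']
```

Because the ranking is preserved, passing a small `limit` yields a shorter list that still covers the most frequent words, which may be useful when a word such as 'haar' or 'meer' should stay searchable in its noun sense.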