source: main/trunk/model-sites-dev/atea/collect/digital-nz/etc/conf/lang/stopwords_ckb.txt@ 33166

Last change on this file since 33166 was 33166, checked in by davidb, 5 years ago

Collection config files and initial programming work for atea collections

File size: 1.7 KB
Line 
1# set of kurdish stopwords
2# note these have been normalized with our scheme (e represented with U+06D5, etc)
3# constructed from:
4# * Fig 5 of "Building A Test Collection For Sorani Kurdish" (Esmaili et al)
5# * "Sorani Kurdish: A Reference Grammar with selected readings" (Thackston)
6# * Corpus-based analysis of 77M word Sorani collection: wikipedia, news, blogs, etc
7
8# and
9و
10# which
11کە
12# of
13ی
14# made/did
15کرد
16# that/which
17؊ەوەی
18# on/head
19سەر
20# two
21دوو
22# also
23هەروەها
24# from/that
25لەو
26# makes/does
27دەکات
28# some
29چەند
30# every
31هەر
32
33# demonstratives
34# that
35ØŠÛ•Ùˆ
36# this
37ØŠÛ•Ù…
38
39
40# personal pronouns
41# I
42م
43ن
44# we
45ØŠÛŽÙ…
46ە
47# you
48تۆ
49# you
50ØŠÛŽÙˆÛ•
51# he/she/it
52ØŠÛ•Ùˆ
53# they
54؊ەوان
55
56# prepositions
57# to/with/by
58ØšÛ•
59ٟێ
60# without
61ØšÛ•ØšÛŽ
62# along with/while/during
63ؚەدەم
64
65# in the opinion of
66ؚەلای
67# according to
68ؚەٟێی
69# before
70ؚەرلە
71# in the direction of
72ؚەرەوی
73# in front of/toward
74ؚەرەوە
75# before/in the face of
76ؚەردەم
77
78# without
79ؚێ
80# except for
81ؚێجگە
82# for
83ØšÛ†
84# on/in
85دە
86تێ
87# with
88دەگەڵ
89# after
90دوای
91# except for/aside from
92جگە
93# in/from
94لە
95لێ
96# in front of/before/because of
97لەؚەر
98# between/among
99لەؚەینی
100# concerning/about
101لەؚاؚەت
102# concerning
103لەؚارەی
104# instead of
105لەؚاتی
106# beside
107Ù„Û•ØšÙ†
108# instead of
109لەؚرێتی
110# behind
111لەدەم
112
113# with/together with
114لەگەڵ
115# by
116لەلایەن
117# within
118لەناو
119# between/among
120لەنێو
121# for the sake of
122لەٟێناوی
123# with respect to
124لەرەوی
125# by means of/for
126لەرێ
127# for the sake of
128لەرێگا
129# on/on top of/according to
130لەسەر
131# under
132لەژێر
133# between/among
134ناو
135# between/among
136نێوان
137# after
138ٟا؎
139# before
140ÙŸÛŽØŽ
141# like
142وەک
Note: See TracBrowser for help on using the repository browser.