source: gs3-extensions/solr/trunk/src/conf/lang/stopwords_ckb.txt@ 29135

Last change on this file since 29135 was 29135, checked in by ak19, 10 years ago

Part of port from lucene3.3.0 to lucene4.7.2. Solr related. conf and lib folders for solr4.7.2.

File size: 1.7 KB
Line 
1# set of kurdish stopwords
2# note these have been normalized with our scheme (e represented with U+06D5, etc)
3# constructed from:
4# * Fig 5 of "Building A Test Collection For Sorani Kurdish" (Esmaili et al)
5# * "Sorani Kurdish: A Reference Grammar with selected readings" (Thackston)
6# * Corpus-based analysis of 77M word Sorani collection: wikipedia, news, blogs, etc
7
8# and
9و
10# which
11کە
12# of
13ی
14# made/did
15کرد
16# that/which
17؊ەوەی
18# on/head
19سەر
20# two
21دوو
22# also
23هەروەها
24# from/that
25لەو
26# makes/does
27دەکات
28# some
29چەند
30# every
31هەر
32
33# demonstratives
34# that
35ØŠÛ•Ùˆ
36# this
37ØŠÛ•Ù…
38
39
40# personal pronouns
41# I
42م
43ن
44# we
45ØŠÛŽÙ…
46ە
47# you
48تۆ
49# you
50ØŠÛŽÙˆÛ•
51# he/she/it
52ØŠÛ•Ùˆ
53# they
54؊ەوان
55
56# prepositions
57# to/with/by
58ØšÛ•
59ٟێ
60# without
61ØšÛ•ØšÛŽ
62# along with/while/during
63ؚەدەم
64
65# in the opinion of
66ؚەلای
67# according to
68ؚەٟێی
69# before
70ؚەرلە
71# in the direction of
72ؚەرەوی
73# in front of/toward
74ؚەرەوە
75# before/in the face of
76ؚەردەم
77
78# without
79ؚێ
80# except for
81ؚێجگە
82# for
83ØšÛ†
84# on/in
85دە
86تێ
87# with
88دەگەڵ
89# after
90دوای
91# except for/aside from
92جگە
93# in/from
94لە
95لێ
96# in front of/before/because of
97لەؚەر
98# between/among
99لەؚەینی
100# concerning/about
101لەؚاؚەت
102# concerning
103لەؚارەی
104# instead of
105لەؚاتی
106# beside
107Ù„Û•ØšÙ†
108# instead of
109لەؚرێتی
110# behind
111لەدەم
112
113# with/together with
114لەگەڵ
115# by
116لەلایەن
117# within
118لەناو
119# between/among
120لەنێو
121# for the sake of
122لەٟێناوی
123# with respect to
124لەرەوی
125# by means of/for
126لەرێ
127# for the sake of
128لەرێگا
129# on/on top of/according to
130لەسەر
131# under
132لەژێر
133# between/among
134ناو
135# between/among
136نێوان
137# after
138ٟا؎
139# before
140ÙŸÛŽØŽ
141# like
142وەک
Note: See TracBrowser for help on using the repository browser.