source: gs3-extensions/solr/trunk/src/conf/lang/stopwords_pt.txt@ 29135

Last change on this file since 29135 was 29135, checked in by ak19, 10 years ago

Part of port from lucene3.3.0 to lucene4.7.2. Solr related. conf and lib folders for solr4.7.2.

File size: 4.5 KB
Line 
1 | From svn.tartarus.org/snowball/trunk/website/algorithms/portuguese/stop.txt
2 | This file is distributed under the BSD License.
3 | See http://snowball.tartarus.org/license.php
4 | Also see http://www.opensource.org/licenses/bsd-license.html
5 | - Encoding was converted to UTF-8.
6 | - This notice was added.
7 |
8 | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
9
10 | A Portuguese stop word list. Comments begin with vertical bar. Each stop
11 | word is at the start of a line.
12
13
14 | The following is a ranked list (commonest to rarest) of stopwords
15 | deriving from a large sample of text.
16
17 | Extra words have been added at the end.
18
19de | of, from
20a | the; to, at; her
21o | the; him
22que | who, that
23e | and
24do | de + o
25da | de + a
26em | in
27um | a
28para | for
29 | é from SER
30com | with
31não | not, no
32uma | a
33os | the; them
34no | em + o
35se | himself etc
36na | em + a
37por | for
38mais | more
39as | the; them
40dos | de + os
41como | as, like
42mas | but
43 | foi from SER
44ao | a + o
45ele | he
46das | de + as
47 | tem from TER
48à | a + a
49seu | his
50sua | her
51ou | or
52 | ser from SER
53quando | when
54muito | much
55 | há from HAV
56nos | em + os; us
57já | already, now
58 | está from EST
59eu | I
60também | also
61só | only, just
62pelo | per + o
63pela | per + a
64até | up to
65isso | that
66ela | he
67entre | between
68 | era from SER
69depois | after
70sem | without
71mesmo | same
72aos | a + os
73 | ter from TER
74seus | his
75quem | whom
76nas | em + as
77me | me
78esse | that
79eles | they
80 | estão from EST
81você | you
82 | tinha from TER
83 | foram from SER
84essa | that
85num | em + um
86nem | nor
87suas | her
88meu | my
89às | a + as
90minha | my
91 | têm from TER
92numa | em + uma
93pelos | per + os
94elas | they
95 | havia from HAV
96 | seja from SER
97qual | which
98 | será from SER
99nós | we
100 | tenho from TER
101lhe | to him, her
102deles | of them
103essas | those
104esses | those
105pelas | per + as
106este | this
107 | fosse from SER
108dele | of him
109
110 | other words. There are many contractions such as naquele = em+aquele,
111 | mo = me+o, but they are rare.
112 | Indefinite article plural forms are also rare.
113
114tu | thou
115te | thee
116vocês | you (plural)
117vos | you
118lhes | to them
119meus | my
120minhas
121teu | thy
122tua
123teus
124tuas
125nosso | our
126nossa
127nossos
128nossas
129
130dela | of her
131delas | of them
132
133esta | this
134estes | these
135estas | these
136aquele | that
137aquela | that
138aqueles | those
139aquelas | those
140isto | this
141aquilo | that
142
143 | forms of estar, to be (not including the infinitive):
144estou
145está
146estamos
147estão
148estive
149esteve
150estivemos
151estiveram
152estava
153estávamos
154estavam
155estivera
156estivéramos
157esteja
158estejamos
159estejam
160estivesse
161estivéssemos
162estivessem
163estiver
164estivermos
165estiverem
166
167 | forms of haver, to have (not including the infinitive):
168hei
169há
170havemos
171hão
172houve
173houvemos
174houveram
175houvera
176houvéramos
177haja
178hajamos
179hajam
180houvesse
181houvéssemos
182houvessem
183houver
184houvermos
185houverem
186houverei
187houverá
188houveremos
189houverão
190houveria
191houveríamos
192houveriam
193
194 | forms of ser, to be (not including the infinitive):
195sou
196somos
197são
198era
199éramos
200eram
201fui
202foi
203fomos
204foram
205fora
206fÃŽramos
207seja
208sejamos
209sejam
210fosse
211fÃŽssemos
212fossem
213for
214formos
215forem
216serei
217será
218seremos
219serão
220seria
221seríamos
222seriam
223
224 | forms of ter, to have (not including the infinitive):
225tenho
226tem
227temos
228tém
229tinha
230tínhamos
231tinham
232tive
233teve
234tivemos
235tiveram
236tivera
237tivéramos
238tenha
239tenhamos
240tenham
241tivesse
242tivéssemos
243tivessem
244tiver
245tivermos
246tiverem
247terei
248terá
249teremos
250terão
251teria
252teríamos
253teriam
Note: See TracBrowser for help on using the repository browser.