1 | my specific todo list. in no particular order. smallish jobs.
|
---|
2 |
|
---|
3 | * nzdl.org mirror presentation - from ian
|
---|
4 | replace macros
|
---|
5 | horizontal list -> first item should be selected
|
---|
6 | expand text, detach page buttons
|
---|
7 | remove unneccesary blank lines
|
---|
8 | how to in hdl - the entries are double spaced and also vertical lists
|
---|
9 | remove \n in config files
|
---|
10 | the image under the about/search etc titles is funny under macs
|
---|
11 | change font if language english/european??
|
---|
12 | rollover background for buttons and nav bar
|
---|
13 |
|
---|
14 | * get computer ggraphic design people to give input?? dave N matt J, polly cantlon ??
|
---|
15 |
|
---|
16 | * self documenting collections?? eg provides links to the source of the class files , the xslt , the config file etc everything needed to generate teh coll/page
|
---|
17 |
|
---|
18 | * UPDATE DOCUMENTATION!!!!
|
---|
19 |
|
---|
20 | * text on nzdl query page to resource bundle
|
---|
21 |
|
---|
22 | * collection getMetadata service
|
---|
23 |
|
---|
24 | * rethink how we determine docType - current method doesn't work for phind cos it only gets doc ids - it should come from the document if not specified.
|
---|
25 |
|
---|
26 | * niupepa display - abstract and commentary links. new action? or just new xslt?
|
---|
27 |
|
---|
28 | * is the 'to' att compulsory? do we have to have to=""?
|
---|
29 |
|
---|
30 | * query info - eg num docs found from a search, query term info -
|
---|
31 | is this called metadata for the query and returned in a metadataList? or
|
---|
32 | should it have another name?
|
---|
33 |
|
---|
34 | <response>
|
---|
35 | <metadataList>
|
---|
36 | <metadata name="numDocs">55</metadata>
|
---|
37 | <content>
|
---|
38 | <document name="HASHxxx"/>
|
---|
39 | ...
|
---|
40 | </content>
|
---|
41 | </response>
|
---|
42 |
|
---|
43 | ????
|
---|
44 |
|
---|
45 | * format statements - need to add If and Or. do we test only on metadata? <gsf:if test="Title"> and <gsf:if test="Title='snail farming'> or more sophisticated?
|
---|
46 | also need to fix the bit where <a href='[URL]'> gets turned into <a href='<gsf:metadata name='URL'/> - needs to be added as an attribute.
|
---|
47 |
|
---|
48 | * format statements for doc display
|
---|
49 |
|
---|
50 | * including named templates - need to replace the ones they overwrite - not allowed two templates with teh same name. eg docHeading, docContent. or should they be match=document mode=heading/content ??
|
---|
51 |
|
---|
52 | * coll specific xslt/actions?? how get these for remote colls?
|
---|
53 |
|
---|
54 | * process messages - do we still have pid and handle atts? make them the same, and both atts instead of params??
|
---|
55 |
|
---|
56 | * equivTermList contains terms
|
---|
57 |
|
---|
58 | * page link style paging eg 1, 2, 3, .... can have links to all, links to same eg 5 either side of current page. 1,2,3...24,25,26,27,28...231
|
---|
59 |
|
---|
60 | * multi query - finish.
|
---|
61 | 1. check if coll specified.
|
---|
62 | 2. check if ccs coll specified.
|
---|
63 | 3. get background info (description and format) for current coll - or all colls?? will we format stuff differently for different colls? have templates based on collection??
|
---|
64 | 4. do the query over all colls.
|
---|
65 | 5. get metadata for each result from appropriate coll.
|
---|
66 | 6. merge the results - append initially. sort by rank? this means we need to return the rank.
|
---|
67 |
|
---|
68 | * metadata retrieve determination - use an action helper? separate class? should look at xslt as well as format statements?
|
---|
69 |
|
---|
70 | * what if have two services with teh same name eg two phind applets
|
---|
71 |
|
---|
72 | * phind applet not working at the moment. why??
|
---|
73 |
|
---|
74 | * tei building stuff into gsdl3 build code.
|
---|
75 |
|
---|
76 | * subcollection/language handling for gs2 collections
|
---|
77 |
|
---|
78 | * datelists formatted nicely with years at top and months down the side.
|
---|
79 |
|
---|
80 | * on/off format stuff in config file eg query term highlighting, display cover image and toc, offer enriching services to user, buttons,
|
---|
81 |
|
---|
82 | * different templates for different positions in the hierarchy??
|
---|
83 | X
|
---|
84 | ---
|
---|
85 | ~~~~
|
---|
86 | ~~~~
|
---|
87 | ---
|
---|
88 | ~~~~
|
---|
89 | ~~~~
|
---|
90 | X
|
---|
91 | ---
|
---|
92 |
|
---|
93 | etc can we test for that? gs2 tests for [parent:Title]
|
---|
94 |
|
---|
95 | * global resource bundles, not class named ones. getGlobalText? getTExt(key, bundle name)?
|
---|
96 |
|
---|
97 | * fix errors in javadoc make
|
---|
98 |
|
---|
99 | * send back error messages to user? rather than getting null pointer exceptions. eg if ask for non-existant metadata. also invalid syntax info?
|
---|
100 |
|
---|
101 | * tidy up phind error messages
|
---|
102 |
|
---|
103 | * check text strings in code and xslt - put all into resource bundles
|
---|
104 |
|
---|
105 | * finalise the nzdl interface, and write script to do teh translations from the macro files.
|
---|
106 |
|
---|
107 | * query results: 7 out of 10 matches
|
---|
108 |
|
---|
109 | * bibliographic display??
|
---|
110 |
|
---|
111 | * look at what assumptions OID makes about ids - need it to be general not just working for hash ids.
|
---|
112 |
|
---|
113 | * specific tool tips for images. alt/title. eg this file is 380Mb.
|
---|
114 |
|
---|
115 | * nzdl prefs page - where do the defaults come from? do we need a full cgiargs class? with default value? nzdl, we already know what query options there are so dont need to get them from the service, so dont have displayinfo or default.
|
---|
116 |
|
---|
117 | * proper query term highlighting. phrases. fast.
|
---|
118 |
|
---|
119 | * system requests shouls also return status id not just a message. like the build stuff.
|
---|
120 |
|
---|
121 | * should preferences be carried over between collections? currently they are.
|
---|
122 |
|
---|
123 | * xslt assumes that something is saving the query args. is this ok?
|
---|
124 |
|
---|
125 | * nzdl query mode - some/all, boolean/ranked, ranked/natural. result in different query parsing. new queryaction??
|
---|
126 |
|
---|
127 | * should actions have access to config? currently only applet actionactually uses it. - used for filling in library - should receptionist do this instead?
|
---|
128 |
|
---|
129 | * should actions tell recepitonist if they need metadata? should receptionist pass in a list of metadata that the xslt requires?
|
---|
130 |
|
---|
131 | * polling for collections? delete and start again? check dates and only reload new ones?
|
---|
132 |
|
---|
133 | * caching. LInkedHashMap. overwrite the removeEldestEntry bits.
|
---|
134 |
|
---|
135 | * macros in coll description and titles. eg this collection was built _date_ days ago and contains _numdocs_ documents.
|
---|
136 | niupepa titles "_vol_1, _num_ 2" _commentary_
|
---|
137 |
|
---|
138 | *dynamic query content as well as dynamic query params?
|
---|
139 |
|
---|
140 | *communicators: 2 modes. 1 relays messages eg to a remote service
|
---|
141 | 2. site to site - needs to modify addresses and names.
|
---|
142 |
|
---|
143 | * make sure mgpp colls work with new config file stuff
|
---|
144 |
|
---|
145 | * presentation prefs - how do we decide on the list of langs? from interface config file? how do we use encoding?
|
---|
146 |
|
---|
147 | * mg/mgpp errors should not crash the servlet.
|
---|
148 |
|
---|
149 | * some stuff for actions can go in config files. metadata determination, query term highlighting, enrich services. result pre or post processing. is there a generic way?
|
---|
150 |
|
---|
151 | * do we want a 'link to external page' action that displays the page but with teh greenstone header? a=p&sa=external?? also what about displaying whole cached html pages as frames?? gsdl uses a nav frame (a=p&p=nav) and then a second frame which is the html page.
|
---|
152 |
|
---|
153 | * doc ranking
|
---|
154 |
|
---|
155 | * generic multi query. a=q&c=demo&ccs=fao,... use a base coll??
|
---|
156 |
|
---|
157 | * what to do when config file says : {If}{[parent:Title],[parent:Title],[Title]}. maybe we need to do root_or_self type thing?? or have some way of testing.
|
---|
158 |
|
---|
159 | * stuff from talking to ian - I am sure that I have typed this in already - where is it????
|
---|
160 |
|
---|
161 | macros in gdbm db - turn them into real text.
|
---|
162 | horizontal list - first item should be selected
|
---|
163 | expand text, detach page etc buttons.
|
---|
164 | remove extraneous blank lines eg under nav bar
|
---|
165 | how to in hdl is double spaced - shouldn't be
|
---|
166 | and vertical lists appear to have more space
|
---|
167 | remove \n from config files
|
---|
168 | on the mac, about image looks weird - the green bar is on teh wrong side
|
---|
169 | change font if language is english or european?? san serif font
|
---|
170 | rollover background or colour on nav bar and buttons
|
---|
171 |
|
---|
172 | get meeting with computer design person, dave N, matt J? Polly cantlon?? colours, any quick hints?
|
---|
173 |
|
---|
174 | self documenting collections: eg links to config file, service code, xslt files,automatically generate but also can provide manually if wish.
|
---|
175 |
|
---|
176 | cstr browse format statement.
|
---|
177 | turn off cover images
|
---|
178 |
|
---|
179 | Release: present existing gs2 colls
|
---|
180 | skeleton build for eg text and html.
|
---|
181 | documentation
|
---|
182 | scrap the collector?? ask the group
|
---|
183 | release README: how to build a collection - gs2+convert? basic gs3?
|
---|
184 | EASY installation
|
---|
185 | consolidate and document
|
---|
186 | adhoc testing
|
---|
187 | translation of resource bundles
|
---|
188 |
|
---|
189 | help page: type in gsf:format statemenets and it shows you xslt? or html?
|
---|
190 |
|
---|
191 | test protocol - sample thing that could generate every message??
|
---|
192 |
|
---|
193 |
|
---|
194 |
|
---|
195 | General todo list. in no particular order. large jobs.
|
---|
196 |
|
---|
197 | * gs3 release - basic stuff, translations, installation, documentation. oct 31? gs2 building only, both default and nzdl interfaces? gatherer?
|
---|
198 |
|
---|
199 | * generic resource retrieval - associated documents, src docs eg pdf/word, niupepa image files etc. if http accessible, put a link otherwise use a general resource retrieve?
|
---|
200 |
|
---|
201 | * collector collection building - do we want to finish this? needs addDocument and configureCollection services.
|
---|
202 |
|
---|
203 | The addDocument service (in GS2Construct) has not been implemented. It should take a file name and add the document to the import directory of teh collection.
|
---|
204 | there are problems with just transmitting a file name - the service may live remotely and therefore the document is not there. You should probably send it attached to the html - therefore need to work out:
|
---|
205 | how to get the document attached to the form (theres something to do with post and encoding = multipart ?? otherwise the browser just sends the filename),
|
---|
206 | and then where to get it in the servlet - is it a parameter? or something else?
|
---|
207 |
|
---|
208 | and then it needs to be added into the xml request to be passed to the service.
|
---|
209 |
|
---|
210 | If the program is running locally its much simpler just to send a filename - can we somehow check for this?
|
---|
211 |
|
---|
212 | * configure Collection for building
|
---|
213 |
|
---|
214 | also to do with building, a little harder, is the ConfigureCollection service. There is no stub for it yet, but easy enough to add one. - need to add this service to the service description xml stuff, and write a processConfigureColleciton() method. It would be easy enough to display the config file in a big text box, and have the user edit it like the collector does.
|
---|
215 |
|
---|
216 | the hard bit is that when you click ConfigureCollection, you dont know what collection you are going to be dealing with - all the building services, you select the collection on the service page. with the configure stuff, you need to select the collection, and then the config file needs to be retrieved. so its really a two step process to configure the coll - first select the coll, submit that, then edit the config file, and submit that.
|
---|
217 |
|
---|
218 | All the services currently are one step - need to think about how this type of service fits into the model.
|
---|
219 |
|
---|
220 | maybe it needs a hidden arg? - to tell teh service if you're at stage 1 or 2?
|
---|
221 | when teh action does the request, it then asks for the service description again to redisplay it for the user. maybe if the service knows that it has done the first half, it sends the second type of description?
|
---|
222 |
|
---|
223 | also do we use the collect.cfg file or the collectionConfig.xml file to show the user?
|
---|
224 |
|
---|
225 | * sequence of services
|
---|
226 |
|
---|
227 | some service clusters have services that you are supposed to carry out in sequence such as building, but there may be others.
|
---|
228 |
|
---|
229 | can we do a generic action or xslt or something that sends teh user to teh next service once they've completed the first one?
|
---|
230 |
|
---|
231 | Maybe the service cluster/serviceRack class specifies teh sequence of services, and they are all handled individually like present except that some xslt puts a next button on each page with a link to teh next service in the list.
|
---|
232 |
|
---|
233 | * proper install package
|
---|
234 |
|
---|
235 | * currently nothing is cached - service decriptions, text strings etc could all be cached. (cgi args are cached by tomcat)
|
---|
236 |
|
---|
237 | * document structure - new greenstone archive format? TEI, XHTML, OO, GML compatible? parallel document structures?
|
---|
238 | - associated resources/documents - xlinks
|
---|
239 |
|
---|
240 | * greenstone 3 building - is building going to change for gs3, or will we continue to use the perl scripts?
|
---|
241 | - import, build, activate
|
---|
242 | - info extraction, augmentation
|
---|
243 | - incremental update?
|
---|
244 | - modular? xml pipeline?
|
---|
245 |
|
---|
246 | * additional services - music search
|
---|
247 | - keyphrase stuff
|
---|
248 | - search history
|
---|
249 |
|
---|
250 | * all the gs2 admin/security stuff - user management, authentication
|
---|
251 |
|
---|
252 | * server side threading issues - is it thread safe?
|
---|
253 |
|
---|
254 | * usage logs
|
---|
255 |
|
---|
256 | * better error handling
|
---|
257 |
|
---|
258 | * document version control
|
---|
259 |
|
---|
260 | * combining requests to MR - results from first one becoming content for second one??
|
---|
261 |
|
---|
262 | * language translation stuff - when translating, we want to click on a text piece or macro and go to a sample page with that text string in it. can we do this?
|
---|
263 |
|
---|
264 | can we tell in gs3 where bits will be in pages?
|
---|
265 | I think we could have a new type of request, that provides a page containing all teh text strings used by a service/agent. - would it be class specific or service specific?
|
---|
266 | eg it would contain the service form, some sample results, and if its a process type of thing, all teh possible error and success messages.
|
---|
267 |
|
---|
268 | what about the interface bits?
|
---|
269 |
|
---|
270 | * are message formats fixed? can we create a DTD? or will they evolve too much over time?
|
---|
271 |
|
---|
272 | * sitewide service agents - may want to specify that only some of the services that could potentially be provided by an agent are actually provided.
|
---|
273 |
|
---|
274 | * should actions respond to describe requests? and what would they say if they did? are they agents and part of teh system, or somehow outside the system?
|
---|
275 |
|
---|
276 | * what about soap? is there a new version of that?
|
---|
277 |
|
---|
278 | * combined query services - eg one page with two service forms on there eg text query and music query - need to combine the results. eg teh action could carry out both queries then use xslt to combine the results.
|
---|
279 |
|
---|
280 | or do the same thing across title and creator indexes in mg to approximate fielded searching?
|
---|
281 |
|
---|
282 | true cross collection search would probably need a new service??
|
---|
283 |
|
---|
284 | * more generally, combining services in general, action and service level
|
---|
285 |
|
---|
286 | * action helpers: query term highlighting, page transformation and metadata determination (remove from receptionist?)
|
---|
287 | * if leave an applet page you lose all teh previous info eg for the status display for importing. need a function to retrieve again all the previous messages to redisplay them.
|
---|
288 |
|
---|
289 | * can the xslt dynamically retrieve the metadata it needs for say query results??
|
---|
290 |
|
---|
291 | * CSS instead of tables? - use html lists than use css to format them.
|
---|
292 |
|
---|
293 | * cross coll search - if colls are built the same, just present the form and search both. need to think about ranking.
|
---|
294 | if colls are not built the same, and have indexes, subcolls, langs - do we merge the lists? and only search those that have the right ones? what if two indexes have different meanings and display text but the same id? are they treated as same index? which name do you display?
|
---|
295 | what about colls with different levels eg oneonly has document, the other doc and section. what do you display? field lists? just combine?
|
---|
296 |
|
---|
297 | * search history
|
---|
298 |
|
---|
299 | * use objects instead of applets? java provides a conversion tool. NPX_PLUGIN_PATH=
|
---|
300 |
|
---|
301 | * danas combined search and browse stuff
|
---|
302 |
|
---|
303 | * can xslt be used to generate requests as well as transform teh output?
|
---|
304 |
|
---|
305 | * kea.
|
---|
306 |
|
---|