Python-based code for analysing the quality of metadata provided by
academic publishing sources such as CrossRef.

Quick, rough guide:

#----
1. Download the CrossRef Dataset:
#----

The CrossRef dataset is available via AcademicTorrents:

https://academictorrents.com/details/d9e554f4f0c3047d9f49e448a7004f7aa1701b69

At the time of writing, the provided dataset page doesn't link forward to
newer releases. To keep an eye out for new releases, search Google for
"Public Data File from Crossref":

https://www.google.com/search?channel=fs&client=ubuntu&q=+Public+Data+File+from+Crossref+

Transmission is a Torrent client available on Ubuntu Linux machines in
CMS, which can be used to download the JSON files. If you are
looking for a non-admin way to run a Torrent client, then it
is possible to download Vuze for Linux from:

https://www.vuze.com/download.php

(downloads as VuzeInstaller.tar.bz2)

From that download, you can untar it, and then run ./vuze

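Once the torrent has finished, it can be handy to take a quick look at
the downloaded JSON files before loading them anywhere. The following is
a rough sketch only, not part of this project's code. It assumes the
files end in .json or .json.gz and that each file holds its records in a
top-level "items" list; check a file by hand to confirm the layout of the
release you downloaded. The function name iter_records is made up for
illustration.

```python
import gzip
import json
from pathlib import Path


def iter_records(dataset_dir):
    """Yield one metadata record (a dict) at a time from the downloaded files."""
    for path in sorted(Path(dataset_dir).rglob("*.json*")):
        # Handle both plain .json and gzip-compressed .json.gz files.
        if path.name.endswith(".json.gz"):
            opener = gzip.open
        elif path.name.endswith(".json"):
            opener = open
        else:
            continue
        with opener(path, "rt", encoding="utf-8") as handle:
            data = json.load(handle)
        # ASSUMPTION: records sit in a top-level "items" list; a bare list
        # at the top level is also accepted.
        records = data.get("items", []) if isinstance(data, dict) else data
        for record in records:
            yield record
```

For example, sum(1 for _ in iter_records("crossref-data")) would count the
records, where crossref-data stands in for wherever the torrent client
saved the files.
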
#----
2. Set up a MongoDB Server
#----

Get going with a MongoDB server, for example:

svn co https://svn.greenstone.org/gs3-extensions/mongodb/trunk mongodb

Then follow the instructions in mongodb/README.txt

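To confirm that something is actually listening once the server has been
started, a small check along these lines may help. It is a sketch using
only the Python standard library, and it assumes the server runs on
localhost with MongoDB's default port 27017. The helper name is_port_open
is made up for illustration. Note it only shows that the port accepts TCP
connections, not that the process behind it is MongoDB.

```python
import socket


def is_port_open(host="localhost", port=27017, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unknown hosts.
        return False
```
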
--
Additional Notes:
--

Studio 3T is a GUI client for MongoDB. It can be downloaded from:

https://studio3t.com/

It can be run with a free open source license (formerly Robo 3T), but
since the move to Studio 3T the developers start you off in the trial
Pro/Ultimate version, so there are a few hoops to jump through to get
to the open source version.

You need to sign up for an account as part of the installation process
(you can use Google Sign-in, which simplifies things), and then back in
the GUI you can change its configuration settings straightaway to be
the Free 3T version.

In the GUI, set up/open a connection for:
mongodb://localhost:27017

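The same local server can also be reached from Python. The sketch below
is an illustration only: it assumes the pymongo package is installed
(this README does not say whether requirements.txt includes it) and uses
the standard MongoDB connection string form mongodb://host:port. The
function names mongo_uri and list_databases are made up for this example.

```python
def mongo_uri(host="localhost", port=27017):
    """Build a MongoDB connection string for a server without authentication."""
    return f"mongodb://{host}:{port}"


def list_databases(uri=None, timeout_ms=2000):
    """Connect to MongoDB and return the names of its databases."""
    # ASSUMPTION: pymongo is installed (pip install pymongo). It is imported
    # here so that this module still loads when pymongo is absent.
    from pymongo import MongoClient

    client = MongoClient(uri or mongo_uri(), serverSelectionTimeoutMS=timeout_ms)
    try:
        # "ping" raises an exception if no server can be reached in time.
        client.admin.command("ping")
        return client.list_database_names()
    finally:
        client.close()
```

Calling list_databases() with the server running should return a list of
database names; with no server it raises after roughly two seconds
instead of hanging.
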
#----
3. Working with the Python code
#----

Create your own Python virtual environment, for example:

python3 -m venv my-python3
source my-python3/bin/activate

pip install wheel

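If you are unsure whether the virtual environment is active, Python can
tell you. This is a small sketch using only the standard library; the
function name in_virtualenv is made up for illustration.

```python
import sys


def in_virtualenv():
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the environment directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix
```

After "source my-python3/bin/activate", calling in_virtualenv() from the
python3 on your PATH should return True.
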
To get going with the Python code itself:

cd py
pip install -r requirements.txt

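To double-check that the install worked, the packages named in
requirements.txt can be compared against what is installed in the
current environment. This is a rough sketch, assuming the file contains
plain "name" or "name==version" style lines; it does not try to handle
URLs or editable installs, and it does not check versions. The function
name missing_requirements is made up for illustration.

```python
import re
from importlib import metadata


def missing_requirements(requirements_path="requirements.txt"):
    """Return names listed in the requirements file that are not installed."""
    missing = []
    with open(requirements_path, encoding="utf-8") as handle:
        for line in handle:
            # Drop trailing comments and surrounding whitespace.
            line = line.split("#", 1)[0].strip()
            # Skip blank lines and pip options such as "-r other.txt".
            if not line or line.startswith("-"):
                continue
            # Keep only the distribution name, dropping extras and version pins.
            match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
            if match is None:
                continue
            name = match.group(0)
            try:
                metadata.version(name)
            except metadata.PackageNotFoundError:
                missing.append(name)
    return missing
```

Run from inside the py directory, an empty list from
missing_requirements() suggests every listed package was found.
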

Developed by Joel Crombie (jc550) as a Summer Research Project
(ALPSS373-23C)

--------------------------------------------------