Detailed disk usage statistics
I generated the following statistics from a number of active EPrints archives on 3 March 2005.
All disk-usage values are in megabytes.
archive    live     eprints    disk space  ratio of      average disk    total disk space   disk space  total storage    % of total
           eprints  with       used for    eprints with  space used      used by archive    used by     needed for       storage
                    documents  documents   documents to  per eprint      (documents,        SQL tables  archive          used by
                                           live eprints  with documents  configuration,                 (SQL + archive)  documents
                                                                         files, web pages)
ecs        9049     2433       2327        0.27          0.96            2598               90          2688             86.57%
soton      4096     928        3174        0.23          3.42            3454               39          3493             90.87%
akt        171      224        137         1.31          0.61            145                9           154              88.96%
ebank      105      117        136         1.11          1.16            145                3           148              91.89%
cogprints  2161     2443       1323        1.13          0.54            1412               58          1470             90.00%
TOTALS     15582    6145       7097        0.39          1.15            7754               199         7953             89.24%
This shows that document storage is the only significant factor in the storage requirements of an archive: documents account for between 86% and 92% of the total storage in every archive listed.
Some archives have more eprints-with-documents than live-eprints. This is because some eprints-with-documents are either not yet approved for the live archive, or have never been submitted for approval.
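The average disk space used per eprint with documents varies between archives, from 0.54 megabytes (cogprints) to 3.42 megabytes (soton), but is always under 5 megabytes.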
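
The ratio, average and percentage figures in the table can be recomputed from the raw counts and sizes. The following is a minimal Python sketch of that arithmetic, not the script used to generate the statistics. The raw numbers are copied from the table; the names ARCHIVES and derive are hypothetical, chosen only for this illustration.

```python
# Raw figures copied from the table, per archive:
# (live eprints, eprints with documents, MB used for documents,
#  MB used by the archive's files, MB used by SQL tables)
ARCHIVES = {
    "ecs": (9049, 2433, 2327, 2598, 90),
    "soton": (4096, 928, 3174, 3454, 39),
    "akt": (171, 224, 137, 145, 9),
    "ebank": (105, 117, 136, 145, 3),
    "cogprints": (2161, 2443, 1323, 1412, 58),
}


def derive(live, with_docs, doc_mb, archive_mb, sql_mb):
    """Recompute the derived columns of the table for one row."""
    total_mb = archive_mb + sql_mb  # total storage needed (SQL + archive)
    return {
        # eprints with documents per live eprint
        "ratio": round(with_docs / live, 2),
        # average MB of documents per eprint with documents
        "mb_per_eprint": round(doc_mb / with_docs, 2),
        "total_mb": total_mb,
        # share of total storage taken up by documents, in percent
        "doc_percent": round(100 * doc_mb / total_mb, 2),
    }


# Column sums give the raw figures for the TOTALS row.
totals = tuple(sum(column) for column in zip(*ARCHIVES.values()))

for name, raw in {**ARCHIVES, "TOTALS": totals}.items():
    print(name, derive(*raw))
```

Because the TOTALS row is derived from the column sums, its ratio and average are weighted by archive size and are not simple means of the per-archive values.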
