Difference between revisions of "Frequently Asked Questions"

From EPrints Documentation
Jump to: navigation, search
(title and fulltext search returns no results, but date search does (EPrints 2.3))
(No longer obsolete but certainly out of date)
 
(21 intermediate revisions by 4 users not shown)
Line 1: Line 1:
 
[[Category:FAQ]]
 
[[Category:FAQ]]
 
[[Category:Management]]
 
[[Category:Management]]
 +
[[Category:Out of Date]]
 +
 
This all needs sorting out, it's just being grabbed from the old wiki for now.
 
This all needs sorting out, it's just being grabbed from the old wiki for now.
  
 
* [[Copyright and License FAQ]]
 
* [[Copyright and License FAQ]]
* [[Metadata FAQ]]
 
 
* [[Searching FAQ]]
 
* [[Searching FAQ]]
 
* [[OAI FAQ]]
 
* [[OAI FAQ]]
* [[BOAI Self-Archiving FAQ]]
 
 
* [[How much will it cost?]]
 
* [[How much will it cost?]]
  
 
== What operating systems can we use? ==
 
== What operating systems can we use? ==
  
EPrints should work on any UNIX-like operating system. We use Ubuntu (Debian) and Redhat Enterprise Linux.
+
EPrints should work on any Linux operating system. We use Ubuntu (Debian) and Redhat Enterprise Linux.
  
Running EPrints on Microsoft Windows is possible but not recommended - it can be difficult to get all the necessary dependencies in place.
+
Running EPrints on Microsoft Windows is possible but strongly not recommended.  It can be very difficult to get all the necessary dependencies in place.  If you want to get EPrints running but do not have as computer running a Linux operating system.  You could install [https://www.virtualbox.org/wiki/Downloads VirtualBox] and then create a Ubuntu or Redhat "virtual machine" (VM) to run on top of your Windows operating system.  Alternatively, you could install [https://docs.docker.com/docker-for-windows/install/ Docker Desktop for Windows] and [[Installing EPrints using Docker|install EPrints using Docker]].  However, both these methods are only recommended for testing or demoing EPrints.  If you intend to run EPrints long-term you should install on a dedicated Linux server or virtual machine.
  
 
== What computer do we need? ==
 
== What computer do we need? ==
 
+
Any modern physical PC (server or desktop) or a virtual machine should be more than capable of running EPrints. A suggested minimum specification for running a live service:
Any new PC is easily powerful enough. Suggested minimum spec. for a live service: 1gig RAM, 20gig Harddrive, 1GHz+ processor.
+
* 2GB RAM  
 +
* 15GB disk space
 +
* 100Mb/s network speed
  
 
== How much will it cost to set up? ==
 
== How much will it cost to set up? ==
Line 28: Line 30:
 
See [[How much will it cost?]]
 
See [[How much will it cost?]]
  
== How much diskspace will we need? ==
+
== How much disk space will we need? ==
 
+
The main usage of disk space will be the uploaded files for the publications, (e.g. PDF, Word documents, etc.).  On average an eprint item will have associated files (e.g. uploaded PDF and thumbnail and preview images of that PDF) that take up 2MB of disk space. So for every extra 500 eprint items you will need an extra 1GB of disk.  However, if you intend to upload lots of videos, images, are large documents, then this requirement will be higher. On top of the space needed for uploaded documents and associated files you will need approximately 5GB for installing the operating system. The disk space required by the database is difficult to determine, as this is dependent on a number of different factors but 1MB per eprint item.
Proabably about 2 megabytes per eprint. At the time of writing a 120GB drive costs 50 pounds. That drive would hold approximately 60000 eprints.
+
=== Example usage requirements ===
 +
* Small repository: 1000 eprint items
 +
** Operating system: 5GB
 +
** Upload and associated files: 2MB x 1000 = 2GB
 +
** Database: 1MB x 1000 = 1GB
 +
** '''TOTAL:''' 8GB (15GB to give room for expansion)
 +
* Medium-sized repository: 10,000 eprint items
 +
** Operating system: 5GB
 +
** Upload and associated files: 2MB x 10,000 = 20GB
 +
** Database: 1MB x 10,000 = 10GB
 +
** '''TOTAL:''' 35GB (50GB to give room for expansion)
 +
* Large repository: 100,000 eprint items
 +
** Operating system: 5GB
 +
** Upload and associated files: 2MB x 100,000 = 200GB
 +
** Database: 1MB x 100,000 = 100GB
 +
** '''TOTAL:''' 305GB (350-400GB to give room for expansion)
  
See also: [[Detailed disk usage statistics]].
 
 
 
== Is it possible when depositing a document, to just point to an "alternative location" rather than have the full text copied and held in the eprint archive? ==
 
== Is it possible when depositing a document, to just point to an "alternative location" rather than have the full text copied and held in the eprint archive? ==
  
Line 47: Line 62:
  
 
If you are installing a new copy of eprints, you can specify the user and group to use when you run 'configure'. Do
 
If you are installing a new copy of eprints, you can specify the user and group to use when you run 'configure'. Do
 
+
<syntaxhighlight lang="bash">
./configure --help  
+
./configure --help  
 
+
</syntaxhighlight>
 
for details.
 
for details.
  
 
== How do I get the body HTML of a page without the template around it? ==
 
== How do I get the body HTML of a page without the template around it? ==
 +
This is handy for dynamically linking EPrints content into other sites.
  
This is handy for dymnamically linking eprints content into other sites.
+
For "view" pages you need to add the option <code>include=>1</code> to the view configuration. This will cause generate_views to make a ''.include'' page in addition to the ''.html'' page. The ''.include'' page will have no template around it.
 
 
For "view" pages you need to add the option include=>1 to the view configuration. This will cause generate_views to make a .include page in addition to the .html page. The .include page will have no template around it.
 
 
 
For dynamic pages, those under /perl/, you can add the cgi parameter mainonly=yes
 
 
 
eg. http://eprints.ecs.soton.ac.uk/perl/latest?mainonly=yes
 
  
 +
For dynamic pages, those under ''/cgi/'', you can add the cgi parameter <code>mainonly=yes</code>.  E.g. <code>http://tryme.demo.eprints-hosting.org/cgi/latest?mainonly=yes</code>
  
 
== How do I get statistics on number of deposits per month? ==
 
== How do I get statistics on number of deposits per month? ==
  
 
This rather grim bit of SQL should work, although datestamp is the last modified date, not the submission OR creation date, it should still give a good indication.
 
This rather grim bit of SQL should work, although datestamp is the last modified date, not the submission OR creation date, it should still give a good indication.
 +
<syntaxhighlight lang="sql">
 +
select count(*), year(datestamp), month(datestamp)
 +
from archive
 +
group by year(datestamp),month(datestamp)
 +
order by year(datestamp),month(datestamp);
 +
</syntaxhighlight>
  
select count(*), year(datestamp), month(datestamp)
+
== I've edited the template (or other config file) but nothing seems to have changed - why? ==
from archive
 
group by year(datestamp),month(datestamp)
 
order by year(datestamp),month(datestamp);
 
  
== I've edited the template (or other config file) but nothing seems to have changed - why? ==
+
While EPrints tries to automatically reload when a configuration file is changed you may need to reload the Web server for changes to take effect. See also the [[API:bin/epadmin|epadmin reload]], the [[API:bin/generate_static|generate_static]] and [[API:bin/generate_views|generate_views]] commands.
  
While EPrints tries to automatically reload when a configuration file is changed you may need to reload the Web server for changes to take effect. See also the ''reload'' command in [[API:bin/epadmin]].
+
= How to I add a metadata field to a live repository? =
 +
See [[Adding a Field to a Live Repository]].
  
 
= Installation Related Questions =
 
= Installation Related Questions =
 
   
 
   
== What platforms will GNU EPrints run on? ==
 
 
In theory any UNIX like platform: Linux, Solaris, BSD etc. even OSX! If you don't care then we recommend the RedHat Linux distribution.
 
 
 
== When running a script I get the error; "Insecure dependency in mkdir while running with -T switch" ==
 
== When running a script I get the error; "Insecure dependency in mkdir while running with -T switch" ==
  
Line 90: Line 101:
 
The indexer daemon is probably not running or is not working correctly, see [[API:bin/indexer]].
 
The indexer daemon is probably not running or is not working correctly, see [[API:bin/indexer]].
  
== I don't want to give configure_archive my mysql root password. What is the alternative? ==
+
== I don't want to give epadmin my mysql root password. What is the alternative? ==
 
+
<syntaxhighlight lang="sql">
(instructions acurate as of EP 2.3.12)
+
  mysql> CREATE DATABASE myrepo;
 
 
Run configure_archive but say "no" to "create the database?"
 
 
 
Log into the mysql client as root:
 
 
 
% mysql -u root -p
 
Enter password:
 
 
 
(and enter your password)
 
 
 
This example creates a database for archive "foo" with user "foouser" and password "foopass".
 
 
 
These values should match the values you gave to configure_archive. You can check them in /opt/eprints2/archives/foo.xml
 
 
 
  mysql> CREATE DATABASE foo;
 
 
  Query OK, 1 row affected (0.06 sec)
 
  Query OK, 1 row affected (0.06 sec)
  
  mysql> GRANT ALL ON foo.* TO foouser@localhost;
+
  mysql> GRANT ALL PRIVILEGES ON myrepo.* TO myrepo@localhost IDENTIFIED BY 'secret';
 
  Query OK, 0 rows affected (0.52 sec)
 
  Query OK, 0 rows affected (0.52 sec)
 +
</syntaxhighlight>
  
The last bit depends if you are running on a MySQL?
+
Then provide the database name, user name and password to the '''[[API:bin/epadmin|epadmin]] create''' command.
;version >= 4.1
 
mysql> SET PASSWORD FOR foouser@localhost = OLD_PASSWORD("foopass");
 
 
 
;version < 4.1
 
mysql> SET PASSWORD FOR foouser@localhost = PASSWORD("foopass");
 
 
 
That's all <tt>configure_archive</tt> would have done.
 
 
 
----
 
  
 
==  How do I get a value for a field of an eprint (without using any SQL)? ==
 
==  How do I get a value for a field of an eprint (without using any SQL)? ==
Line 128: Line 116:
 
(assuming the eprint is in the main archive, and has eprintid number 23)
 
(assuming the eprint is in the main archive, and has eprintid number 23)
  
  my $ds = $session->get_archive()->get_dataset( "archive" );
+
<syntaxhighlight lang="perl">
  my $eprint = EPrints::EPrint?->new( $session, 23, $ds );
+
my $ds = $repo->dataset( "eprint" );
  my $value = $eprint->get_value( 'editors' );
+
my $eprint = $ds->dataobj( 23 );
 +
my $value = $eprint->value( "editors" );
 +
</syntaxhighlight>
 +
 
 +
See [[API:EPrints/DataObj]].
  
 
== How can I get a utf8 string of the name of a subject, given its subjectid? ==
 
== How can I get a utf8 string of the name of a subject, given its subjectid? ==
  
<pre>
+
<syntaxhighlight lang="perl">
  sub get_subject_name_string
+
sub get_subject_name_string
  {
+
{
 
my( $session, $subjectid ) = @_;
 
my( $session, $subjectid ) = @_;
 
my $subj = EPrints::Subject->new( $session, $subjectid );  
 
my $subj = EPrints::Subject->new( $session, $subjectid );  
Line 144: Line 136:
 
}
 
}
 
return EPrints::Utils::tree_to_utf8( $subj->render_description() );
 
return EPrints::Utils::tree_to_utf8( $subj->render_description() );
  }
+
}
</pre>
+
</syntaxhighlight>
 
 
...
 
 
 
 
 
== Using mod_perl2, pages with redirects (e.g. /perl/search) are blank. How do I fix this? ==
 
 
 
In perl_lib/EPrints/Session.pm change
 
 
 
      $self->{"request"}->status_line( "302 Moved" );
 
 
 
to
 
      $self->{"request"}->status(302);
 

Latest revision as of 12:36, 21 January 2022


This all needs sorting out, it's just being grabbed from the old wiki for now.

What operating systems can we use?

EPrints should work on any Linux operating system. We use Ubuntu (Debian) and Redhat Enterprise Linux.

Running EPrints on Microsoft Windows is possible but strongly not recommended. It can be very difficult to get all the necessary dependencies in place. If you want to get EPrints running but do not have as computer running a Linux operating system. You could install VirtualBox and then create a Ubuntu or Redhat "virtual machine" (VM) to run on top of your Windows operating system. Alternatively, you could install Docker Desktop for Windows and install EPrints using Docker. However, both these methods are only recommended for testing or demoing EPrints. If you intend to run EPrints long-term you should install on a dedicated Linux server or virtual machine.

What computer do we need?

Any modern physical PC (server or desktop) or a virtual machine should be more than capable of running EPrints. A suggested minimum specification for running a live service:

  • 2GB RAM
  • 15GB disk space
  • 100Mb/s network speed

How much will it cost to set up?

Most of the costs are staff time. Equipment costs are a PC, an internet connection and a BACKUP STRATEGY. Please remember to budget for backups.

EPrints, and all the other software required to make it work, are available for no cost. At some point in the future EPrints may offer some pay-services, but the core software will remain zero cost and freely available to all.

See How much will it cost?

How much disk space will we need?

The main usage of disk space will be the uploaded files for the publications, (e.g. PDF, Word documents, etc.). On average an eprint item will have associated files (e.g. uploaded PDF and thumbnail and preview images of that PDF) that take up 2MB of disk space. So for every extra 500 eprint items you will need an extra 1GB of disk. However, if you intend to upload lots of videos, images, are large documents, then this requirement will be higher. On top of the space needed for uploaded documents and associated files you will need approximately 5GB for installing the operating system. The disk space required by the database is difficult to determine, as this is dependent on a number of different factors but 1MB per eprint item.

Example usage requirements

  • Small repository: 1000 eprint items
    • Operating system: 5GB
    • Upload and associated files: 2MB x 1000 = 2GB
    • Database: 1MB x 1000 = 1GB
    • TOTAL: 8GB (15GB to give room for expansion)
  • Medium-sized repository: 10,000 eprint items
    • Operating system: 5GB
    • Upload and associated files: 2MB x 10,000 = 20GB
    • Database: 1MB x 10,000 = 10GB
    • TOTAL: 35GB (50GB to give room for expansion)
  • Large repository: 100,000 eprint items
    • Operating system: 5GB
    • Upload and associated files: 2MB x 100,000 = 200GB
    • Database: 1MB x 100,000 = 100GB
    • TOTAL: 305GB (350-400GB to give room for expansion)

Is it possible when depositing a document, to just point to an "alternative location" rather than have the full text copied and held in the eprint archive?

Yes, and to suppress the normal warning about no documents attached look in [archive_root]/cfg/cfg.d/eprint_warnings.pl.

I need to run apache as a user other than "eprints", what do I do to make EPrints work in this situation?

Example, apache is running as user "apache".

  • Make all the eprints files owned by "apache" instead of "eprints"
  • Edit SystemSettings?.pm to tell eprints to run as user "apache"
  • You'll need to run all command line scripts as user "apache"
  • All eprints cron jobs should be owned by user "apache"

If you are installing a new copy of eprints, you can specify the user and group to use when you run 'configure'. Do

./configure --help

for details.

How do I get the body HTML of a page without the template around it?

This is handy for dynamically linking EPrints content into other sites.

For "view" pages you need to add the option include=>1 to the view configuration. This will cause generate_views to make a .include page in addition to the .html page. The .include page will have no template around it.

For dynamic pages, those under /cgi/, you can add the cgi parameter mainonly=yes. E.g. http://tryme.demo.eprints-hosting.org/cgi/latest?mainonly=yes

How do I get statistics on number of deposits per month?

This rather grim bit of SQL should work, although datestamp is the last modified date, not the submission OR creation date, it should still give a good indication.

select count(*), year(datestamp), month(datestamp)
from archive
group by year(datestamp),month(datestamp)
order by year(datestamp),month(datestamp);

I've edited the template (or other config file) but nothing seems to have changed - why?

While EPrints tries to automatically reload when a configuration file is changed you may need to reload the Web server for changes to take effect. See also the epadmin reload, the generate_static and generate_views commands.

How to I add a metadata field to a live repository?

See Adding a Field to a Live Repository.

Installation Related Questions

When running a script I get the error; "Insecure dependency in mkdir while running with -T switch"

This usually indicates you are running an eprints script as root. Don't do that; become user 'eprints' instead.

title and fulltext search returns no results, but date search does

The indexer daemon is probably not running or is not working correctly, see API:bin/indexer.

I don't want to give epadmin my mysql root password. What is the alternative?

 mysql> CREATE DATABASE myrepo;
 Query OK, 1 row affected (0.06 sec)

 mysql> GRANT ALL PRIVILEGES ON myrepo.* TO myrepo@localhost IDENTIFIED BY 'secret';
 Query OK, 0 rows affected (0.52 sec)

Then provide the database name, user name and password to the epadmin create command.

How do I get a value for a field of an eprint (without using any SQL)?

(assuming the eprint is in the main archive, and has eprintid number 23)

my $ds = $repo->dataset( "eprint" );
my $eprint = $ds->dataobj( 23 );
my $value = $eprint->value( "editors" );

See API:EPrints/DataObj.

How can I get a utf8 string of the name of a subject, given its subjectid?

sub get_subject_name_string
{
	my( $session, $subjectid ) = @_;
	my $subj = EPrints::Subject->new( $session, $subjectid ); 
	if( !defined $subj ) 
	{
		return "errer, unknown subject: $subjectid";
	}
	return EPrints::Utils::tree_to_utf8( $subj->render_description() );
}