Migration

From EPrints Documentation
Revision as of 18:29, 21 August 2007 by TahYbd (talk | contribs)
Jump to: navigation, search

sapori ditalia e dal mondo srl gk giugno di fuoco per far cry power rangers - il film di silvester stallone ram notebook fujitsu sony dsc-st80 batteria per samsung sghe710 camera commercio roma alfa 33 1.7 ie 16v quadrifoglio verde bacio tra paola e chiara sodimm 256mb ddr 333mhz pc2700 scarpa calcio lotto poesie sull amicizia erdal a-style t-shirt festa degli invitati clisteri teleobiettivo nikon www inq7 com da ethernet a wireless video asiatiche driver stampante hp download nk 300 tema desktop windows xp ha kieu a fresh kill recorder dvd home theatre memoria rambus patty pravo eidos vacanza benessere trentino acer veriton 256 paglialunga cartuccia epson c60 notti di cabiria km0 rover discovery auto km 0 pelicula titanic orologio fasi lunari due assi nella manica laface enzo indesit wp 100 gigi and molly heterohyrax volvo varese buzzle dipinto paesaggio natacha galeria gratis chicos famosos neger bums mich moto mondiale r 1200 gs clara morgan nuda di max de angeli oakley crosshair work manowar wheels of fire gande fratello uk mulinello surf il croccodile come fa la colina de la vida de ataque 77 kenzo eau coppie over 50 grasse polistirolo prato polo jersey uomo t amo ps2 gt 4 laser hp stampante enterostomie frigorifero coca cola rossanna mitsubishi - pajero golden sun the o c telefilm palm te2 tamron 28-75mm gci net finanziamento berlin www aduc it reyno animal s3 racconti erotici moglie porte chiuse cooperative edilizie torino dying diana one t ft cool t the magic key epson stylus photo rx sms gain villa santantonio san remo 2002 profumi gucci spia spione calzature donna inverno 40 prada mercatino trieste ministerointerni the battle di hans zimmer apple mouse ingresso videocamere adsl2 firewall router giochi di avventura 3d distretto militare di salerno racchetta da tennis dunlop jonni giochi demo di strategia arei prealpi srl tammorriata pugni chiusi fr david kinkyman auto noleggio vigevano luci e ombre per il tokyo game show autunnale cachorros foto donne comuni nude stradale belluno volo lissone cheap obiettivi canon ef 55-200 dsc t7 sony photoshop 8 rav4 2002 tesi scienze statistiche reale mutua assicurazione racconti sculacciate rh 7900mh confessionale calendario costantino hydromania newport hotel amplificatori valvole verona trapano black decker cartuccia oro air 3 bud asus pda phone sostanza dimagranti fred dal whisky facile rs mmc card 512mb dual jvc gr-x5 albergo 3 stella copenhagen played a life safari duo fatto o rifatto film gratis animal sexi copay secondo www gemidos com ar novoaltajsk tanto la queria de andy y lucas lg direct drive lavatrice custodia chitarra manuale philips ice-shelf linea mani lancome testo pgr engher eva cellulare supporto auto sedia iroko la natura in goethe pioneer deh-p8600mp kobayashi, issa te quiero te quiero lancia z defend km0 chrysler pt cruiser auto km 0 video free hard navigatore satellitare gps batman del futuro ferro da stiro delonghi chatta erotica in due parole meka gianna singole quadri courbet don t you worry www it vodafon it cagliari milano linate biglietti aerei hp 5590p toner epson al-c2000 bij 2300 donna con lui cuckold nuovo colombo bonders xxxporno ms office small business epson aculaser al-cx11n nike air max automatic porno animali www il fiume po it tutto wind ram v-data igiene degli alimenti mp4 player televisore vendita john jeans dana skully passeggino chicco trio s3 top in partibus infidelium incontri a agrigento nikon 12 24 roma vienna wireless schede acquisizione e tv giada de blanck lo spavaldo cartoon network la albergo economico praga www unina2 lampo pascoli giovanni commento server hp ml sony mavica cd 500 mercedesbenz zena gang bang peugeot ranch infocus lp120 oyo como fun camera nokia sumer wind femmine d epoca nude agriturismo barletta ram notebook fujitsu sony t33n embu divani letto oregon scientific rmr 966 grognardo eamon fik it il signore degli anelli game boy buscar video eamon supermercado forcella corsa navigatore gps bluetooth finvetro spa porche italia hinhanh cordless logitech dinovo lcd tuner zeppelin foto nude ragazzi www plaka it appositi intitle istituto lexmark z25 distributore coca il nostro anniversario blasy letterina il diabolico dott. satana switch wi-fi shubert s ave maria www snals it na grundig tvc 15 dsc s80 ragazze con jeans attillati foto lavoro manuale casa cacciatori giocgi ford fiesta 1989 pascali pino libri rs-mmc dv per nokia 6630 512 un bacio ti domando ragazza rozzano trasformatore camper four flicks schlieren battle of the year discoteche del veneto annuncio case emilia romagna tennis madrid promessi sposi cap 28 lo scandalo dx 50 codec uomo negro foto hp stampanti multifunzione con fax katana con filo backstage video parole di i believe my heart mousepad razer profumi uomo armani sapphire radeon x600 maser srl foto chitarre prs il passaggio del reno francesco oppini scanner formato a4 campagnolo direct toples dobry dj fotoannunci trans monnalisa rojas harman kardon avr 7300 elementi di informatica macchina da pasta atlas 150 ricerca numero telefonico prestito ristrutturazione western digital essential 250gb usb 2 0 la locura de yahir nike force sistars kylie minogue i believe in you suunto orologi gianni garko info cccccccc test canzoni aemon nec e313 sbloccare aggressive alpine skiings all little lolita sellano agriturismo giunti cardanici bucketheads the vestito in velluto nuove immagini per legend of kay hp server elena barolo giorgia palmas black magic woman password bios dimenticata dragostea din tei original rom panda 1 3 crea gelati muriel, il tempo di un ritorno dee jay parade 2000 evahenger club moto epoca inglesi tradurre in russo diabetes discussion ragazzo jesi kv 29 old matura assistenza samsung exmo stuprato noleggio auto bolzano miguel bose morena mia carta termica cnc usati polo nord magnetico hotel sharm el sheikh ingresso videocamere viet net com batteria jvc videocamera finanziamento valencia toxicity sistem of a down smile di chaplin pereyra ta 5 fnf ps friges haier lavatrici berlin alexanderplatz sync 913 monitor samsung epson c82 ef 70-300mm f 4 5-5 6 do is usm nazionale italiana pallavolo harry potter il prigioniero di azkaban cartoni animati giapponesi black pearl jam foto scooter preparati antinori vini http costantino e alessandra auguri cc exp lil wayne amarsi e l immenso megaworks 5 1 esterno tv street skater camerieri echinocyamus vito mercedes scalo a orly temi svolti eutanasia gps vista steve lachance flandre mayra veronica urlo bottari bologna fc dvd vergini 16x frigorifero 400 etimologia della parola informare sony mdr v 500 stranieri come noi hino nacional da indonesia voorne lucillo scandalo gf lumix fz 30 froufrou federico e giulia oesis facolta di psicologia padova voyeur sex ballerine brasiliane cercasi offerta cellulari aftherhours flasdance corsi trading on line ufficio imola mily hp officejet psc1610 castellina pasi botta e risposta con le creature impossibili trasferimenti docenti si la ves de franco de vita mercedes slk 320 she will be in love This page covers how to migrate from EPrints 2 to EPrints 3.

Migration Toolkit

The migration toolkit, available from http://files.eprints.org/ does quite a bit of the heavy lifting. It is intended to help configure an EP3 archive to have the same files, eprint types etc. as an EPrint 2 repository and then copy the data over.

Release 1.0-beta-1 should be a big improvement over 0.2 but it still doesn't do everything.

Installation

Backup

First of all make sure your EPrints 2 repository is backed up, just in case things don't go to plan. You already back it up daily anyway, right...?

Mtoolkit

Un-tar the package on the same machine as your EPrints 2 repository.

If your EPrints 2 was not installed in /opt/eprints2 then you'll need to modify the first line of the two .pl scripts in the toolkit.

EPrints 3

Minimum version required: 3.0.2 (This version introduces some very small options and bugfixes aimed at migration).

Also, get an EPrints 3 server set up. This can be either on the same machine (you'll need a separate instance of apache as ep2 and ep3 can't run under the same server at the same time, put it on port 8080 for now - see http://httpd.apache.org/docs/2.0/install.html for instructions - put it in another directory using the --PREFIX option!), or on a different machine. Get a repository created (probably with the same ID as your ep2 repo, although that's not essential). The database will need to be a different name or you'll get in an utter mess.

mkconfig.pl

This tool takes the id of an EPrints 2 repository and generates a number of EPrints 3 config. files. Copy these files into the cfg dir of your EPrints 3 repository. It also creates a file called migration_notes.txt with some helpful comments of anything it's messed with.

Get your (empty) EP3 repository up and running using these configuration files.

export3data.pl

This script exports the data from your EPrints 2 repostory in a format which can be imported by EPrints 3.

To export the data do the following:

 export3data.pl ARCHIVEID eprints > eprints.xml
 export3data.pl ARCHIVEID users > users.xml
 export3data.pl ARCHIVEID subjects > subjects.xml

eprints.xml references the full paths of the files in EPrints 2. If your EPrints 3 is on a different machine you'll need to either make sure they are the same on the new machine or do a big search-and-replace on eprints.xml!

Importing

EPrints 3.0.2 no longer needs the hacks which were required for mtoolkit 0.2

Empty out any test data

To erase the current data in your EP3 repository use:

bin/epadmin erase_data ARCHIVEID

Import the data

To import the subjects and users do:

/opt/eprints3/bin/import_subjects --verbose --force --xml ARCHIVEID subjects.xml
/opt/eprints3/bin/import --verbose --migration ARCHIVEID user XML users.xml

If something goes wrong with subjects or users, use epadmin erase_data to empty the database and start again.

To import the EPrints do:

/opt/eprints3/bin/import --verbose --migration ARCHIVEID eprint XML eprints.xml

If something goes wrong with importing the eprints, use epadmin erase_eprints, to just erase the eprints data so you don't need to redo subjects and users.

the --migration option tells the importer to:

  • skip are-you-sure? messages.
  • use the eprintid and userid from the XML rather than assigning them.
  • use the "datestamp" from the XML rather than assign it.
  • load files from the local file system (normally this would be a security hole)

You may encounter some issues with badly formed XML. This is due to non correctly encoded data creeping into your database. It should all be utf-8 but earlier versions of EPrints didn't always check... If your EPrints 2 server is running perl 5.8 you can install the Perl module Encode which will clean up your data, but on our system our EPrints 2 was running on a machine with an older version of Perl and we didn't want to risk upgrading.

Finishing up after using mtoolkit

You will probably still want to tweak some of the following things by hand, depending how much you customised EPrints 2:

Some of these we can't easily add to the mtoolkit (those involving perl code). The XML files we could add in theory, but we've made a decision to release 1.0 with the current features, rather than delay it months but make it perfect.

  • the template
  • the workflow (EPrints 3 offers some nice features, look at the lib/defaultcfg/workflows/ for an idea of what you can do)
  • the static pages (.xpage)
  • the citation files
  • the /view/ browsing configuration
  • the search configuration
  • any custom render routines
  • the render eprint method (eprint_render.pl)
  • any custom document security options
  • any custom validation options
  • etc.

Feel free to add tips on the wiki, linked from this section.


Known bugs in version 1.0 of toolkit / importing into EPrints 3.0.2

Documents with subdirectories fail to import

FIX: do them by hand at the end.

Warning messages about "hideemail"

hideemail was introduced in a version of EPrints 2 (I forget which). Earlier repositories may not have this field. Some of the EPrints 3 default config files assume it exists (user_fields_default.pl and user_render.pl).

FIX 1: Don't worry about it.

FIX 2: Before importing users.xml, add the hideemail field back into user_fields.pl

         {
           'name' => 'hideemail',
           'input_style' => 'radio',
           'type' => 'boolean',
         },

Error missing field: X

The default EPrints 3 config. may reference a field not imported. If so you can almost always just remove the offending section of configuration. Examples: searches, citations, views.

Problems with bad characters in eprints.xml

This is not tested, but I think this should clean it up...

iconv -c eprints.xml --output=eprints_cleaned.xml -f utf-8 -t utf-8

Warning about Pagerange

Argument "" isn't numeric in addition ( ) at
 /opt/eprints3/perl_lib/EPrints/MetaField/Pagerange.pm line 182.

This is a warning that is caused by having non-numeric data in the pagerange field. eg. "iii-xi".

FIX: Don't worry about it.

Can't import files which contain "/"

eg if your document had index.html and images/dia.jpg

FIX: Make a note of the offenders, and just add those documents by hand.

FIX2: Bug chris to add this to fix this in the final release of 3.0.2 (it's not in beta-1)