Autocompletion

From EPrints Documentation
Revision as of 17:20, 19 August 2007 by AvuPhu (Talk | contribs)

Jump to: navigation, search
www ottoperuna com barboncino nano cartina autostradale di milano niekumacje ocean hotel betty cantante video la settimana enigmistica www anagrafe prestazioni it presentazione tesi laurea il signore degli anelli torna su pc e psp lancio del balivo caterina intel pentium d 830 ospedale pisa nolwenn leroy nick burg video strip archos video agenzie immobiliari donoratico castagneto carducci ciaoamigos it just cavalli abiti film irreversible pioneer dvr-110d risultati campionato calcio serie b b charme univisionradio dove l amore videocamera digitale con zoom letteratura antica brindisi alberghi giuseppe verdi. il trovatore babycakes three of a kind hotel calabria ombrello per passeggino hopkinson, john il ritorno di ringo tasman, abel janszoon rs-mmc - dual voltage nokia bandiere dell europa annuncio affitto pisa motorola c350 software fabro mostra mercato pensioni bologna buttercup why do you build me up spears porno britney spears 2005 euro dance 1 depo-provera im 1 ml 50 mg slim thug killa kyleon cyber guerieri polar s720 logitech mouse g7 fun magic 1 4 maggioli spa soldiers tale programma scaricare giochi pc usher brano yeah da scaricare gratis poesie per compleanno donne in cerca di un uomo s wonder chat cristiano com francesca ridolfi colonna filippo srl tiziano ferro universal prayer download fax inkjet segreteria olgiate comasco salvatore paola autonoleggi lamezia terme hit my love caesars palace dragonstea dintai www ashanti com video dream theater the enigma take the long way home chi fermera la muscia una demo americana per final fantasy xii mercedes slk 2004 fersam moves sick and tired di anastacia new trolls che idea opel astra elegance video clip christina aguilera the voice www quellenhof it sito scaricare musica rientro ostaggi elicottero raptor bitware fax ati 9600 sartre la rubrica di ale e costa outlook italia pocket pc 9090 mercedes e 270 cdi riassunto vanity fair monitor benq t905 navigatore pioneer avic hub usb frontale www brevetti it isi testo di bice frasi augurali laurea engin nursani distruggi lo schermo schermo per videoproiettore vivian blaine erotika erotika i-mate jam televisori crt 29 ricoh aficio cl7000 guerra e pace romanzo www acffiorentina it raccomandate postali eraclea forum di grande fratello exeem cartina asia fisica tanjore san michel teneriffe canon ef 14mm f 2 8l usm dyson animal pro moondance michel buble www perdasdefogu it power animal top toelettatura animali brascia cover vasco mogli esibite roulotte torino midi alfio lombardi dragon die tei voli tenerife sinan gerusalemme cose fare benni benassi desktop monitor ricchi et poveri jubilate subwoofer philips she by loved jimmy choo v1 sony da hypa fz10 panasonic video di raf annuncio online immobile gratis pro trek prg-50 cubetti di ghiaccio tu trouveras natasha st pier divx sintoamplificatore home theatre you make me feel brand in your eyes gabriel secret grandefratello 4 testo canzone convivendo hulk1 sole negli occhi dvd nichetti dany jay totti pubblicita rischio del circolante e mezzi di pagamento nel commercio internazionale donne casteltermini fuoristrada usati piastra di registrazione ingegnere delle telecomunicazioni (regione lombardia - bergamo provincia) voyeur foto amatoriali la morte preannunciata holub, emil contratto nazionaledi lavoro bmw cabrio 2002 pioneer home teatre 333 rag doll hockery pioneer p70bt bergamo timisoara film baldios driver motorola c330 gprs hp - ipaq hw6515 men in black vera pelle morcinek, gustaw dvd fantozzi monitor lcd 17 silver troms bergamo pagine bianche maria mercader www pampers it ventola in rame aspire wlmi 1672 esprimo p5600 sony kv 32fq70e terra senza donne strade subsonica la pallina magica - il ritorno della pallina magica cover buoni cattivi living in my hand data premiazione mtv roma 2004 bamba karma pornostar postalmarket k minolta dimage z3 gigolo s black eye pease un caldo corpo di femmina erice hotel chicco alaska minclip com benny benassi no matter what you do free regione friuli geonav 4c nautica azione obbligazione bratz babyz fiat stilo selespeed jerez de los caballeros foto cbs principessa diana orologi casio illuminator king kong dvd www kimco it gloria constantino foto e l era tardi dvd moana pozzi hit mania 1995 firstservice srl office microsoft renzo arbore batterie sony ericsson cirano di guccini cartolina augurio compleanno sentieri soapopera eric von stroheim tre individui tanto odio macchine da caffe professionali dj casper chachaslide toner xerox 2135 la grigliata foto clitty una rockstar in cerca damore personaggi settimo cielo mendelson dragostea fin tei lyrics nikon - f 55 testo lo strano percorso 883 niurka fotos toner samsung ml-2150 vendita cam eccitazione genitale magic combi warlords battlecry ii napoli kappa mimosa festa donna lacie hard disk usb in tutti i miei giorni canzoni napoletane classiche impianti stereo the simps university jam spoon set me fre acer c300 allenamento per la mtb stampante hp 6840 nikon 70-300 g kt tunstall eye telescope acer mp3 512mb r9250 128mb pci ddr dvi tvo 64bit warp brothers grande sophie netgear wgr614 802 11g wireless router art carney gioco di macchine da corsa pristina musica y videos musicales de yandel frisinga one piece ps2 videogiochi il cuore e un zingaro x ray murder boom boom get here la festa dei folli girlsaloud sound of the underground occhiali ray ban donna www googl com santa clara (u.s.a.) joe lansdale saoke hej nissan micra c c immagine mare lop sony np-fp90 tastiera retro illuminata michel bouble rulli per bicicletta dvx s120 yamaha linea porno gay www i love messenger it fort bliss filato torrie macchina a vapore rian air siemens gigaset sl barabani paolo pesca da riva ingrossamento pene athlon xp 64 3200 scanner fujitsu a3 la strada giusta mollificio cappeller gladiatore rmx teac hd esterno usb kaho counter attak francesi boneym imac 20 di luca brothers spa tim curry street of simcity www banca lombarda www pesca in it a tutto bergman stylus photo r200 myskina nue nike basketball shoes tendenze dell architettura contemporanea morron associazione invalidi civili rovigo stock market rdb vigili il gioco di oggi jenner, edward lettore di pcmcia tp 012 usa sesso la nosta favola auto noleggio barcellona pozzo gotto incontro san gimignano maria nazionale notiziario foggia moglie zoccole video grtatis porno gay progettista meccanico (regione emilia romagna - bologna provincia) epistassi oberon (astronomia) paula vazquez charles boyer www chicas calatas lavastoviglie da 45 cm ophelia winter themes films nastydollars paul frank ink jet printable cd www nose de sexo com dawnload dvx free eric clapton. unplugged da muro orologi talk ventole di raffreddamento cpu cagnotto plextor dl battaglie le colline hanno gli occhi sexs coppie it unisannio router 3cr858-91 3com cinelli yelena club sexy xxx gisela molinero orario treno eurostar per firenze miele 8762 trevi tv gerber, gottfried heinrich partite di calcio venditore di mappamondi tinto rosse lana lane home theater aiwa il mantenuto western digital sataii c chicas inside jupiter tetas gratis pioneer dvr k05 hunchun fake sexy panca nordictrack karekano intestatura cherubini roma audio hai nu sinh ngo quyen mi ermanita moho kokkola woofer sony per auto telecomando 6 jayden telefoni cordless duo panasonic stil unic pc2700 memoria ram hard disk esterno 10 neon kit

EPrints 3 Reference: Directory Structure - Metadata Fields - Repository Configuration - XML Config Files - XML Export Format - EPrints data structure - Core API - Data Objects

For a how-to, please see Autocompletion and Authority Files (Romeo Autocomplete)

Autocompletion in EPrints 3 consists of serveral stages.

  • A field in the workflow is configured to say what autocompletion URL to use, plus any additional parameters to pass to the script. This URL must be on the same server (eg. foo.eprints.org) but does not have to be part of the EPrints system.
  • The autocomplete script takes the text typed so far (and maybe the additional parameters) and returns a chunk of XML describing possible autocomplete options. This XML consists of a number of rows (how many is up to the script).
  • Each row contains some HTML to show the person viewing plus a magic <ul> block which is hidden from display, but is used by the autocomplete javascript to autocomplete the page.

Autocomplete Scripts

EPrints autocomplete scripts live in /opt/eprints3/cgi/users/lookup/ you can add your own here, or maybe elsewhere if, for example, you needed to use PHP.

There are several kinds of autocomplete scripts:

  • thoses that just use the existing data in your repository (these are dead easy as they work out of the box)
  • ones which use a file which you place in your repositories cfg/autocomplete/ directory.
  • more clever ones.

You may be able to find new autocomplete scripts and authority files on http://files.eprints.org/

Scripts are in (rough) order of complexity to use...

journal_by_name

Can only be used on the "publication" field. Looks up the publication in the existing publications in the repository and autocompletes the publication. If ISSN and/or publisher exist in the same input component as the journal field they will also be completed if data is available.

journal_by_issn

As above, but attached to the ISSN field.

event_by_name

Similar to journal_by_name. Is attached to the event_title field and autocompletes from existing repository data. If they are in the same (multi) input component it will also try and autocomplete event_location, event_dates and event_type.

name

Attached to a multiple compound name/id field (eg. creators) looks up the name in the existing list in the repository. Can match on any id or given or family. Populates all parts of the current row it can.

title_duplicates

This is a slightly odd script as it doesn't actually provide any autocomplete data. What it does is search the list of existing titles to see if there is a match. It only searches if there are 5 or more characters entered so far.

If it finds any matches it lists them with a warning that they might be a problem, but does not assist autocompletion. If many matches are made then a short title only is shown, if the list is only 4 or lest then a full citation is shown.

This is set to "on" by default in the hope that it will reduce duplicate submissions.

simple_file

File needs an additional parameter to be passed to it. This is configured in the workflow. This parameter is the name of a file in the cfg/autocompete directory. This file contains a list of values which are searched (case insensitively) and matches returned. A second parameter of "mode=prefix" can be set to only match values which start with the text being typed, rather than contain it.

simple_sql

Similar to simple_file but gets its values from a database table.

The table must be in the eprints database used by this repository and start with "ac_". The script needs a param. passed from workflow to indicate the name of the table WITHOUT the ac_ prefix. Eg. if the table was "ac_badgers" the parameter would be "table=badgers". The only field used is "value" which works like the lines in the text file. If you want this to be blindingly fast you can make sure "value" is indexed, and set mode=prefix. With those set autocompleting from a dictionary of half a million words worked cheerfully.

romeo

(not included in 3.0, expected in 3.1) This script uses the EPrints/Romeo data to provide journal autocomplete data. Should be attached to the publication field. This is almost identical to file, but inserts the required Powered by Sherpa note.

url_name_value

This works like simple_sql except for the fact it uses three columns. url, name and value. It searches and autocompletes using value, but the human-readable description is supplied by "name" and if url is set then a (more info) link is shown. The link opens a new window to avoid mid-form trauma.

file

This is for more complex autocompletion authority files. It works like simple_file except that the file format is more complicated.

The file constists of lines which contan:

  • a value to search, (eg. "African Journal of Agricultural Research")
  • a tab
  • a <li> autocomplete chunk. (with no line breaks) eg.
 <li style='border-right: solid 50px #30FF30' >

external source

This takes all the ideas above, and extends them to make an API call to an external data source. This has the advantage that you are always referring to the authoritative source, but the disadvantage that you are reliant on both the network being up and the external source being available.

It breaks down into two parts:

  • the autocompleter call in the web page
  • the script being called

For an example, here is one way to query the RoMEO data directly:

First, set the autocompleter in the eprints workflow:

     <component type="Field::Multi">
      <title>Article Publication Details</title>
        <field ref="publication" input_lookup_url="{$config{perl_url}}/get_journals" />
        <field ref="publisher" />
        <field ref="issn" />
      </component>

Next have the script:

use strict;
use HTTP::Request;
use LWP::UserAgent;
use XML::Twig;

use Data::Dumper;
use EPrints;

my $journal_data = {};

sub urldecode{
  my ($url) = @_;
  $url =~ s/%([0-9a-f][0-9a-f])/pack("C",hex($1))/egi;
  $url =~ s/\x2B/ /; # swap ' ' for ' '
  return $url;
}

# XML::Twig's routine for dealing with a journal entry
sub process_journal {
  my ( $twig, $journal ) = @_;

  # get the components
  my $title = urldecode( $journal->first_child('jtitle')->text );

  my $zetoc = urldecode( $journal->first_child('zetocpub')->text ) 
                  if $journal->first_child('zetocpub');
  my $romeo = urldecode( $journal->first_child('romeopub')->text )
                  if $journal->first_child('romeopub');
  my $issn  = urldecode( $journal->first_child('issn')->text )
                  if $journal->first_child('issn');

  my $publisher = $romeo;
  $publisher = $zetoc if (not $publisher