Presentation on theme: "National federated searching Arthur Sale Emeritus Professor of Computer Science University of Tasmania"— Presentation transcript:
National federated searching Arthur Sale Emeritus Professor of Computer Science University of Tasmania Arthur.Sale@utas.edu.au http://eprints.utas.edu.au/8694
Australian national options 1.Australasian Digital Theses Program 2.ARROW Discovery Service 3.DIY: AuseSearch 4.DIY: eSearch-Africa
ADT The Australasian Digital Theses Program (ADT) harvests data from 41 universities in Australia and New Zealand, providing a two-nation gateway to PhD theses. Searching on individual universities is also possible. ADT provides the thesis metadata and local search – the actual text is always held in the universitys repository.
Lets look at PhDs ADT is the oldest gateway to Australian research, pilot operating in 1999, went live July 2000. Please look at http://adt.caul.edu.auhttp://adt.caul.edu.au
ADS The ARROW Discovery Service (ADS) was created under an initial government grant to ARROW (Australian Research Repositories Online to the World). It is operated by the National Library of Australia on an ongoing basis. ADS harvests metadata from all Australian universities through their OAI interfaces and provides a search facility.
Federated again As with ADT, ADS is a gateway to a federation of repositories. ADS holds copies of all the metadata relating to research papers, but the full texts (Versions of Record) are always held in the various university repositories. Please look at http://search.arrow.edu.au http://search.arrow.edu.au
Digression to BASE BASE is a European repository search engine with global coverage, which is developing the capability of country search. See http://base.ub.uni- bielefeld.de/en/about_sources_map.php?me nu=2&submenu=1. At present only European countries are searched, but an extension to anywhere is easy. Put pressure on BASE (Bielefeld University) and help them. http://base.ub.uni- bielefeld.de/en/about_sources_map.php?me nu=2&submenu=1
Do-it-yourself Google provides the capability of creating Custom Search Engines: search sites that use the Google database, but are limited to a defined set of sites or characteristics. I created such a search facility for Australia. Later I added New Zealand, but viewers can separate the two countries. Universities can add this search to their Library pages. Scripts are available.
AuseSearch AuseSearch does NOT harvest metadata. Instead it uses Googlebots crawl data to provide access to the html/pdf pages held in repositories as well as metadata. This means its results are differently ranked from ADS or ADT, since the PageRank algorithm is used. See: http://www.google.com/coop/cse?cx=0 12189697858739272261:yyyqychcumo http://www.google.com/coop/cse?cx=0 12189697858739272261:yyyqychcumo
South Africa At this point, I should say that the University of Pretoria has created such a custom search engine for South Africa. I have no knowledge of what repositories are included, so talk to the UP people here. See http://www.google.com/coop/cse?cx=013518 019117943970829%3Atlw8-sayn_q http://www.google.com/coop/cse?cx=013518 019117943970829%3Atlw8-sayn_q
So what about Africa? So, I decided to create a custom search engine for all of Africa, accessing every repository I could find via eIFL, OpenDOAR and ROAR (25). Time spent: maybe 1 hour, with an additional hour or two in adding new repositories and classifying their countries. Low cost. Result: an Africa-only search, capable of being refined to a country.
eSearch-Africa You wont be surprised to look at this site too: http://www.google.com/coop/cse?cx=01 2189697858739272261:nppegeei_q4 This could easily be maintained in Africa say by UP, with additional nations as refinements, or when it gets too large turned into national CSEs. All UP needs to do is add country refinements.
Summary I Lets start summarizing: 1National (or larger) gateway aggregations are not hard to do, and may be relatively inexpensive. 2The issue is always having content to share – in other words repositories at the level of the university or research centre (such as CSIR).
Summary II 3Lobby national government to provide money to establish repositories for all higher education research. By university is best if you can afford it, otherwise shared consortial arrangements are very acceptable. 4Register existing repositories into global networks! 5Establish cheap national searches.
Summary III 6Start pressuring national government to make global exposure of national research a policy priority. 7Suggest a national mandate, law or regulation that all publicly funded research in universities and research institutes must be made open access as soon as possible. This implies repositories to put them in.