From Web Indexing To Hybrid Libraries, With Thanks to eLib Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY URL:

Slides:



Advertisements
Similar presentations
Subject Based Information Gateways in The UK Coordinated Activities in The UK Within the UK Higher Education community, the JISC (Joint Information Systems.
Advertisements

Why metadata matters for libraries... Rachel Heery UKOLN: The UK Office for Library and Information Networking, University of Bath
UKOLN and the Institutional Web Service UKOLN (UK Office for Library and Information Networking) is a research and dissemination unit based at the University.
1 RDF Tools Brian Kelly UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the British Library Research and Innovation Centre,
A centre of expertise in digital information management Developing a Quality Culture For Digital Library Programmes Author & Presenter Brian Kelly UKOLN.
A centre of expertise in digital information managementwww.ukoln.ac.uk Search Facilities For Web Sites A Discussion Group Session Brian Kelly UKOLN University.
A centre of expertise in digital information managementwww.ukoln.ac.uk Dont Do It Yourself Content Syndication on the Web Pete Cliff UKOLN University of.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
A centre of expertise in digital information management A QA Framework To Support Your Library Web Site Review Brian Kelly UKOLN University of Bath Bath.
Towards consensus on collection-level description Collection Description Focus Briefing Day 1 British Library, St Pancras, London 22 October 2001 Bridget.
Portal-to-portal : joining up content to decrease the time spent clicking as distinguished from the time spent working Michael Fraser.
Creating web guides for a library portal Jackie Wickham – Intute Martin Gill – University of Leeds
RDN-Include: Re-branding Remote Resources Subject Gateways in the UK The UK Higher Education community has funded a range of subject gateway, now part.
1 Technical Developments Related to Quality Issues Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY
1 Authentication and Open Standards Brian Kelly UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the British Library Research.
Metadata and interoperability: Michael Day UKOLN: the UK Office for Library and Information Networking University of Bath
Developing portal services: the Subject Portals Project Rosemary Russell SPP Project Manager UKOLN, University of Bath
Publishing on the WWW Search Engines & Metadata. Aims and Objectives To identify and discuss the different types of search engine Understand the basic.
1 Promoting Your Project Web Site Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY England UKOLN is funded by the Library and Information.
Automated News Feeds: The RSS Standard For News Feeds Brian Kelly UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by:
1 WebWatch: Monitoring Web Developments In The UK Brian Kelly UK Web Focus UKOLN University of BathURL Bath, BA2 7AY
CEN/ISSS DC workshop, January The UK approach to subject gateways Rachel Heery UKOLN University of Bath UKOLN is.
A Lightweight Approach To Support of Resource Discovery Standards The Problem Dublin Core is an international standard for resource discovery metadata.
A centre of expertise in digital information managementwww.ukoln.ac.uk Approaches To E-Learning: The Users’ Perspective Brian Kelly UKOLN University of.
1 If I Could Start All Over Again: Lessons To be Learnt From The HE Community Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is.
Digitising Journals, March 2000, Copenhagen Astrid Wissenburg Information Services and Systems King’s College London
Technologies For Hybrid Libraries: Implementation Issues Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the Library.
WebWatch Ian Peacock UKOLN University of Bath Bath BA2 7AY UK
Archiving The UK Domain And UK Web Sites Brian Kelly UK Web Focus UKOLN University of Bath URL UKOLN.
New Approaches To Resource Discovery In The UK HE Community Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY
The DNER - a national digital library Andy Powell ZIG Meeting, York October 2001 UKOLN, University of Bath UKOLN is funded by Resource:
Approaches To Indexing in The UK Higher Education Community Institutional Activities Surveys of 150 UK University web sites show the popularity of freely.
1 The Latest Web Developments Brian Kelly, UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the British Library.
1 Exploit Interactive: The Development of a Web Magazine Bernadette Daly Information Officer UKOLN University of BathURL Bath,
The Agora hybrid library project Rosemary Russell, UKOLN (UK Office for Library and Information Networking) Agora Communications Coordinator.
Sustainability: Web Site Statistics Marieke Napier UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
The Resource Discovery Network and OAI Andy Powell UKOLN, University of Bath UKOLN is funded by Resource: The Council.
Possible Developments in Resource Discovery & National Directories. Paris, 6 July Metadata for interoperable cultural content: a personal viewpoint.
Automated Benchmarking Of Local Authority Web Sites Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by:
1 Libraries & the trade: converging standards? Paul Miller Interoperability Focus UK Office for Library & Information Networking (UKOLN)
Finding Resources On Your Web Site Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY URL:
1 Hybrid Libraries and information Clumps: a view from the UK Paul Miller Interoperability Focus UK Office for Library & Information Networking (UKOLN)
Accessing a national digital library: an architecture for the UK DNER Andy Powell ELAG 2001, Prague 7 June 2001 UKOLN, University of Bath
1 An Introduction to Metadata Brian Kelly UK Web Focus UKOLN University of Bath BA2 7AY
Metadata for the Web Andy Powell UKOLN University of Bath
1 Ariadne and Exploit-Mag: Web Review and European Library Telematics Philip Hunter UKOLN University of Bath Bath, BA2 7AY
1 Web Site Creation: Good Practice Guidelines Architectures For Project Web Sites Brian Kelly UK Web Focus UKOLN University of Bath UKOLN is supported.
Disseminating News Within Your Organisation Brian Kelly UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
Publishing Web Magazines, e-Journals and WebZines Brian KellyMarieke Napier UK Web FocusInformation OfficerUKOLNUniversity of Bath
JISC Information Environment Service Registry (IESR) Ann Apps MIMAS, The University of Manchester, UK.
From small beginnings: Developing collection level description Mapping the Information Landscape Showcase day British Library Conference Centre, London,25.
1 Future Of The Web Brian Kelly, UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the British Library Research.
Future Web Trends Brian Kelly UK Web Focus UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
A centre of expertise in digital information managementwww.ukoln.ac.uk Quality Assurance For Museum Web Sites: Review Brian Kelly UKOLN University of Bath.
1 Metadata – Has The Time Arrived? Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the Library and Information Commission,
A centre of expertise in digital information managementwww.ukoln.ac.uk A Standards Framework For Digital Library Development Programmes Brian Kelly UK.
A centre of expertise in digital information managementwww.ukoln.ac.uk UKOLN: WWW Brian Kelly UKOLN University of Bath Bath, BA2 7AY
Open Archive Forum Rachel Heery UKOLN, University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
1 Institutional Web Management: The Next Steps Brian Kelly UKOLN University of BathURL: Bath, BA2 7AY
Current Approaches to Web Site Development Brian Kelly UK Web Focus UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
A centre of expertise in digital information managementwww.ukoln.ac.uk UKOLN is supported by: Effective Web Site Training Workshop: Benchmarking Web Sites.
A centre of expertise in digital information managementwww.ukoln.ac.uk Search Facilities For Web Sites A Discussion Group Session Brian Kelly UKOLN University.
PAN-European Exploitation of the Results of the Libraries Programme - EXPLOIT German Libraries Institute Berlin EXPLOIT 1 Exploit Interactive Web Magazine.
Advertising On The Network Brian Kelly UK Web Focus UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives and Libraries,
Auditing and Evaluating Web Sites Brian Kelly UK Web Focus UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums,
Institutional Web Management: Report on “Working With HERO” Session UKOLN is funded by Resource: The Council for Museums, Archives and Libraries, the Joint.
A centre of expertise in digital information managementwww.ukoln.ac.uk Web Site Accessibility: Looking At Our Communities Brian Kelly UKOLN University.
Providing Information To Third Parties: The Pros And Cons Brian Kelly UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by:
Accessing a national digital library: an architecture for the UK DNER
Presentation transcript:

From Web Indexing To Hybrid Libraries, With Thanks to eLib Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY URL: UKOLN is funded by the Library and Information Commission, the Joint Information Systems Committee (JISC) of the Higher Education Funding Councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath where it is based. Aims of Talk: Review approaches taken by UK HE community to indexing web sites Discussion of findings Describe future developments Aims of Talk: Review approaches taken by UK HE community to indexing web sites Discussion of findings Describe future developments

2 Which To Choose? Alkaline (Vestris) AltaVista - Search Intranet ASTAWare SearchKey atomz Search (remote) BooleanSearch BBDBot BRS/Search (Dataware) Compass Server (Netscape) Cybotics DataWare BRS/Search DocFather (formerly SiteSearch) dtSearch Web Excalibur RetrievalWare EWS (Excite) Excerpt (Obsolete) Extense FAST Search Server Findex (code library) Folio siteDirector FreeFind (remote) Fulcrum Glimpse Harvest ht://Dig ICE iHound (ICATT) Index Search (Xavatoria) Index Server (Microsoft) IndexMySite (remote) Infoseek - Ultraseek Intermediate Search intraSearch (remote) I-Search Isearch ITMS Isys:web Java Applets JHLSearch JObjects QuestAgent Lycos / InMagic Magnifi Enterprise Server Matt's SimpleSearch Microsoft Index Server Microsoft Site Server MiniSearch (remote) MondoSearch Muscat NetResults (now SearchKey Plus) Netscape - Compass Server OpenText - LiveLink Perl Scripts Perlfect Search Phantom (Maxum) PicoSearch (remote) Etc. Indexing software from Which to choose? What software may be obsolete? What does remote mean? Indexing software from Which to choose? What software may be obsolete? What does remote mean? Can choose by reading reviews, web sites, etc. or by looking at usage in community

3 Findings: UK HE Web Sites Main findings of 2 surveys: SoftwareNos. (Jul) ht://Dig eXcite Microsoft Harvest Ultraseek Other None Nos. (Mar)      —  Totals Article published in Ariadne issue 21 - Results (including update on survey) available from: Article published in Ariadne issue 21 - Results (including update on survey) available from:

4 Popular Products: ht://Dig ht://Dig Now used at 32 (up from 25) UK HEIs Freely available New version released in December 1999 Own domain with well- designed web site Robot to index multiple servers See Oxford Case Study 131 servers 438,500 resources Indexes MS Office, PDF, etc. files (external parser) Oxford Case Study 131 servers 438,500 resources Indexes MS Office, PDF, etc. files (external parser) Case Studies produced by Helen Sargan (Cambridge)

5 Popular Products: Ultraseek Ultraseek: Used at 9 (up from 7) UK HEIs Powerful but expensive See Cambridge Case Study 232 servers 188,000 resources Weightings given to meta tags Useful logs and reports Cambridge Case Study 232 servers 188,000 resources Weightings given to meta tags Useful logs and reports

6 Popular Products: Harvest Harvest: Now used at 6 UK HEIs (down from 8) For IR research use? See Issue: Pay for software Pay for programming support to implement free software Issue: Pay for software Pay for programming support to implement free software

7 Use of Third Party Services Small usage of third parties to provide indexes: FreeFind (Used at 2 HEIs) and AltaVista (Used at 1 HEI) Why not more use by 50 institutions with no search facility? Benefits from services provided by popular large- scale search engine Low cost (free?)  Incomplete coverage?  Document fluctuation  Loss of control, advertising, … Benefits from services provided by popular large- scale search engine Low cost (free?)  Incomplete coverage?  Document fluctuation  Loss of control, advertising, …

8 Try Them For Yourself Interfaces to UK University search engines are available, providing a single location for evaluation The page also provides a link to organisational search pages The resources are grouped in alphabetical order and by search engine What functionality do libraries using Domino provide? What does Aberdeen's search facility provide? See

9 Other Developments What else is happening to indexing of these communities? National search engines Local initiatives eLib Hybrid Libraries

10 National Search Engines ACDC (Academic Directory) (Unfunded) pilot of index of ac.uk domain based on distributed approach using Harvest Set up in March 1996 Lack of development effort resulted in degraded service (e.g. indexer not aware of JavaScript code) No longer being developed

11 Institutional Developments Maestro robot (Dundee): Indexes Scottish resources Volunteer effort Maestro robot (Dundee): Indexes Scottish resources Volunteer effort North East Universities (UNIS4NE): Appearance of cross-searching Actually interface to HotBot / AltaVista North East Universities (UNIS4NE): Appearance of cross-searching Actually interface to HotBot / AltaVista

12 eLib Hybrid Libraries eLib Phase 3 includes "Hybrid Library" projects: Help users find electronic (web, OPAC, etc.) and "real world" resources Includes regional and subject-specific approaches MusicOnline search of Music Catalogues BUILDER search of eLib Phase 3 web sites

13 Other Possibilities What other developments may we expect: Increased indexing in institutions of other web sites (opposition / friends) Leave it to commercial sector Development of a HE (or public sector?) national search engine New developments (XML / RDF / etc.)

14 Indexing Remote Sites May see increased indexing of remote sites within institutions:  Examples provided by Dundee and BUILDER (eLib) Feeling of ownership Easily done Can develop enhancements locally  Increased server load locally  Increased server load remotely  Increased network load  Not scalable  Unnecessary duplication

15 Commercial Solutions Could leave searching to commercial world: No costs to institution / HE community  Not integrated with non-Web services  Results too broad  Distracting interface  Little scope for tailoring

16 What About Metadata? Metadata can: Improve search results Provide structured information (for automated processing) which can provide richer services: –Fielded searches –Limit searches (e.g. only Library pages on Council web site) –Web site administration –Alternative browsing interfaces Tools, standards, etc. becoming available Expected growth area

17 Example Exploit Interactive web magazine ( ) is using metadata to provide enhanced searching: Search for foo in: Issue 2 or in issue 2 and 4 (this is possible using directory structure) Feature Articles (needs metadata) Articles about EU- funded projects Etc. Combinations of above Also provides alternative browsing structures

18 JISC Developments DNER (Distributed National Electronic Resource): Seamless access to national resources What about local resources? Need for "institutional portals" RDN Resource Discovery Network Builds on work of eLib subject gateways Based on standards (Dublin Core, Z39.50, whois++, LDAP, RSS, Dublin Core,etc.)

19 Conclusions To conclude: No clear "best buy" for indexing software Probably some to avoid In 2 years time are you likely to still be using same software? Have changed software / architecture? If changes likely, need to think about change migration strategies, interoperability issues, etc. Library community has much to offer Need for user studies (not covered) Useful Resources Useful Resources Questions welcome