Published by Magnus Robertson. Modified over 8 years ago.
1
ALEXANDRIA DIGITAL LIBRARY PROJECT
Larry Carver, James Frew, Greg Janée, Mike Goodchild, Linda Hill, Terry Smith
www.alexandria.ucsb.edu
2
Alexandria Digital Library Project, Smith et al., NSF, April 3, 2003
Outline
o Alexandria Digital Library Project (ADLP)
    History
    Goals, activities, partners
o Distributed DL supporting georeferenced access
    Research and development issues
    Operational collections and services
o Knowledge organization systems (KOS)
    Gazetteers and related KOS
o ADEPT learning environment
    Concept-based learning spaces
    Collections and services
3
ADLP History
o Pre-1994: UCSB geo-information and map library
o 1994-98 DLI-1: georeferenced collections/access
o 1998-99: Operational ADL (UCSB Library/CDL)
o 1999-2004 DLI-2: distributed DL
    Extension of architecture and access services
    Knowledge organization services
    Integration of learning services
    Geo/GIS-based interfaces
    Basic CS research
o 2004-2008: Large-scale DLs and beyond
    NSDL Core Infrastructure and services
    Cyber Infrastructure
4
ADLP Goals
o Current goals: Distributed DLs and applications
    Operational distributed digital library
      – services for construction/use of georeferenced collections
      – DL federation and interoperation
      – scalability over many heterogeneous collections
    Development/integration of KOS services
    Integration of concept-based learning spaces
      – services for creating/using learning environments
    Development of geo-based interfaces
    Evaluation of services
    Basic computational science research
o Emerging goals: Large-scale DLs and beyond
    Extending NSDL Core Infrastructure and services
    Cyber Infrastructure
5
ADLP Major Collaborative Activities
o 1994-98
    4 DLI-1 partners: CMU, Illinois, Stanford, UCB
    SDSC, U. Arizona, US Navy, NIMA, LoC, MSFT, ESRI, …
o 1999-2004
    UCSB Library, CDL
    DLI-2 partners: UCLA, GT, SDSC/NPACI, Stanford, UCB
    DLESE
    NSDL CI partners: Cornell, Columbia, U. Mass
    NSDL Services partners: IIT Chicago, UCSD
    JISC partners: Penn State, Southampton, Leeds
    …
6
ADLP Activities
OPERATIONAL APPLICATIONS
    georeferenced DL tutorials
    distributable software packages
    operational libraries: UCSB library,...
    outreach; federated nodes
KNOWLEDGE ORGANIZATION
    gazetteers: research and community
    gazetteer content standard
    web service protocols for gazetteers, thesauri, and other KOS
    ADL gazetteer
    thesauri for feature and object types
    duplicate detection for gazetteers
    textual-geospatial integration services
GEOREFERENCED DIGITAL LIBRARIES
    distributed georeferenced DL services
    NSDL core infrastructure
    data environment (e.g., GIS) integration
    hardware acceleration for spatial data
    collaborative tools
    Z39.50 support
    ingest and workflow systems
EDUCATIONAL APPLICATIONS
    knowledgebase and lecture composing, visualization, and presentation tools
    physical geography concept space and learning object collections
    applications to undergraduate education
    educational evaluation
    learning services and DL integration
    digital classrooms
    metadata content standards
    learning objects
    computational models
USER INTERFACES
    reusable user interface components
    contextual maps, footprint creation
    KOS navigation
    lightweight GIS functionality
    Digital Earth visualization
    image processing
    query-by-content, classification
    spatial extent determination
7
Outline
o Alexandria Digital Library Project (ADLP)
    History
    Goals, activities, partners
→ Distributed DL supporting georeferenced access
    Research and development issues
    Operational collections and services
o Knowledge organization systems (KOS)
    Gazetteers and related KOS
o ADEPT learning environment
    Concept-based learning spaces
    Collections and services
8
Goals
o Digital library architecture for geospatial/georeferenced information
    heterogeneous
    rich services
    scalable
      – many providers
      – collections, large and small
o DL infrastructure, not artifact
    standard components and interfaces
    distributed participants
9
Issue: discovery
o Naïve approach
    I want a map of Boulder
    “Downtown street map of Boulder, Colorado”
o But... remote-sensing imagery is nameless
    AVHRR NOAA-13 2002-06-03 14:33 UTC
o But... direct placename search is unreliable
    I want a map of the Flatirons in the Rocky Mountains just behind Boulder, Colorado
    USGS topographic map “Eldorado Springs”
    generally: many names for any given place
10
ADL approach
o Coordinate-based representation and discovery
    lat/lon coordinates
    rich geometry
      – polygons, polylines
    spatial operators
      – overlaps, contains
o Gazetteer
    content standard defines representation
    service maps placenames ↔ coordinates
[Diagram: client, gazetteer, library; placenames and coordinates flow between them]
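The coordinate-based approach above can be sketched in a few lines of Python. This is only an illustration, not ADL code: the `Box` class, the `GAZETTEER` and `ITEMS` tables, and every coordinate value are assumptions made for the example, and footprints are simplified to bounding boxes rather than polygons or polylines.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Lat/lon bounding box in degrees (hypothetical; ignores antimeridian wrap)."""
    west: float
    south: float
    east: float
    north: float

    def overlaps(self, other: "Box") -> bool:
        # True when the two boxes share at least one point.
        return (self.west <= other.east and other.west <= self.east
                and self.south <= other.north and other.south <= self.north)

    def contains(self, other: "Box") -> bool:
        # True when `other` lies entirely inside this box.
        return (self.west <= other.west and other.east <= self.east
                and self.south <= other.south and other.north <= self.north)


# Hypothetical gazetteer: placename -> footprint (rough, illustrative values).
GAZETTEER = {
    "Boulder": Box(-105.30, 39.96, -105.18, 40.09),
    "Flatirons": Box(-105.30, 39.97, -105.28, 40.00),
}

# Hypothetical library items: title -> footprint (illustrative values).
ITEMS = {
    "USGS topographic map 'Eldorado Springs'": Box(-105.375, 39.875, -105.25, 40.0),
    "Downtown street map of Boulder, Colorado": Box(-105.29, 40.01, -105.27, 40.02),
}


def find_items(placename: str) -> list:
    """Gazetteer step (name -> coordinates), then library step (spatial overlap)."""
    region = GAZETTEER[placename]
    return sorted(title for title, fp in ITEMS.items() if fp.overlaps(region))
```

With these made-up footprints, `find_items("Flatirons")` finds the "Eldorado Springs" map even though its title never mentions the Flatirons, which is the point of searching by coordinates rather than by name.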
11
Issue: multiple data types
o Geospatial discovery
    is not amenable to text treatment
    constitutes new data type
o Adding notion of different data types has many implications:
    input validation
    internal structures, external representations
    query language and processing
    ranking
    user interface components
12
ADL approach
o Discovery: “bucket framework”
    extensible data type system for metadata
      – XML representations, search operations
    native metadata is explicitly mapped to buckets
    software supports bucket views over arbitrary RDBMSs
    9 Dublin Core-like standard buckets
o User interface components
    background maps, item footprint identification/creation
o Spatial ranking
    by spatial similarity to query region
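The slide does not say how spatial similarity is computed. One common choice, assumed here purely for illustration, is the ratio of the intersection area to the union area of the query box and the item footprint. Boxes are hypothetical `(west, south, east, north)` tuples in degrees.

```python
def area(box):
    """Area of a (west, south, east, north) box; empty boxes count as 0."""
    west, south, east, north = box
    return max(0.0, east - west) * max(0.0, north - south)


def similarity(query, footprint):
    """Intersection-over-union of two boxes, between 0.0 and 1.0 (an assumed measure)."""
    inter = area((max(query[0], footprint[0]), max(query[1], footprint[1]),
                  min(query[2], footprint[2]), min(query[3], footprint[3])))
    union = area(query) + area(footprint) - inter
    return inter / union if union > 0 else 0.0


def rank(query, items):
    """Return item names ordered from most to least similar to the query region."""
    return sorted(items, key=lambda name: similarity(query, items[name]), reverse=True)
```

Under this measure an item whose footprint matches the query region exactly scores 1.0, while a continent-sized footprint that merely contains a small query region scores close to 0, so large background items do not crowd out well-targeted ones.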
13
Bucket mapping
[Diagram: the Originator bucket is fed by the fields FGDC Citation/Originator and USGS DOQ Producer; example values: U.S. Geological Survey; Photo Science, Inc. Diagram labels: bucket-level searching, field-level searching, collection statistics]
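A minimal sketch of the bucket idea, assuming a simple lookup table: native metadata fields from different standards are mapped to one shared bucket, and a search can run against the whole bucket or against a single native field. The record layout, the function names, and the pairing of the two example values with particular records are assumptions for illustration, not the ADL implementation.

```python
# Hypothetical mapping: (metadata standard, native field) -> bucket name.
BUCKET_MAP = {
    ("FGDC", "Citation/Originator"): "Originator",
    ("USGS DOQ", "Producer"): "Originator",
}

# Hypothetical records; which value belongs to which record is assumed.
RECORDS = [
    {"id": "item-1", "standard": "FGDC",
     "fields": {"Citation/Originator": "U.S. Geological Survey"}},
    {"id": "item-2", "standard": "USGS DOQ",
     "fields": {"Producer": "Photo Science, Inc."}},
]


def bucket_view(record):
    """Regroup a record's native fields under the buckets they are mapped to."""
    view = {}
    for name, value in record["fields"].items():
        bucket = BUCKET_MAP.get((record["standard"], name))
        if bucket is not None:
            view.setdefault(bucket, []).append((name, value))
    return view


def search(bucket, text, field=None):
    """Bucket-level search by default; pass `field` to narrow to field-level."""
    hits = []
    for record in RECORDS:
        for name, value in bucket_view(record).get(bucket, []):
            if (field is None or name == field) and text.lower() in value.lower():
                hits.append(record["id"])
                break
    return hits
```

A bucket-level query such as `search("Originator", "photo")` reaches records from either standard, while `search("Originator", "photo", field="Producer")` restricts the match to one native field.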
14
Collection statistics
[Figure: collection statistics by Object Type (cartographic works, maps, images, photographs, aerial photographs), with Count values 324,876; 2,014,799; 484,083, plus Temporal and Spatial panels]
16
ADL in context
[Diagram: ADL positioned relative to the Web, DLs (Greenstone, ODL, OAI), and GIS; axis labels: affordances, generality, structure]
17
Issue: scalability
o Size
    easy to accumulate lots of data
      – satellites image continuously
    geospatial discovery scales... not so well
      – indexing unwieldy at 10^6 items
    efficiently joining spatial, other constraint types is difficult
o Burden & management
    collection building is labor-intensive
    providers have differing content, services, IP concerns, policies, lifetimes
    providers already exist
      – MS Terraserver: 3 TB, 750 million items
18
ADL approach
o Distributed library of peer nodes
    library nodes host collections
    other nodes host gazetteers, thesauri, other KOS
    other components, e.g., map servers
o Federated item-level search
    over buckets
    over individual metadata fields mapped to buckets
o Centralized collection-level search/ranking
    over collection statistics derived from bucket mappings
      – space, time, type, format
    any library node can act as collection registry
o Collection aggregation
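As a rough sketch of collection-level search, a registry might hold summary statistics per collection and use them to decide which collections are worth sending an item-level query to. The statistics layout, the collection names, and the ranking rule (more items of the requested type first) are all assumptions made for this example.

```python
# Hypothetical registry: per-collection spatial extent and item counts by type.
COLLECTION_STATS = {
    "collection-a": {"extent": (-125.0, 32.0, -114.0, 42.0),
                     "types": {"maps": 1200, "aerial photographs": 5000}},
    "collection-b": {"extent": (-10.0, 35.0, 30.0, 60.0),
                     "types": {"maps": 300}},
}


def boxes_overlap(a, b):
    """Overlap test for (west, south, east, north) boxes."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def candidate_collections(region, object_type):
    """Collections holding items of the type whose extent overlaps the region,
    ordered by item count (an assumed ranking rule)."""
    scored = []
    for name, stats in COLLECTION_STATS.items():
        count = stats["types"].get(object_type, 0)
        if count and boxes_overlap(stats["extent"], region):
            scored.append((count, name))
    return [name for count, name in sorted(scored, reverse=True)]
```

Only the collections returned here would then receive the federated item-level query, which keeps the per-query fan-out small as the number of nodes grows.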
19
Issue: context & use of library items
o Context is critical in geospatial DLs
    formulating queries
    evaluating result sets and individual results
o Use of geospatial data
    need access descriptions
      – “item content single URL” is insufficient
      – multiple formats
      – multiple access methods
      – multiple components
    need integration with common data environments
      – ARC/INFO, etc.
20
Geospatial context
o Does this answer your question?
[Map figure; labels: Flatirons #1-5, Flagstaff Rd., Green Mountain]
21
ADL approach
o All library functionality is accessible via...
    web service APIs
    Java RMI
o Content access model
    characterizes methods of access
    multiple “access points”
      – download, service, web interface, offline
      – hierarchies of alternatives, decompositions
o Context
    background maps
    library-supplied lightweight GIS functionality
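One way to picture the content access model is as a small tree: leaves are access points of the four kinds listed above, and inner nodes either offer alternatives or decompose an item into parts. The class names and the example item below are hypothetical; the slide gives only the concepts, not a schema.

```python
from dataclasses import dataclass


@dataclass
class AccessPoint:
    kind: str          # "download", "service", "web interface", or "offline"
    description: str   # free-text note (hypothetical field)


@dataclass
class Alternatives:
    options: list      # any one option gives access to the content


@dataclass
class Decomposition:
    parts: list        # all parts together make up the content


def access_points(node):
    """Flatten the hierarchy into its leaf access points, in document order."""
    if isinstance(node, AccessPoint):
        return [node]
    children = node.options if isinstance(node, Alternatives) else node.parts
    return [leaf for child in children for leaf in access_points(child)]


# Hypothetical item: a single file, or two separate bands, or an offline order.
item = Alternatives([
    AccessPoint("download", "single image file"),
    Decomposition([
        AccessPoint("download", "band 1"),
        AccessPoint("download", "band 2"),
    ]),
    AccessPoint("offline", "order on physical media"),
])
```

A client could walk such a tree to pick the access method it supports, instead of assuming every item is reachable through one URL.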
22
Incorporation into NSDL/CI
o Geospatial/georeferenced data is an instance of science data
    complex, well-defined structure
    rich metadata
    large size
    poorly served by traditional information retrieval methods
o Science data belongs in NSDL
o For NSDL: comparable infrastructure enabling...
    distributed, content-specific search services
    association of DL items and content-specific helper tools
23
Operational status
o ADL co-developed with UCSB Library
    production-quality software
    foundation of operational library since 2000
    complete system in 2003
o UCSB Library: Map & Imagery Laboratory (MIL)
    self-supporting, 5 full-time employees
    2.6 million items, 6.5 TB, growing 1.5 TB/year
    4.5 million item gazetteer
o Remote sites
    ESSW, CNR, DLESE, SIO, NTNU, AUT
24
Outline
o Alexandria Digital Library Project (ADLP)
    History
    Goals, activities, partners
o Distributed DL supporting georeferenced access
    Research and development issues
    Operational collections and services
→ Knowledge organization systems (KOS)
    Gazetteers and related KOS
o ADEPT learning environment
    Concept-based learning spaces
    Collections and services
25
KOS activities & contributions
o KOS as primary components of DL architecture
    Heretofore not acknowledged as a major component
    ADL/ADEPT thesaurus and gazetteer service protocols
o Gazetteer components of DLs
    Growth of a research and development community, adopting/adapting/sharing our ADL Gazetteer components
    Gazetteer research issues
    NSDL Textual Geospatial Integration Project
o KOS integration into learning environments
    Terry Smith will address this in detail
26
Digital Library Components
[Diagram of components]
CATALOG OF METADATA
SERVICES: accessing, analyzing, archiving, cataloging, digitizing, retrieving, searching, visualizing
KNOWLEDGE ORGANIZATION SYSTEMS: authority files, classification systems, concept spaces, dictionaries, gazetteers, glossaries, ontologies, subject heading sets, thesauri
DATA STORE OF OBJECTS
Libraries, Collections
27
KOS Generalization
[Diagram labels: Relationships, Label, Type, Definition, Meaning; Navigation, Translation, Sense-making]
28
Digital Gazetteer Essentials
[Figure; one label reads "(controlled vocabulary)"]
None of these elements are unique identifiers of a particular place
29
Building gazetteer research community
o 1994-1996: ADL built the first multi-million-entry international gazetteer and integrated it into the ADL system
o 1996-1999: ADL created...
    Gazetteer Content Standard
    Feature Type Thesaurus (210 preferred terms; 1046 non-preferred)
    rebuilt the ADL Gazetteer (over 4 million entries)
    provided web interfaces for searching the ADL Gazetteer
30
Building a research community
o 1999-present
    Digital Gazetteer Information Exchange (DGIE) Workshop, funded by NSF (66 participants), 1999
    JCDL 2002 workshop on Digital Gazetteers – Integration in Digital Library Services (38 participants; sponsored by NKOS)
    NAACL 2003 workshop on Analysis of Geographic References
    ADL-hosted discussion list for gazetteer issues; archived by NSF DLI2 (146 subscribers)
    Set of 5.9 million geographic names available for download – useful for placename recognition in text
    Gazetteer Service Protocol and protocol server code
    An “external identifier” for ADL Gazetteer records
    New gazetteer client that is based on the gazetteer protocol
31
Our network of gazetteer interactions
o Electronic Cultural Atlas Initiative (ECAI) gazetteer project
o Academia Sinica’s Taiwan Gazetteer
o UK Historical Boundaries project
o UK Geo-crosswalk project
o Digital Library for Earth System Education (DLESE)
o Biodiversity research, such as the “Specify” system – University of Kansas
o State projects, such as NY Agricultural History project (in proposal stage) and Florida statewide gazetteer project
o University of Redlands internship proposal (mini-GIS)
o Bulgarian Antarctic Place-Names Commission
o SRI’s Artificial Intelligence Center (spatial reasoning)
o Navy’s SPAWAR Systems Center (natural language processing)
o THREDDS project at UCAR (event gazetteers)
o Illinois Institute of Technology (geoparsing research)
32
Advancing and extending gazetteers
Named Time Periods
    World War I: 1914 to 1918 [timeline figure]
Named Spatiotemporal Events
    Such as Hurricane Hugo
33
Advancing and extending gazetteers
What happens when we extend the digital gazetteer model to anatomy: named structures in the brain, for example?
    [Image] http://www.ohiou.edu/~linguist/l550ex/brainpic.htm
Or to celestial space and 3-d features?
    [Image] Credit & Copyright: Sherry Buttnor, http://antwrp.gsfc.nasa.gov/apod/ap011120.html
    [Image] Anticline, Famennian sandstone, Hastière, http://www.nitg.tno.nl/eng/iccp_tripj.shtml
34
Advancing and extending gazetteers
Obtaining extents from image analysis
    Recognizing patterns
    Identifying features from gazetteers
    Deriving the extent of the features from feature analysis
    Adding bounding box footprints to gazetteer entries
[Image example: Santa Barbara Municipal Airport]
35
Advancing and extending gazetteers
The duplicate detection problem. Given variant names and variant footprints, how do we determine that two pieces of information are about the same place?
[Map examples of variant names: Lake Bigler, thru 1920s; Lake Bonpland (also Bondland), thru 1890s; Da-ow-a-ga, thru 1850s]
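The slide poses duplicate detection as an open problem and gives no method. The sketch below is one naive heuristic, assumed only to make the question concrete: compare footprints by overlap, and fall back on name similarity when the overlap is only moderate. The thresholds, the entry layout, and the abstract footprint coordinates are all hypothetical.

```python
from difflib import SequenceMatcher


def name_similarity(names_a, names_b):
    """Best pairwise string similarity between two sets of variant names."""
    return max(SequenceMatcher(None, a.lower(), b.lower()).ratio()
               for a in names_a for b in names_b)


def footprint_overlap(a, b):
    """Intersection-over-union of two (west, south, east, north) boxes."""
    width = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    height = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = width * height
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def likely_same_place(a, b, name_min=0.85, overlap_min=0.5, strong_overlap=0.9):
    """Heuristic guess; a real system would also compare feature types and dates."""
    overlap = footprint_overlap(a["footprint"], b["footprint"])
    if overlap >= strong_overlap:
        return True  # near-identical footprints, even if the names are unrelated
    return overlap >= overlap_min and name_similarity(a["names"], b["names"]) >= name_min


# Hypothetical entries; footprints are abstract boxes, not real coordinates.
bigler = {"names": ["Lake Bigler"], "footprint": (0.0, 0.0, 2.0, 2.0)}
bonpland = {"names": ["Lake Bonpland", "Lake Bondland"], "footprint": (0.0, 0.0, 2.0, 2.0)}
elsewhere = {"names": ["Lake Bigler"], "footprint": (10.0, 10.0, 12.0, 12.0)}
```

The example entries show why names alone are not enough: variant names for one place can share little text, while an identical name can refer to a feature somewhere else entirely.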
36
Advancing and extending gazetteers
From Michael Freeston, New Generic Indexing Technology
    Effective and efficient database indexing techniques for large spatial + text data collections
Test database of 2-d shapes in a geographic area
    to test the “sufficiency” of spatial generalizations (e.g., bounding boxes) for information retrieval based on spatial similarity (e.g., degree of overlap or containment)
37
Gazetteer ITR Proposal
Advancing and Extending Georeferencing Interoperability and Services (AEGIS)
Medium ITR proposal for 2003
Michael Goodchild, UCSB, PI
Lewis Lancaster, Berkeley/ECAI, co-PI
o Formalization and extension
o Performance and scalability
o Cross-cultural issues
o Cognitive and behavior issues
o Extents: representation of a feature’s geometry
o Integration of locator services
38
NSDL Textual Geospatial Integration
Goals
    Extend NSDL infrastructure by enabling
    o geographic queries across heterogeneous, text and non-text resources
    o spatial georeferencing of arbitrary texts without explicit geographic cataloging
2001 - 2003
Participants
    University of California, Santa Barbara
    o James Frew, PI
    o Terence Smith
    o Michael Bueno
    o Linda Hill
    Information Retrieval Lab, Illinois Institute of Technology
    o Ophir Frieder
    o David Grossman
    o Eric Jensen
    o Steve Beitzel
The American Geological Institute (AGI) has permitted us to use a set of their GeoRef records for system training.
39
Example text -> Estimated footprint

Structure and petrography of the schist of Skookum Gulch, Callahan-Yreka area, eastern Klamath Mountains, Northern California

blueschist | California | Callahan California | foliation | Klamath Mountains | melange | metamorphic rocks | Ordovician | Paleozoic | petrology | schists | Silurian | Siskiyou County California | Skookum Gulch | United States | Yreka California

The schist of Skookum Gulch (SSG) is an informal name applied to a fault-bounded melange composed mainly of schistose metamorphic rocks and less abundant sedimentary and igneous rocks located in the eastern Klamath Mountains of Northern California. The SSG features outcrops of lawsonite+sodic amphibole blueschist and epidote+sodic amphibole rocks transitional to the greenschist facies. Isotopic dating indicates that the schist was metamorphosed during the Ordovician. The SSG is the oldest known Paleozoic blueschist-bearing melange in California and one of the oldest preserved blueschist terranes in North America. Tonalitic rocks associated with the schist have Early Cambrian ages and are among the oldest rocks yet dated within the Klamath Mountains. Field relations indicate that the schist of Skookum Gulch is a complex tectonic melange composed of metavolcanic,...

N410000N420000W1220000W1230000

[Map legend: Derived footprint - small; Blue: derived footprint – large; Red: GeoRef footprint]
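The coordinate string N410000N420000W1220000W1230000 in the example can be turned into a bounding box with a short parser. The reading used here is an assumption rather than something the slide states: each token is a hemisphere letter followed by degrees, minutes, and seconds (two-digit degrees for latitude, three-digit for longitude). The function names are hypothetical.

```python
import re


def to_degrees(token):
    """Convert one token such as 'N410000' or 'W1220000' to signed decimal degrees."""
    hemisphere, digits = token[0], token[1:]
    degrees = int(digits[:-4])
    minutes = int(digits[-4:-2])
    seconds = int(digits[-2:])
    value = degrees + minutes / 60 + seconds / 3600
    return -value if hemisphere in "SW" else value


def parse_footprint(text):
    """Return (west, south, east, north) from a string holding two latitudes
    and two longitudes in the assumed DDMMSS / DDDMMSS layout."""
    tokens = re.findall(r"[NS]\d{6}|[EW]\d{7}", text)
    lats = [to_degrees(t) for t in tokens if t[0] in "NS"]
    lons = [to_degrees(t) for t in tokens if t[0] in "EW"]
    if len(lats) != 2 or len(lons) != 2:
        raise ValueError("expected two latitudes and two longitudes")
    return (min(lons), min(lats), max(lons), max(lats))
```

Under that reading the record's footprint is the one-degree box from 41°N to 42°N and 123°W to 122°W, which a derived footprint could be compared against using an overlap measure.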
40
KOS activities & contributions
o KOS as primary components of DL architecture
    Heretofore not acknowledged as a major component
    ADL/ADEPT thesaurus and gazetteer service protocols
o Gazetteer components of DLs
    Growth of a research and development community, adopting/adapting/sharing our ADL Gazetteer components
    Research issues
    NSDL Textual Geospatial Integration Project
o KOS integration into learning environments
    Terry Smith will address this in detail
41
Outline
o Alexandria Digital Library Project (ADLP)
    History
    Goals, activities, partners
o Distributed DL supporting georeferenced access
    Research and development issues
    Operational collections and services
o Knowledge organization systems (KOS)
    Gazetteers and related KOS
→ ADEPT learning environment
    Concept-based learning spaces
    Collections and services
42
Applications services based on DLs
o Integrate applications with DL infrastructure
    Web portals lack library organization
    “packages” not integrated with DLs
o Important applications include
    Services/collections supporting learning environments
    Services/collections supporting research
o Apply domain-specific KOS principles for organizing collections/services for given application
    Geospatial applications: use georeference
    Science learning environments: use concept spaces
43
Science learning spaces: Concept KOS
o Concepts of science as basic knowledge granules
    Sets of concepts form bases for scientific representation
    DL and KOS technology can support organization of science learning materials in terms of concepts
      – Collections of models of science concepts (knowledge base)
      – Collections of learning objects (LO) cataloged with concepts
      – Collections of instructional materials organized by concepts
o Organize learning materials as “trajectory through concept space”
    Lecture, lab, self-paced materials
    Services for creating/editing/displaying such materials
44
Learning environment components/services
45
Application to learning environments
o Application
    Introductory physical geography (F2002, S2003)
o Collections created
    Knowledge base (KB) of strongly structured concepts
    Structured lectures and labs
    Learning objects cataloged by ADN metadata (+ concepts)
o Services created
    For concepts
      – Web-based concept input tool
      – Graphic and text-based display tools
    For instructional materials
      – Web-based “lecture composer”
      – “Conceptualization” graphing tool
    For learning objects
      – Metadata input tool
46
Learning environment display (lecture mode)
o The lecture is presented on three projection screens, showing the
    Concept window (left)
    Lecture window (center)
    Object window (right)
47
Model of science concepts
o Representing a concept involves more than terms
    Objective, information-rich, scientific representations
      – e.g., for concepts of heat diffusion, DNA, drainage basin, …
    Associated semantics
      – e.g., relating to measurement, recognition, …
    Many interrelationships
      – e.g., hierarchical, causative, property, …
o Models of science concepts
    Already exist for chemistry (ASA), materials (NIST), …
    Generalize such models for this application
o Structure items in concept KB using model
48
Model of science concepts
    ID
    TYPE and FACET
    CONTEXT (KNOWLEDGE DOMAIN)
    TERM(S) (P/NP)
    DESCRIPTION(S)
    HISTORICAL ORIGIN(S)
    EXAMPLE(S)
    HIERARCHICAL RELATIONS
    DEFINING OPERATIONS
    SCIENTIFIC REPRESENTATION(S)
      – Scientific classifications
      – Data/Graphical/Mathematical/Computational reps
    PROPERTIES
    CAUSAL RELATIONS
    CO-RELATIONS
    APPLICATION(S)
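The field list above maps naturally onto a record type. The following dataclass is a sketch under that assumption: attribute names paraphrase the slide's headings, the types are guesses, P/NP is read as preferred/non-preferred terms, and the example values for "drainage basin" are invented for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Concept:
    """One item in a concept knowledge base (hypothetical layout)."""
    id: str
    type_and_facet: str
    context: str                                   # knowledge domain
    preferred_terms: List[str]                     # TERM(S), P
    non_preferred_terms: List[str] = field(default_factory=list)  # TERM(S), NP
    descriptions: List[str] = field(default_factory=list)
    historical_origins: List[str] = field(default_factory=list)
    examples: List[str] = field(default_factory=list)
    hierarchical_relations: Dict[str, List[str]] = field(default_factory=dict)
    defining_operations: List[str] = field(default_factory=list)
    scientific_representations: List[str] = field(default_factory=list)
    properties: List[str] = field(default_factory=list)
    causal_relations: List[str] = field(default_factory=list)
    co_relations: List[str] = field(default_factory=list)
    applications: List[str] = field(default_factory=list)


# Invented example values, for illustration only.
basin = Concept(
    id="concept-001",
    type_and_facet="physical feature",
    context="physical geography",
    preferred_terms=["drainage basin"],
    non_preferred_terms=["watershed", "catchment"],
    hierarchical_relations={"broader": ["fluvial landscape"]},
)
```

Keeping the relations as named lists rather than free text is what would let display and composer tools follow links from one concept to another.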
49
Item in concept knowledge base
50
Concept input tool
51
Collections of learning materials
o Lecture/lab composer
    Creates learning materials with
      – Tailorable structure
      – Underlying organization as “forest of trees” of concepts
    Small reusable granules for
      – Easy creation/edit/access/re-use
    Can link in
      – Concepts from concept KB
      – Items from learning object collections
      – Items from lecture collection
52
Current instructional material window
o The left-hand frame displays the structure of the lecture
o The right-hand frame displays the content of the lecture
o ADL icons (globe image) attached to a concept link to a display of concept properties in the concept window
    Other icons attached to a concept link to a display of concept examples in the illustration window
53
View of learning material by concepts
54
Lecture/lab/… composer tool
55
Learning object collections
o Cataloged with tool for metadata creation
    ADN metadata content standard with concept fields
o Use of ADL/ADEPT middleware search services
    E.g., in creation of lecture/lab presentation materials
o Display of collection items in collection window
    Photos, images, maps, text, videos, …
o Support in display window for ADL browser
    Allows dynamic search of collection holdings
56
The illustrations window
57
Evaluation of concept-based approach
o Evaluation of efficacy for student learning
    Do students attain “deeper levels” of understanding?
    Comparison approach to evaluation
o Evaluation of value to instructors/TAs
    UCLA evaluation team
o Evaluation issues
    Instrumenting students’ use of course materials
    Time to assess pedagogic value of approach
58
Example of lessons learned
o Importance of “conceptualizations” of concept
    e.g., characterize concept of Fluvial Landscape with concepts of {River, Watershed}
    Embed conceptualizations in lecture/labs (not in KB)
    Idea of learning materials as trees in concept space
o Construct labs using analogous “lab” composer
    Tailored for lab presentations/work
    Supports logic of using concepts as framework
    Can import material from lecture/other collections
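The idea of learning materials as trees in concept space can be illustrated with a tiny data structure. This is a sketch, not the composer's actual format: the `Node` class and the `trajectory` function are hypothetical, and only the Fluvial Landscape example comes from the slide.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class Node:
    """A concept as used in one lecture, with the concepts that characterize it."""
    concept: str
    children: List["Node"] = field(default_factory=list)


def trajectory(node: Node) -> Iterator[str]:
    """Depth-first walk: the order in which a lecture would visit the concepts."""
    yield node.concept
    for child in node.children:
        yield from trajectory(child)


# A lecture is a "forest": a list of trees, one per top-level concept.
lecture = [
    Node("Fluvial Landscape", [Node("River"), Node("Watershed")]),
]
```

Because the conceptualization lives in the lecture's tree rather than in the knowledge base, a different lecture could characterize Fluvial Landscape with other concepts without changing any shared concept record.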
59
Summary
o DL infrastructure as basis for Learning Environments
    Collections
      – Concept KBs, Lectures, DL objects
    Services
      – Creation/Search/Display
o Evaluation of efficacy of approach
o Community-based development of KBs, Learning Materials, Collections