Semantically-Rich Tools for Text Exploration
Andrew Ashton
Center for Digital Scholarship, Brown University

Center for Digital Scholarship

The Brown University Women Writers Project (WWP) is a long-term research project devoted to early modern women's writing and electronic text encoding. The WWP supports research on women's writing, text encoding, and the role of electronic texts in teaching and scholarship.

The Brown University Scholarly Technology Group (STG) provides advanced technology consulting to Brown humanities faculty, departments, libraries, and research centers. We explore the critical new technologies that are transforming scholarly work and helping to maintain its longevity: data and metadata standards, XML publication tools, text encoding methods, database design, and accessibility standards.

Text Encoding Initiative (TEI) at Brown

300+ WWP texts in TEI P4
Inscriptions, epigraphy, and other texts using TEI variants
Encoding focuses on semantic and contextual data (e.g. genre, text structure, personal and place names)

Examples

Extract and separate a collection of texts by genre, then retrieve genre-specific structures within the text (e.g. poems, dramatic speeches, letters, recipes).
Distill from the selected texts or text pieces the personal names, and separate these by type (references to historical figures, mythological figures, biblical figures; place names; etc.).
Sort the subset of data chronologically.
Pass the data through a component that tokenizes and adds morphosyntactic information to each word.
Generate a visualization for each genre that describes changes in the association of certain adjectives with personal names, differentiated by gender.
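The first three steps above (separating by genre, pulling out typed personal names, and sorting chronologically) can be sketched in Python. This is only an illustration under stated assumptions: the sample document, the `corpus` wrapper, the `genre` and `date` attributes, and the function name `names_by_genre` are all hypothetical and do not reflect the WWP's actual TEI P4 encoding or any SEASR component. The tokenizing and visualization steps are not covered.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical, simplified TEI-like sample. The wrapper element and the
# "genre"/"date" attributes are assumptions made for this sketch only.
SAMPLE = """
<corpus>
  <text genre="poetry" date="1621">
    <lg type="poem"><l>To <persName type="mythological">Diana</persName> I sing</l></lg>
  </text>
  <text genre="drama" date="1613">
    <sp><speaker>Mariam</speaker><l>Now <persName type="historical">Herod</persName> comes</l></sp>
  </text>
  <text genre="poetry" date="1611">
    <lg type="poem"><l><persName type="biblical">Eve</persName> left <placeName>Eden</placeName></l></lg>
  </text>
</corpus>
"""


def names_by_genre(xml_source):
    """Group personal names by genre, with texts visited in date order.

    Returns a dict mapping genre -> list of (year, name type, name) tuples.
    """
    root = ET.fromstring(xml_source)
    grouped = defaultdict(list)
    # Sort the texts chronologically before collecting names.
    texts = sorted(root.iter("text"), key=lambda t: int(t.get("date")))
    for text in texts:
        year = int(text.get("date"))
        # Only personal names are collected; place names are skipped here.
        for name in text.iter("persName"):
            label = "".join(name.itertext()).strip()
            grouped[text.get("genre")].append((year, name.get("type"), label))
    return dict(grouped)


if __name__ == "__main__":
    for genre, names in names_by_genre(SAMPLE).items():
        print(genre, names)
```

Keeping the name type alongside each name leaves the later steps free to filter on historical, mythological, or biblical references without re-reading the markup.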

Project activities

1. Identify an initial set of structural and semantic textual features that have particular significance for literary studies, and examine the ways in which these must be manipulated and processed to support analysis.
2. Develop a test set of about a dozen SEASR components that operate on these features; these will be contributed to the SEASR repository for common use.
3. Develop a set of SEASR “flows” using these components in combination with other SEASR modules, to produce analytical outcomes that address specific scholarly questions, using the WWP collection as a testbed.
4. Distribute these new SEASR resources via an open repository, so that they can be used by other SEASR projects and users.

Thanks!