CLARIN Language Resources Switchboard in CLARIAH

Slides:



Advertisements
Similar presentations
Batches, Buckets and Bookbags Elizabeth B. Thomsen NOBLE: North of Boston Library Exchange EGILS2014.
Advertisements

Example-Based Treebank Querying Liesbeth Augustinus Vincent Vandeghinste Frank Van Eynde CLARIN Sofia,
Example queries for Federated search Jan Odijk CLARIN Federated Search Workshop Copenhagen, 24 Apr
Interoperability aspects in the The Virtual Language Observatory Dieter Van Uytvanck Max Planck Institute for Psycholinguistics
Sustainable Procurement Gerie Schonewille & Rainier Wiering.
Linguistic Research with PaQu Jan Odijk, Utrecht University Small Experiment (was intended as a user test) Take all Dutch CHILDES corpora Select all adult.
Tutorial 8 Sharing, Integrating and Analyzing Data
Introduction to JavaScript. Aim To enable you to write you first JavaScript.
CLARIN (NL PART): Current State and Near Future Jan Odijk Digital Humanities Summer School Leuven,
CLARIN for Linguists Introduction Jan Odijk LOT Summerschool Nijmegen,
Dr. C. Wrandle Barth ADNET Systems October 21, 2010.

1 CLARIN - NL Language Resources and Technology Infrastructure for the Humanities in the Netherlands Jan Odijk Utrecht 28 June 2010.
Linguistics with CLARIN Concluding Overview Jan Odijk LOT Winterschool Amsterdam,
Linguistics with CLARIN Introduction Jan Odijk LOT Winterschool Amsterdam,
1 CLARIN - NL Language Resources and Technology Infrastructure for the Humanities and the Social Sciences in the Netherlands.
Retrieving and Processing Transparencies during a Conference Michaela Marx, DESY JACoW Team Meeting, November 2009, Hamburg, Germany.
CODA – CATCHPlus Open Document Annotation Hennie Brugman OAC II Project Review meeting Chicago – July 26-27, 2012.
Populating the infrastructure the case of the Netherlands Hans Bennis executive board of CLARIN-NL Meertens Institute (KNAW) CLARIN COORDINATORS BUDAPEST,
Common Lab Research Infrastructure for the Arts and Humanities CLARIAH Jan Odijk EuroRisNet+ Workshop, Lisbon,
© Anselm Spoerri Web Design Information Visualization Course Prof. Anselm Spoerri
Linguistics with CLARIN Storing resources in CLARIN Jan Odijk LOT Winterschool Amsterdam,
CLARIN for Linguists Portal & Searching for Resources Jan Odijk LOT Summerschool Nijmegen,
Walk through the reporting process for Barcelona Convention using Reportnet Miruna Badescu, Giuseppe Aristei.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
6 th Annual Focus Users’ Conference 6 th Annual Focus Users’ Conference Import Testing Data Presented by: Adrian Ruiz Presented by: Adrian Ruiz.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
1 CLARIN - NL What is going on? Jan Odijk Amsterdam 26 Aug 2010.
Procurement Query Login Using Mail User & Password.
ID Mapping to accessions from different databases. COST Functional Modeling Workshop April, Helsinki.
Zet GeoICT aan het werk! Ruimte voor bodem Andreas Hoogeveen 12 november 2015.
PARSEME Alpino MWE Encoding Jan Odijk PARSEME Meeting Iasi,
April , 2006 HEASARC Users Group Tom McGlynn The HEASARC On-line Services Tom McGlynn.
WebDat: A Web-based Test Data Management System J.M.Nogiec January 2007 Overview.
HTML HYPER TEXT MARKUP LANGUAGE. INTRODUCTION Normal text” surrounded by bracketed tags that tell browsers how to display web pages Pages end with “.htm”
Prizms for Data Publication and Management May 9, 2014 Katie Chastain.
Using PaQu for language acquisition research Jan Odijk CLARIN 2015 Conference Wroclaw,
Search and Annotation Tool for Oral History INTER-VIEWS Henk van den Heuvel, Centre for Language and Speech Technology (CLST) Radboud University Nijmegen,
Chapter 8 Adding Multimedia Content to Web Pages HTML5 & CSS 7 th Edition.
© NCSR, Frascati, July 18-19, 2002 CROSSMARC big picture Domain-specific Web sites Domain-specific Spidering Domain Ontology XHTML pages WEB Focused Crawling.
1 New Perspectives on Access 2016 Module 8: Sharing, Integrating, and Analyzing Data.
Audio-visual resources Software applications Services to do:
Dr. Barry Norton, Development Manager, ResearchSpace*
Eclipse EHX System Diagnostic tools
Journal of Mountain Science
Service Registration Scenario 3 _ E-Pack
Regulatory Genomics Lab
Using SHOPFLOOR with QUALITY control
Converting word, excel, and powerpoint into html docs
Eform Generator.
TE004 Smart Change Management with Sage CRM Component Manager
Jan Odijk Birmingham, Corpus and Computational Linguistic Methods and Tools beyond corpus linguistics in CLARIAH Jan Odijk Birmingham,
Guides to Reviewerss Journal of Mountain Science Guides to Reviewerss
3 Dash Web Overview.
AASCIF STATBOOK Keeping it Relevant What can the data do for you?
Part of the Multilingual Web-LT Program
Data Upload & Management
ICEweb 2 a new way of compiling high-quality web-based components for ICE corpora Martin Weisser Center for Linguistics & Applied Linguistics, Guangdong.
Code Analysis, Repository and Modelling for e-Neuroscience
Jan Odijk LREC Miyazaki
Search in Token-annotated Corpora Search in Treebanks
Practical work on NetCDF - CFPOINT
Regulatory Genomics Lab
Metadata used throughout statistics production
Regulatory Genomics Lab
Tutorial 8 Sharing, Integrating, and Analyzing Data
D3.1 Accessibility Statement Generator
Speaking the language of publishing. Worldwide
Presentation plan Accessing and Retrieving SDMX data
Presentation transcript:

CLARIN Language Resources Switchboard in CLARIAH Jan Odijk CLARIAH Techdag 2017-10-06

What is possible already

Text of the message De Volkskrant, 5 oktober 2017 Nieuwe coalitie koerst aan op 1,5 miljard extra voor defensie De nieuwe coalitie gaat jaarlijks ongeveer 1,5 miljard euro extra uittrekken voor defensie. Het gaat om een beginbedrag, mogelijk groeien de uitgaven nog meer. Dat bevestigen bronnen rondom de formerende coalitiepartijen VVD, CDA, D66 en ChristenUnie.

Upload file (bericht.txt) to CLRS

File uploaded

Show Tools

Select NLP Suite for Dutch Benadrukken: komt meer van tegen eind programma a;s meer gebouwd en meer te dissemineren.

Click and Run The Tool

(Additional Parameters) and Tool is running

Output

What is not yet possible Enter with a (zipped) collection of HTML or Word files OpenConvert is suggested, converted to FoLiA, NLP suite is suggested Enriched with linguistic annotations AutoSearch is suggested, and the researcher can search in and analyse his/her own corpus Etc for all data types and tools in CLARIAH

Which Tools? General: any tool that operates on user-supplied input data WP2: Anansi (Excel, CVS, Dataperfect) WP3: Adelheid-Visualiser, Autonomata-tool, AutoSearch upload, COREA, FROG, INPOLDER, NameScape-NER, OpenConvert, PaQu upload, GrETEL upload, @PhilosTEI, TICClops, TQE, TTNWW, UCTO, PICCL? (and other Nijmegen web applications) and some others WP4: DataLegend: Qber (CSV, Excel), GRLC (SPARQL query), BRWSR(?), Inspector(?), Yasgui, DRUID WP5: Mediasuite components: Collection analyser, Collection selector(?), various players, annotation: commenting, classifying, linking and older standalone versions of the tools (e.g. AVResearcherXL)

Background Materials SwitchBoard spec: https://office.clarin.eu/v/CE-2015-0684- LR_switchboard_spec.pdf Stand-alone version of the CLRS: http://weblicht.sfs.uni-tuebingen.de/clrs/#/ Example of CLRS in the VLO: https://vlo.clarin.eu/record?3&docId=urn_58_cts_58_pbc_58_bible.parallel.ces.k ralicka_58_&fqType=format:or&fq=format:application/tei%2Bxml&index=7&coun t=20 Go to the  tab Resources and click on the three dots to the right of the resource CLRS registry: https://github.com/clarin- eric/LRSwitchboard/blob/master/app/back-end/Registry.js Presentation by Claus Zinn at CLARIN 2016: https://www.clarin.eu/sites/default/files/08%20-%20ZINN-Lg-Sw-Board.pdf Abstract by Claus Zinn at CLARIN 2017: http://www.clarin.eu/content/abstracts- overview-clarin-annual-conference-2017#I