Samad Paydar WTLab Research Group Ferdowsi University of Mashhad LD2SD: Linked Data Driven Software Development 24 th February.

Slides:



Advertisements
Similar presentations
BI Web Intelligence 4.0. Business Challenges Incorrect decisions based on inadequate data Lack of Ad hoc reporting and analysis Delayed decisions.
Advertisements

Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
© 2008 RightNow Technologies, Inc. Title Best Practices for Maintaining Your RightNow Knowledge Base Penni Kolpin Knowledge Engineer.
Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.
SE 555 Software Requirements & Specification Requirements Management.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Course Instructor: Aisha Azeem
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Integrating Complementary Tools with PopMedNet TM 27 July 2015 Rich Schaaf
Hipikat: A Developer’s Recommender Davor Cubranic, Gail Murphy, and Kelly Booth University of British Columbia.
TEAM FOUNDATION SERVER (TFS) By Sunny Niranjana Devi. M.
Enterprise Systems & Architectures. Enterprise systems are mainly composed of information systems. Business process management mainly deals with information.
Sage CRM Developers Course
Ricerca Distribuita Semantica Protocolli opensource per la condivisione di risorse online.
Valma Technical Aspects
1 Open Source – åpen kildekode Understanding an open source project.
Safety Driven Performance Conference 2013 Capstone RBMI spotlight: software roadmap and user advisory council Oswaldo Rodriguez Deputy Product Manager.
Software Engineering Modern Approaches
Free Open-Source, Open- Platform System for Information Mash-Up and Exploration in Earth Science Tawan Banchuen, Will Smart, Brandon Whitehead, Mark Gahegan,
RUP Implementation and Testing

Presented by Abirami Poonkundran.  Introduction  Current Work  Current Tools  Solution  Tesseract  Tesseract Usage Scenarios  Information Flow.
SWWG PROJECT OVERVIEW Semantic Technologies for Integrating USGS Data.
© 2008 IBM Corporation ® Atlas for Lotus Connections Unlock the power of your social network! Customer Overview Presentation An IBM Software Services for.
1 G4MICE Design Iteration Malcolm Ellis MICE Video Conference 21 st April 2004.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Adaptive Hypermedia Tutorial System Based on AHA Jing Zhai Dublin City University.
Samad Paydar Web Technology Lab. Ferdowsi University of Mashhad 10 th August 2011.
INFO 637Lecture #101 Software Engineering Process II Review INFO 637 Glenn Booker.
Intermediate 2 Software Development Process. Software You should already know that any computer system is made up of hardware and software. The term hardware.
Rapid Prototyping of Semantic Mash-Ups through Semantic Web Pipes Danh Le-Phuoc, Axel Polleres, Manfred Hauswirth, Giovanni Tummarello 1, Christian Morbidoni.
Introduction to Dialogue Systems. User Input System Output ?
Exploitation of Semantic Web Technology in ERP Systems Amin Andjomshoaa, Shuaib Karim Ferial Shayeganfar, A Min Tjoa (andjomshoaa, skarim, ferial,
11 Version Control Systems Mauro Jaskelioff (originally by Gail Hopkins)
Effort.vs. Software Product “Quality” Effort Product “Quality” Which curve? - linear? - logarithmic? - exponential?
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Virtual techdays INDIA │ 9-11 February 2011 virtual techdays Data grail: Data Market on Windows Azure Sudhindra Kovalam │ Developer, Icertis Inc.
Service Brokering Yu-sik Park. Index Introduction Brokering system Ontology Services retrieval using ontology Example.
Handling Semantic Data for Software Projects Data Management CSE G674 – SW Engineering Project.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
1 DMS-DQS-SUPSC03-PRE-12-E © DEIMOS Space S.L., 2007 A Semantic Data Grid for Satellite Mission Quality Analysis Reuben Wright Deimos Space.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
1 Management Information Systems M Agung Ali Fikri, SE. MM.
| See the possibilities… ePace Support Process Review Fusion 08 Reece Abreo.
Developers Users Committers How do I configure this now? Just one more fix and I am done! CVS Download/Use Software Submit problems/ request features Store.
datalibweb – Stata module to access micro data
David Hatten Developer, UrbanCode 17 October 2013
Software Hardware refers to the physical devices of a computer system.
Regression Testing with its types
PLM, Document and Workflow Management
Cloud based linked data platform for Structural Engineering Experiment
Architecture Concept Documents
Harnessing the Semantic Web to Answer Scientific Questions:
Chapter 18 Maintaining Information Systems
Web Software Model CS 4640 Programming Languages for Web Applications
Software Documentation
Contributing to Open Source Projects
Big Data Quality the next semantic challenge
Textbook Engineering Web Applications by Sven Casteleyn et. al. Springer Note: (Electronic version is available online) These slides are designed.
Engineering Processes
Chapter 5 Designing the Architecture Shari L. Pfleeger Joanne M. Atlee
Automated Testing and Integration with CI Tool
Chapter 11: Software Configuration Management
LOD reference architecture
Chapter 5 Architectural Design.
September 12-14, 2018 Raleigh, NC.
Linked Data Ryan McAlister.
Chapter 2: Building a System
Building a “System” Moving from writing a program to building a system. What’s the difference?! Complexity, size, complexity, size complexity Breadth.
Presentation transcript:

Samad Paydar WTLab Research Group Ferdowsi University of Mashhad LD2SD: Linked Data Driven Software Development 24 th February 2010

All the material put in this slide is gathered from publications of DERI research lab accessible on the web. Some references: cases cases 2

Outline Introduction LD2SD Implementation Conclusion 3

Introduction 4 There are different software artifacts involved in software development life cycle Specifications Test data Source code Bug reports Feature requests Discussion forums Version control Configuration management s ….

5

Introduction Therefore, information about a software project are stored in number of heterogeneous, closely related and interdependent datasets These datasets are logically interconnected, but not physically Interconnection is implicit, not explicit Valuable knowledge is hidden inside these datasets 6

Introduction A thread in the discussion forum focuses on a special module It leads to a feature request Several s are communicated between development staff Modifications are made on current code New Java classes are added New unit tests Several people might be involved Documentation must be updated Different people are involved 7

Introduction It is required to make the links between software artifacts and people explicit Also to link them to data on the Web (e.g. discussion forums) 8

LD2SD LD2SD is: a light-weight Semantic Web methodology for turning software artifacts into linked data This explicit representation makes new scenarios possible 9

LD2SD Finding an expert Jim is a software project manager. He needs to find a developer in his team with a special expertise and experience. E.g. finding a developer with experience in parser development which has been involved in the last year projects and no bug is reported for code he has written 10

LD2SD Bug tracking issues not fixed in due time Jim wants to know if all the issues due yesterday have been fixed and which packages are affected. 11

LD2SD Find developer replacement Jim needs to find a developer to be replaced with Mary. He needs to analyze Mary’s expertise and latest activities: Assigned bugs Committed code Mailing list and blog posts And finally he wants to find a developer whose CV matches Mary’s expertise 12

LD2SD LD2SD methodology Assign URIs to all entities in software artifacts and convert to RDF representations based on the linked data principles, yielding LD2SD datasets Use semantic indexers, e.g. Sindice, to index the LD2SD datasets Use semantic pipes, e.g. DERI pipes, allowing to integrate, align and filter the LD2SD datasets Deliver information to end-users integrated in their preferred environments 13

14

LD2SD LD2SD datasets can be linked to LOD datasets such as Dbpedia and Revyu It enables the reuse of existing information in the software development process 15

LD2SD LD2SD allows us to integrate, view, and filter the data But one problem: Updating the original software artifacts Current linked data is read-only A recently launched project pushback aimed at a read/write Semantic Web We are confident to adequately address this issue in the near future 16

LD2SD Implementation Implementation 3 layers 1. Data layer 2. Integration layer 3. Interaction layer 17

LD2SD Implementation “Sindice software project” as the reference software project A list of candidate software artifacts 18

Data layer RDFication and Interlinking 19

Data layer 20

Data layer 21

Data layer 22

Data layer 23

Integration Layer DERI pipes are used to build RDF-based mashups. They allow to fetch documents from different sources, merge them and operate on them. 4 steps: 1. Fetch the RDF representation of the artifacts using the RDF Fetch operator 2. Merge the datasets using a Simple Mix operator 3. Query the resulting, integrated dataset with SPARQL 4. Apply XQuery in order to sort and format the dta from the previous step The output of the implemented pipe is then accessible via an URI 24

Integration Layer 25

26

Interaction Layer Handles the interaction between the integrated data and the end-users such as developers Semantic Widgets are used 27

LD2SD Plug-in A plug-in is implemented for Eclipse IDE Enables developers to find related information about software artifacts without leaving their development environment 28

LD2SD Plug-in 29

Evaluation 12 participants with 1-5 years development experience Were asked to carry out a set of tasks in two ways: Manual Approach, and Plug-in Approach Identify all blog posts that mention a specific Java class Identify all bugs that have been fixed by modifying a specific Java class Identify all developers that are working on a Java package Identify all blog posts that mentions a specific Java class Identify all bugs that belong to a specific Java package 30

Evaluation Results 31

Conclusion Introduced linked data approach in software development paradigm The idea is to make implicit links between software artifacts explicit and expose them using RDF Provide valuable information to end users by aggregating information from different interconnected software artifacts 32

Future Work Implement further use cases Improve the interlinking among LD2SD datasets 33