Intelligent Classifier STI Innsbruck & Excogito User-friendly Semi-Automatic Product Classification System.

People
1. Supervision: Marcus Spies
2. People: Sigurd Harand, Christian Leibold
3. Contact person: Christian Leibold
4. Industrial cooperation with Excogito: Maksym Korotkiy
© STI Innsbruck & Excogito. All Rights Reserved.

Outline
1. Context: Product Classification Problem
2. Project intro, positioning and objectives
3. Workflow driven approach
4. GoldenBullet shooting market
   a) Improved software architecture: Java, XML registries, user taxonomies
   b) Improved (re-)usability and quality
5. Conclusions and Future
6. Online Demo

Product Classification Problem
1. E-catalogs contain thousands of cryptic product descriptions
   1. CAREPAQ BUREAU PROSIGNIA3YRS/SITE/J+1/TEL
   2. TRAINING ACT/ASEEXCEPT TRU64UNIX and OPENVMS
   3. …
2. Businesses have to deal with thousands of e-catalogs
3. Classification standards have tens of thousands of product categories (21192 in UNSPSC 8.04)
4. The result: high manual classification effort is required

GB IC Positioning and Objectives
- Many standards (e.g. UNSPSC, ebXML, GPC, …)
  - ~ classes
  - millions of products
- Current state of the art: outsourcing to low-salary countries, or use of (counterproductive) low-quality software tools with 25% failure rates
- The GoldenBullet 2 research prototype offered an exclusive "semi-automatic" functionality: it supports classification through manual intervention and, by "learning", achieves a classification level of 95% and speeds up the process by up to 60 times
- Developing GB IC into a marketable product will be an innovative creation of added value and will help to reduce outsourcing of labor

Project intro
1. Project won ProIT funding (cooperation between transIT and CAST)
2. Duration: 1st September st August
3. Objectives:
   - Submission of a debugged, robust and marketable GB IC prototype
   - Extended usability and robustness
   - Extended reusability
4. Completed tasks & status:
   - Worked out a contract for handling IPR between the stakeholders (UIBK, Excogito NL, BvW Global Pty), including foundation regulations for marketing and selling
   - 1st report, with the technical specification as deliverable, accepted by CAST and transIT
   - Cooperation with industrial partner Excogito

Workflow Driven Approach
1. GoldenBullet semi-automatically classifies product descriptions into a standard (e.g. UNSPSC) by employing
   1. NLP techniques to preprocess descriptions (stemming)
   2. Clustering methods to generate representative sub-sets of the e-catalog (currently k-means)
   3. Machine learning techniques to train the system and automatically generate ranked classification options (currently Naïve Bayes)
2. The user approves or corrects the proposed classification
3. GoldenBullet constantly learns from the user choices and updates the classification options
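The propose, approve, learn loop above can be sketched in a few lines. This is a minimal illustration, not GoldenBullet's implementation: the class name `IncrementalNaiveBayes`, the crude plural-stripping `tokenize` helper (a stand-in for real stemming) and the category labels in the usage note are all assumptions made for the example.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(description):
    """Lowercase, split on non-alphanumerics, and crudely strip a plural 's'.

    The suffix rule is only a stand-in for real stemming.
    """
    tokens = re.findall(r"[a-z0-9]+", description.lower())
    return [t[:-1] if len(t) > 3 and t.endswith("s") else t for t in tokens]


class IncrementalNaiveBayes:
    """Multinomial Naive Bayes that is updated one approved example at a time."""

    def __init__(self):
        self.class_counts = Counter()             # examples seen per category
        self.token_counts = defaultdict(Counter)  # token frequencies per category
        self.vocabulary = set()

    def learn(self, description, category):
        """Record one user-approved (description, category) pair."""
        tokens = tokenize(description)
        self.class_counts[category] += 1
        self.token_counts[category].update(tokens)
        self.vocabulary.update(tokens)

    def rank(self, description, top_n=3):
        """Return up to top_n categories, most probable first."""
        tokens = tokenize(description)
        total = sum(self.class_counts.values())
        # One extra vocabulary slot is reserved for tokens never seen in training.
        vocab_size = len(self.vocabulary) + 1
        scores = {}
        for category, count in self.class_counts.items():
            log_prob = math.log(count / total)
            denom = sum(self.token_counts[category].values()) + vocab_size
            for token in tokens:
                # Add-one smoothing keeps unseen tokens from zeroing the score.
                log_prob += math.log((self.token_counts[category][token] + 1) / denom)
            scores[category] = log_prob
        return sorted(scores, key=scores.get, reverse=True)[:top_n]
```

In use, each approval or correction by the user becomes a `learn` call, and the next `rank` call already reflects it; for example, after learning a few descriptions under hypothetical labels such as "training services", `rank("TRAINING ACT/ASEEXCEPT TRU64UNIX and OPENVMS")` returns the labels ordered by estimated probability.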

Architecture
Mapping the workflow to functional modules:
- Separation of concerns
- Workflow support to be implemented in the GUI

Architecture
Enhanced usability and robustness:
- Sort and search functions for both the catalogue and the classification schema
- Multi-language GUI and contextual help system
- Support for catalogue sizes of up to 10^6
- Action logging enables undo/redo for classification and user workflow
- Implementation of strategies for the avoidance of over-fitting
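How action logging can enable undo/redo is easiest to see with a two-stack log. The sketch below is an assumption about one common way to do it, not a description of the GB IC code; the class `ActionLog`, its method names, and the use of `None` for "not yet classified" are hypothetical.

```python
class ActionLog:
    """Two-stack action log: undo pops from `done`, redo pops from `undone`."""

    def __init__(self):
        self.assignments = {}  # product id -> category
        self.done = []         # logged actions: (product_id, previous, new)
        self.undone = []       # actions that were undone and can be redone

    def classify(self, product_id, category):
        """Assign a category and log the action together with the old value."""
        previous = self.assignments.get(product_id)  # None means unclassified
        self.assignments[product_id] = category
        self.done.append((product_id, previous, category))
        self.undone.clear()  # a fresh action invalidates the redo history

    def _set(self, product_id, category):
        if category is None:
            self.assignments.pop(product_id, None)
        else:
            self.assignments[product_id] = category

    def undo(self):
        """Revert the most recent action; return False if there is none."""
        if not self.done:
            return False
        product_id, previous, new = self.done.pop()
        self._set(product_id, previous)
        self.undone.append((product_id, previous, new))
        return True

    def redo(self):
        """Re-apply the most recently undone action; return False if there is none."""
        if not self.undone:
            return False
        product_id, previous, new = self.undone.pop()
        self._set(product_id, new)
        self.done.append((product_id, previous, new))
        return True
```

Because every entry stores both the previous and the new value, the same log could also serve as an audit trail of user decisions.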

Architecture
Enhanced reusability:
- Software can be deployed in a Java Enterprise Edition Application Server (e.g. Tomcat, all major vendors)
- The Java EE XML Registry is used for storing and accessing classification schema data
- Enables customer catalogue taxonomies to be stored and exchanged over a common format
- Documentation (SW design, user guide, feature list), JUnit, JavaDoc

Conclusions and Future
1. GoldenBullet is a semi-automatic product classification system that significantly reduces e-catalog classification effort
2. GoldenBullet IC considerably improves the (re-)usability and robustness of the system
3. In future we aim at:
   1. Implementation and validation of the technical specification
   2. Generation of awareness (transIT)
   3. Evaluation of further (possibly new) options for marketable exploitation

Online Demo
Questions so far?

Thank you!
Further questions?

Backup
The following slides are provided in case no internet connection is available or the demo is not reachable.

GoldenBullet IC GUI Outline
1. Wizards
   1. Data Import/Export
   2. Simple and Expert Training
   3. Classification
2. E-Catalog and UNSPSC Browsers

“CI” Style
GoldenBullet IC has an integrated GUI style and a consistently designed, brand-like interface.
- Recognition as a product
- Usability through commonly used symbols

Data Import/Export Wizards

E-Catalog Browser

Expert Training
An automatically created representative sub-catalog is provided to the user for semi-automatic classification.
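One way such a representative sub-catalog could be derived is to cluster the descriptions with k-means (the clustering method the deck names) and keep the item nearest to each centroid. The sketch below rests on several assumptions that are not from the slides: the bag-of-words vectors, the deterministic "first k items" seeding, the nearest-to-centroid selection rule, and the function names.

```python
import re
from collections import Counter


def vectorize(description):
    """Bag-of-words token counts for one description."""
    return Counter(re.findall(r"[a-z0-9]+", description.lower()))


def squared_distance(a, b):
    """Squared Euclidean distance between two sparse count vectors."""
    return sum((a.get(t, 0) - b.get(t, 0)) ** 2 for t in set(a) | set(b))


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return {t: c / len(vectors) for t, c in total.items()}


def representative_subset(descriptions, k, iterations=10):
    """Cluster with k-means and return the description closest to each centroid.

    Expects iterations >= 1.
    """
    vectors = [vectorize(d) for d in descriptions]
    # Deterministic start: the first k items seed the centroids (an assumption;
    # k-means is more commonly seeded randomly or with k-means++).
    centroids = [dict(v) for v in vectors[:k]]
    for _ in range(iterations):
        # Assignment step: attach every item to its nearest centroid.
        clusters = [[] for _ in centroids]
        for index, v in enumerate(vectors):
            nearest = min(range(len(centroids)),
                          key=lambda c: squared_distance(v, centroids[c]))
            clusters[nearest].append(index)
        # Update step: move each centroid to the mean of its members;
        # an empty cluster keeps its previous centroid.
        centroids = [
            mean_vector([vectors[i] for i in members]) if members else centroids[c]
            for c, members in enumerate(clusters)
        ]
    picks = []
    for c, members in enumerate(clusters):
        if members:
            picks.append(min(members,
                             key=lambda i: squared_distance(vectors[i], centroids[c])))
    return [descriptions[i] for i in picks]
```

The user would then classify only the returned descriptions, and those decisions train the classifier for the rest of the catalog.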

Classification
Automatically created classification options are proposed to the user for approval.

UNSPSC Browser
The browser allows the user to locate an appropriate UNSPSC category and manually assign it to a product description.
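Browsing works because UNSPSC codes are hierarchical: an 8-digit code is read as four two-digit levels (segment, family, class, commodity), and a parent is obtained by zero-padding the shorter prefix. The helper below is a hypothetical sketch of how a browser might compute the path to a category; the function name and the sample code used in the usage note are illustrative only.

```python
def unspsc_levels(code):
    """Split an 8-digit UNSPSC code into its four hierarchy levels."""
    if len(code) != 8 or not (code.isascii() and code.isdigit()):
        raise ValueError("expected an 8-digit UNSPSC code")
    return {
        "segment": code[:2] + "000000",    # top level
        "family": code[:4] + "0000",
        "class": code[:6] + "00",
        "commodity": code,                 # most specific level
    }
```

For a code such as `43211503`, this yields the path `43000000`, `43210000`, `43211500`, `43211503`, which is the sequence of tree nodes a browser would expand to reach the category.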