--Caesar Cai TEXT RECOGNITION SENIOR CAPSTONE 2012.

Slides:



Advertisements
Similar presentations
Close Reading A revision guide to question types.
Advertisements

August OBJECTIVES  Identify the Reading Stages.  Explore some Reading Strategies.  Get familiar with some tools to practice reading.
What you should understand about buying a mobile phone Developed By by.
A Low-cost Attack on a Microsoft CAPTCHA Yan Qiang,
Finding information: Engineering and Computing Sciences Nicola Conway October 2011.
Finding resources for your project: Computing Sciences Nicola Conway June 2011.
Finding resources for your project: Engineering Nicola Conway October 2011.
Finding Accurate and Reliable Health Information on the Internet.
Jeff Yan School of Computing Science Newcastle University, UK (Joint work with Ahmad Salah El Ahmad) Usability of CAPTCHAs Or “usability issues in CAPTCHA.
IRead Team Members: Matt Cardin, Paul Kennedy, Oscar Perez.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
California Car License Plate Recognition System ZhengHui Hu Advisor: Dr. Kang.
Human Computation CSC4170 Web Intelligence and Social Computing Tutorial 7 Tutor: Tom Chao Zhou
WebInSight: Making Web Images Accessible Jeff Bigham Richard Ladner Ryan Kaminsky Gordon Hempton Oscar Danielsson.
ABC’s of PowerPoint (Office 2007) Part 1: Basic Vocabulary Part 2: Cursors Part 3: Insert Your Text Part 4: Insert Your Pictures Part 5: Basic Tools &
CPSC 203 Introduction to Computers T03 & T29 by Jie (Jeff) Gao.
Kate Gregory | Gregory Consulting James McNellis | Senior Engineer, Visual C++
Julia Bauder, Grinnell College & Jenny Emanuel, University of Illinois Be Where our Faculty Are: Emerging Technology Use and Faculty Information Seeking.
Research Problem. Outline 1. Learn how to define a research problem in CS field.
ONLINE DATA STORAGE & DOCUMENTS Lesson 3. Lesson 3 – Online documents In this lesson we will be covering:  Online documents  Compression and expansion.
Evaluating the use of OCR on a Mobile Device Presented by : Hamed Alharbi Supervisor by :Dr Brett Wilkinson.
PowerPoint: Clip Art and Pictures Computer Information Technology – Section 5-5 Some text and examples used with permission from:
Name 1 (Degree), name 2 (Degree), name 3 (Degree), and Name 4 (Degree) (others as needed) Objective: The Agricultural and Biological Engineering Department.
OCR GCSE ICT DATA CAPTURE METHODS. LESSON OVERVIEW In this lesson you will learn about the various methods of capturing data.
Using Technology with Atypical Students Rob Plaskett, SET-BC Consultant
CS 450 MODELING AND SIMULATION Instructor: Dr. Xenia Mountrouidou (Dr. X)
Level 2 IT Users Qualification – Unit 1 Improving Productivity
Level 2 IT Users Qualification – Unit 1 Improving Productivity Jordan Girling.
Creating an Online Portfolio. Why will making an online portfolio help me? Because you will be able to access it from any computer or internet linked.
--Caesar Cat.  Write an optical character recognition application that identifies and recognizes printed text within an image.
WWW Forms and Search. Forms URL - always fetch a particular page What if the information we want varies from time to time and from user to user?
Unit 2, cont. September 12 More HTML. Attributes Some tags are modifiable with attributes This changes the way a tag behaves Modifying a tag requires.
9 th Grade Advisory Making The Most Of High School Fall 2012.
An-Najah National University Faculty Of Engineering Computer Engineering Department Abed Al-hadi kulib.
1 Graphics CSCI 343, Fall 2015 Lecture 1 Introduction to Graphics Read: Chapter 1 of textbook.
Writing Papers in the Biological Sciences An introduction to Biological research and paper writing.
Ideas and techniques to help you get the most from your revision time.
Communicating for Results Seventh Edition Cheryl Hamilton, Ph.D.
Intro to Scanners. A scanner works by creating a digital image. When you scan a document, you are making a picture of it. This digital image can be used.
Welcome to our Media Center at Riverside Primary! Orientation for Students and Teachers.
DAY 2: TIPS ON SEARCHING WISELY Tazin Afrin August 22,
Lecture 11 Data Structures, Algorithms & Complexity Introduction Dr Kevin Casey BSc, MSc, PhD GRIFFITH COLLEGE DUBLIN.
Clustering Prof. Ramin Zabih
Tips to Create a Successful, Multipage WebQuest Liz Barr.
STAGE 2 IMAGE MANIPULATION.. STAGE 2 IMAGE MANIPUALTION Good image manipulation is the key to transferring your selected images/images successfully onto.
1 2/22/05CS120 The Information Era Chapter 4 Basic Web Page Construction TOPICS: Images and placing pages on the server.
Design and printing instructions: This document is set to the correct size - please do not change the dimensions on the File/Page Setup menu. Finished.
Overview In this tutorial you will: learn what an e-portfolio is learn about the different things e-portfolios may be used for identify some options for.
Why Visit a College or University you are interested in attending?
By: Steven Baker.  What is a CAPTCHA?  History of CAPTCHA  Applications of CAPTCHAs  Accessibility  Examples of CAPTCHAs  reCAPTCHA  Vulnerabilities.
APPULATE OPTICAL CHARACTER RECOGNITION (OCR). OCR Defined: The mechanical or electronic conversion of images of typed, handwritten* or printed text into.
POSTER TEMPLATE BY: Background Objectives Psychophysical Experiment Photo OCR Design Project Pipeline and outlines ❑ Deep Learning.
S OFTWARE FOR PARKING MANAGEMENT USING RFID TECHNOLOGY AND CAMERA Supervisor: Phan Duy Hùng Group Member: Trần Văn Phát Phạm Quang Hào Nguyễn.
License Plate Recognition of A Vehicle using MATLAB
Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.
CSC 241: Introduction to Computer Science I
Rotoscoping Senior Capstone Project | Ted Trisco
Vocabulary byte - The technical term for 8 bits of data.
Lainie Chang Grand Canyon University January 20, 2010 TEC 542
Create your Benner - intro
Southwestern Community College Holt Library…
Backup Car Camera Derek Wachowski.
Sign in with fingerprint or facial recognition
Optical Character Recognition
Fighting the WebBots A webbot is a program that visits web sites for all kinds of purposes. For example, Google webbots make copies of all web sites for.
2016 CSUN Accessibility Competition
CSC 241: Introduction to Computer Science I
Quick and Dirty: the art of OCR
International Conference on Physical Education and Sports Science
Logo Logo Advanced Research Techniques in Chemical Engineering
Presentation transcript:

--Caesar Cai TEXT RECOGNITION SENIOR CAPSTONE 2012

DEFINITION AND REQUIREMENT Write an optical character recognition application that identifies and recognizes printed text within an image Investigate existing algorithms and libraries. Use the Carleton College Computer Science Comps Project 2010 as a starting point. Initially, try black text on a white background. Design a uniform API so that you can plug in alternative OCR functions. Evaluate the effectiveness of your OCR compared to existing algorithms. Develop an application that employs augmented reality for text within an image (e.g geo-tag state park signs, license plates, campus building signs,..)

WHAT I DID An C# application using an open source OCR engine to identify printed text within an image.

WHAT I DID NOT DO The OCR function Some good ideas that I cannot achieve. (For example, Use a dictionary to correct words).

START WITH … 2010 Carleton College Computer Science Comps Project A lot internet researches Hello World with couple different OCR engines

OCR ENGINES Microsoft MODI Free Need to Install with Office 2003 Tesseract by Google Open Source AspriseOCR Good, but expensive

SIMPLE INTERFACE

FAST OR FULL Fast Full OR

STEPS FOR A FAST RECOGNITION Cut Lines Cut Words Send Char to OCR Engine Using Returned Chars to Produce the Text.

CUT LINES If a line of pixel has more than 98% of white pixels, it is a empty line. If there are three empty lines together, this is a gap between two lines of text. For efficiency, check one line out of three when it is not empty line. If find an empty line, the check whether its neighbors are empty lines, too.

CUT LINES

CUT WORDS 100% <20%>20% Not White Space White Space

CUT WORDS

STEPS OF IDENTIFYING TEXT W/ PICTURE Range the Text Area Pick the Text Color Denoising

RANGE THE TEXT AREA

PICK THE TEXT COLOR

SOME ISSUES (WELL, A LOT OF ISSUES)

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii

WATCH A DEMO

BEST LEARNING TECHNIQUES Internet – Online tutorial website, Google Professors Textbooks

MOST HELPFUL CS CLASSES Event Programming (C#) Theory of Computation (Algorithm)

ADVICE FOR NEXT YEAR’S SENIORS Talk with your Professors and classmates Work on regular schedule Don’t afraid to ask questions

ANY QUESTIONS