Selecting and Combining Tools F. Duveau 02/03/12 F. Duveau 02/03/12 Chapter 14.

Slides:



Advertisements
Similar presentations
1 Copyright © 2002 Pearson Education, Inc.. 2 Chapter 1 Introduction to Perl and CGI.
Advertisements

PowerPoint presentation of first 25 pages of instructional manual Edith Fabiyi Essentials of Internet Access.
Networking Problems in Cloud Computing Projects. 2 Kickass: Implementation PROJECT 1.
COEN 445 Communication Networks and Protocols Lab 4
Languages for Dynamic Web Documents
Direct Congress Dan Skorupski Dan Vingo 15 October 2008.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 8: Implementing and Managing Printers.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Words & Definitions By: Naftaly Garcia Birruete. Address Bar  The space provided on a web browser that shows the addresses of websites.
Software and Multimedia
INTRO TO MAKING A WEBSITE Mark Zhang.  HTML  CSS  Javascript  PHP  MySQL  …That’s a lot of stuff!
Chapter 6: Hostile Code Guide to Computer Network Security.
Chapter 9 Working with Forms. Principles of Web Design 2nd Ed. Chapter 9 2 Principles of Web Design Chapter 9 Objectives Understand how forms work Understand.
Managing Client Access
Module 4 Managing Client Access. Module Overview Configuring the Client Access Server Role Configuring Client Access Services for Outlook Clients Configuring.
Lecturer: Ghadah Aldehim
INTRODUCTION TO WEB DATABASE PROGRAMMING
Computer Concepts 2014 Chapter 7 The Web and .
1 Creating Web Forms in HTML Web forms collect information from customers Web forms include different control elements including: –Input boxes –Selection.
About Dynamic Sites (Front End / Back End Implementations) by Janssen & Associates Affordable Website Solutions for Individuals and Small Businesses.
Student Learning Environment on the World Wide Web l CGI-programming in Perl for the connection of databases over the Internet. l Web authoring using Frontpage.
© 2012 LogiGear Corporation. All Rights Reserved Robot framework.
London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.
London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.
By: Joshua O’Donoghue. Operating System Interface In order to interact with the operating system in python you will want to become familiar with the OS.
Learningcomputer.com SQL Server 2008 Configuration Manager.
Python’s Standard Library - Part I Josh Lawrence.
GNU Compiler Collection (GCC) and GNU C compiler (gcc) tools used to compile programs in Linux.
Web Engineering we define Web Engineering as follows: 1) Web Engineering is the application of systematic and proven approaches (concepts, methods, techniques,
Guide to Linux Installation and Administration, 2e1 Chapter 7 The Role of the System Administrator.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
1 Welcome to CSC 301 Web Programming Charles Frank.
Web Database Programming Week 7 Session Management & Authentication.
Guide to Linux Installation and Administration, 2e1 Chapter 11 Using Advanced Administration Techniques.
Information Retrieval and Web Search Crawling in practice Instructor: Rada Mihalcea.
Web Server.
File Transfer Protocol (FTP) FTP host stores files Client logs into host Client program sends command to get a file FTP host downloads the file with error.
Module: Software Engineering of Web Applications Chapter 2: Technologies 1.
Website design and structure. A Website is a collection of webpages that are linked together. Webpages contain text, graphics, sound and video clips.
 Packages:  Scrapy, Beautiful Soup  Scrapy  Website  
CIT 590 Intro to Programming Files etc. Agenda Files Try catch except A module to read html off a remote website (only works sometimes)
Internet addresses By Toni Grey & Rashida Swan HTTP Stands for HyperText Transfer Protocol Is the underlying stateless protocol used by the World Wide.
By: Aradhya Malhotra.  To interact with the OS in python you will want to become familiar with the OS module  The command “import os” is used for this.
Getting Started with Unix Naomi Altman. Connecting to a PSU Linux Cluster click on SSH click on "quick connect" Host Name: lxcluster.tlt.psu.edu User.
How Web Servers and The Internet Work The Basic Process.
Web Services Essentials. What is a web service? web service: software functionality that can be invoked through the internet using common protocols like.
Chapter 7: Using Network Clients The Complete Guide To Linux System Administration.
(class #2) CLICK TO CONTINUE done by T Batchelor.
PHP – Hypertext Preprocessor.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
4.01 How Web Pages Work.
Tonga Institute of Higher Education IT 141: Information Systems
1. GSICS Collaboration Servers Synchronisation 2
Software and Multimedia
Software and Multimedia
Chapter 27 WWW and HTTP.
Tonga Institute of Higher Education IT 141: Information Systems
Configuring Internet-related services
PHP and Forms.
Python’s Standard library part I
Tonga Institute of Higher Education IT 141: Information Systems
Recitation on AdFisher
Protocol Application TCP/IP Layer Model
Bryan Burlingame 24 April 2019
4.01 How Web Pages Work.
PHP-II.
4.01 How Web Pages Work.
Presentation transcript:

Selecting and Combining Tools F. Duveau 02/03/12 F. Duveau 02/03/12 Chapter 14

- Regular expressions Your toolkit - Shell commands - Shell scripts - Python programs To search and replace To interact with your computer at the command line To combine / automate command-line operations For more advanced processing How to chose the convenient tools for a particular task? It depends on the kind and number of input data and the amount and frequency of work to perform

- Data from user input Type of input data - Data from the Internet - Data from other programs - Data from hardware raw-input() - Data from spreadsheets sys.argv[] curl() or wget() commands, urllib2 module Output of other programs can be captured using redirection (> or >>) or the pipe operator (|) Chapter 22 gives background on ho to interact directly with the physical world Not suitable to manage large and complex datasets

Gathering data from the Internet

Data from spreadsheets More suited to small one-off projects and simple record-keeping The first step is usually to resave the data in a basic text format (eg CSV) xlrd Python package to access spreadsheet file contents.xlsx files are compressed archives containing XML files

Parsing one set of text files

Parsing many text files

- Input data: Correction of the exercise - Output: html files obtained from the MWG website csv file containing names, sequences and order numbers STEP 1:Create a list of primer sequences and names from 1 file STEP 2:Extend this list to multiple files (fixed number of files) STEP 3:Retrieve the filenames from the order numbers in manage-order.hmtl Automatically loop over all orders STEP 4:Create the list of order numbers STEP 5:Write the output file

Collecting data from the Internet The idea is to adapt the script to find the information directly from webpages instead of manually downloading html files Problem: The website is secured with HTTPS protocol. Strategy to solve this problem (to be implemented in your Python script): 1) Change the default User Agent of Python to simulate a classical web browser 2) Use a Python module (eg. httplib) that can send HTTP orders to the server 3) Send the username and password to the authentication HTTPS address 4) The server should send a response allowing your program to access your primer data