Presentation on theme: "Advanced Workshops on Vector NTI Part I – Managing Vector NTI Database"— Presentation transcript:
1Advanced Workshops on Vector NTI Part I – Managing Vector NTI Database Yi-Bu Chen, Ph.D.Ansuman Chattopadhyay, Ph.D.Health Sciences Library SystemMolecular Biology and Genetics Information ServiceUniversity of Pittsburgh
2Overview of the Workshop Navigate the Vector NTI Database ExplorerMenu and Toolbar CommentsCreate New Database SubsetsEdit or add new user-fields to database objectsSearch and report database objectsBackup and archive databasesImport, export, migrate databasesHands on Exercises
4Navigate the Database Explorer: Tables DNA/RNA Molecule Table – Stores DNA and RNA Molecules, including constructed and designed molecules.Protein Molecules Table – Stores Protein Molecules.Enzymes Table – Stores restriction enzymes.Oligos Table – Stores oligonucleotides up to 1000 bases in length.Gel Markers Table – Stores gel markers that can be used in Vector NTI’s simulated electrophoresis.Citations Table – Stores citations retrieved through the Vector NTI PubMed Search engine, from the NCBI website or from other searches. It provides complete bibliographic conversion features.BLAST Results – Stores independent of molecules in a BLAST Results database.Analysis Results – Stores results from various analysis, such as PCR analysis, PFAM analyses, etc.
5Navigate the Database Explorer: Panes and View Options Subset Pane – Displays the subsets for the selected table.Database Objects Pane – Displays the contents of a subset.View Options – Choose from “large icons, small icons, list and details”.☺MENU > View > ; OR Right click on mouse > Select the desired optionArrange and Sort Database Objects – Arrange command designates a column to be sorted; Sort command allows either ascending or descending.☺ MENU > Arrange; OR right click on mouse > Arrange Icons
6☺OR Menu|View > Options > “Details View Tab” Navigate the Database Explorer: More View OptionsEdit and Arrange Column List – Customize the columns and their layout in the Database Objects Pane.☺Move mouse cursor on any column name > Right click on mouse > Columns > Open the Options dialog window > customize the column selections and layout.☺OR Menu|View > Options > “Details View Tab”Note: The Options dialog window also allows configuration of default settings for a few other commands.
7Menu, Toolbar Commands and Selections Menu/Toolbars – Notice the switch between the panes.Popular Toolbar-- Create a new subset-- Remove a subset without affecting the database objects.-- Search database objects.-- Rename a subset or database object.Selecting multiple database objects☺ SHIFT + CLICK to select all objects within a range☺ CTRL + CLICK to select non-contiguous objects in a list☺ “Wire-frame” selection (press the mouse button somewhere outside item names/icons and drag the selection rectangle)
8Creating A New Database Subset Select Table > New > Subset OR click the New Subset button to create a new (empty) subset with a default name, Group 1. The default name can be changed later.Select Table > New > Subset (using Subset Editor) to create a new subset and select its contents in the Edit Subset dialog window.To create a new subset and place objects in it at the same time:> Select objects for the new subset in the Database Objects Pane > click the New Subset button > use the default name or enter a new name.
9Adding or Removing Objects from A Database Subset The simple way of adding Open the source subset in the Subset Pane > Select and drag selected objects from the list to the new subset.Add or remove objects from a subsetSelect the subset > Menu|Table > Edit Subset > Make selections in the Editing Subset dialog window.
10Editing A Database Subset Dismiss a subsetSelect the subset > Menu|Table > Dismiss Subset, OR click the Dismiss Subset button. This operation removes a selected subset, but DOES NOT affect the database objects.Clear a subsetSelect the subset > Menu|Table > Table > Clear Subset to exclude all objects from the selected subset. The subset itself is emptied but not removed. This operation DOES NOT affect database objects.Delete content of a subsetSelect the subset > Menu|Table > Delete Contents from Database to physically delete from the database all objects of the specified subset. The objects in the subset are permanently deleted from the Vector NTI database, and the subset itself is removed.Rename a subsetSelect the subset > Menu|Table > Rename Subset OR click the Rename button.OR: Mouse click right buttonAll above operations can be done by clicking the right mouse button while the desired subset is selected > then choose desire actions from the menu.
11Editing a Subset Based on the Existing Subsets In the Subset Pane, select the subset > Menu|Table|Edit Subset > In the Editing subset dialog box, select all desired source subsets in the Source Subsets column > Configure criteria (Intersection, Difference, Union) for the desired Source Content > Select the desired molecules in the Source Content column > Click Add button to add desired molecules to the Resulting Content column, which represents the subset being edited. > Click OK button to complete the editing.Intersection – Only molecules presented in ALL selected Source Subsets are displayed in the Source Content column.Difference – Only molecules presented in ONE of the selected Source Subsets are displayed.Union – ALL molecules in the selected Source Subsets are displayed.
12Import Molecules into a Database Subset Drag and Drop ImportIn Windows Explorer, select molecule(s) from text files/or a folder containing the files > Drag and drop the selections onto the desired subset in Vector NTI Explorer.(Exercise: drag and drop nm_ gb and nm_ fasta.txt into a subset from the “Vector NTI workshop” folder on the desktop.)Import using menu options > Select Menu|Table > Import, select one of the following options:1. Import Molecule From Text File – imports DNA/RNA/protein molecule data.Accepted format: GenBank/GenPept, EMBL/SWISS-PROT, and FASTA files.2. Import Sequence From Text File – imports a nucleotide or amino acid sequence text file, creating a new DNA/RNA/protein molecule with the sequence.Accepted format: ASCII format text files.3. Import Molecules From Archive – imports objects from a Vector NTI archive.Accepted format: Vector NTI archive .mr format.4. Import Molecules From Directory of Text Files (molecules and oligos only)Accepted format: GenBank (DNA/RNA), GenPept (protein) or Vector NTI’s oligotext format (oligonucleotides).
13Batch Import Molecules into a Database Subset 1. Create a text file containing multiple moleculesUse Notepad or MS Word to create a file contains multiple sequences from your source; make sure that each sequence is separated by blank lines; make sure all sequences are in the same format (anyone of the accepted format); save the sequence file as a .txt fileAccepted format: GenBank/GenPept, EMBL/SWISS-PROT, and FASTA files.2. Import Multiple Sequences From the Text File►In Vector NTI Database Explorer, select the appropriate Table►Create a new subset to store the molecules to be imported►Open Windows Explorer, locate the file containing the multiple sequences; > Drag and drop the file into the newly created subset in Vector NTI Explorer►The multiple sequences in the sequence file will be converted to multiple individual molecule file in the selected subset, with each molecule file corresponds to each sequence in the original sequence file.(Exercise: drag and drop 2 genes-fasta.txt and 2 genes-genbank format.txt into a subset from the “Vector NTI workshop” folder on the desktop.)
14Export a Subset Export subset into a Vector NTI Archive Select Menu|Table > Export > Subset into Archive to export all objects of the specified subset to a VNTI archive > In the Write To dialog box, indicate the directory or folder, and file name for archive storage > Click the Save button.Export All subset objects into a directory of text files (molecules and oligos only)Select Menu|Table > Export > Subset to Directory of Text Files to export all objects of the specified subset into a directory of text files in GenBank (DNA/RNA molecules), GenPept (protein molecules) or Vector NTI’s oligo text format (oligonucleotides) > In the dialog box, browse for the location and enter the file name under which the subset is to be stored.
15Important Operations on Database Objects Most of the commands below are applicable to all database objects, and many can be accessed from the short cut menu or the toolbars (OR right mouse click while pointing at a database object).1. Open – opens selected objects; available only for DNA/RNA/protein molecules.2. Edit – modify user fields, comments, keywords, etc. for an object selected in the Database Objects Pane. This command is enabled only with a single selected object. You can also use the Edit button.3. Exclude from a subset – removes the selected objects from current subset. The objects are still stored in the database and are included in all other subsets where they were included before this operation.4. Delete from the database – physically deletes the selected object from the entire database. If a DNA molecule is deleted, Vector NTI scans all its descendants and disconnects them from the deleted molecule.5. Export – allows to export molecules or sequence to text files (for DNA/RNA/protein objects); or export any selected objects to an archive.6. Report – Produce a report for selected DNA/RNA/protein molecules. Very useful for lab documentation.
16Basic Steps for Database Search The Database Explorer allows sophisticated search by different criteria to locate any object in the database.1. Select a desired database table – Any searches are confined within a specific type of database objects, one must specify a database type before the search.2. Select filters – Menu|Database > Search OR Click the Search button > in the Database Search Dialog Box, check the boxes for desired filters.
17Basic Steps for Database Search—Continued 3. Define filters – Open the Filter Setup Dialog Box for desired filters and further configure the filters. When multiple filters are checked, a search that satisfies all the search criteria (an AND search) will be performed. Check ONLY the filter(s) that you want to include into the search.4. Start search – Inspect the final configuration of the Search Database Dialog Box > Click Start Search button.5. Save the search results – when the search is finished, the Insert Search Results into Subset dialog box will appear > select an existing subset and Vector NTI will remove all existing contents in that subset before placing the search results into it; OR Enter a new name for the subset and all the search results will be saved to this new subset.
18Configure the Database Search Filters — the Attribute Filters for DNA/RNA/Protein Molecules
19Configure the Database Search Filters — the Text Filter The Text Filter enables text search from selected field(s).Users can use the Text Filter to search for the database objects containing desired author names, organisms names, and other textual information.Select only the fields where the desired text should occur.Example:Search all molecules created by yibu:Text filter > check Author field > type “yibu” in “New substring” field > Click Add button > Ok buttonStart Search > save the search results in a new subset yibu.
20Configure the Database Search Filters — the Keyword Filter The Keyword Filter enables text search ONLY in the Keyword field.Keywords can be entered by editing the existing objects, or when a new object is created.Keywords search are very helpful for locating specific objects in a large database. Always enter keywords when you create a new object so you can find it easily later on.Popular keywords: user’s last name, organism name, gene name, project name, etc.
21Configure the Database Search Filters — the Ancestors Filter The Ancestors Filter allows user to find molecules that have specified ancestor(s) molecules.The Ancestors Filter is useful for finding constructed or designed molecules.
22Configure the Database Search Filters — the Oligo & Peptides Filter The Oligo & Peptides Filter allows user to find molecules that are similar to the sequences of the specified Oligo/peptides.Click the Add Oligo or Add Peptide button > in the subsequent dialog box, enter the oligos from the existing oligo database by clicking the Oligo Database, OR enter new oligo or peptide by clicking the Add New… button.Select the desired genetic code to be used for the search from the Genetic Code drop-down box. Automatic translation/back translation is carried out whenever needed.
23Configure the Database Search Filters — the Features Filter The Features Filter allows user to find molecules with specific features.Click the <Add… button > in the subsequent dialog box, select desired feature type and then one or multiple features within the group > Click the OK button to complete the filter setup.Feature types differ for DNA/RNA searches and protein searches.
24Configure the Enzyme Database Search 1. Recognition String Filter – Enter a nucleotide string that must be found in the recognition site of the desired enzyme.2. Attributes Filter – Specify enzyme attributes necessary for the search. “Ambiguous” recognition site – the site containing any nucleotide besides A, T, G, or C.3. Text Filter – Filter search results based on text in various object fields.4. Keyword Filter – Filter search results based on keywords.
25Configure the Oligo Database Search 1. Nucleotide String Filter – Enter a Nucleotide String filter that must be found in thesequence of the desired enzyme and select the strand to search, direct or complementary.2. Attributes Filter – Check to limit the search space to gel markers with specified length of the longest fragment. If all are checked or unchecked, the filter has no effect.3. Text Filter – Filter search results based on text inside various object fields.4. Keyword Filter – Filter search results based on keywords.
26Configure the Gel Marker Database Search Gel Markers are sets of fragments of known sizes. They are used as standards for measuring migration speed in Vector NTI simulated electrophoresis.1. Attributes Filter – Check to limit the search space to gel markers with specified length of the longest fragment. If all are checked or unchecked, the filter has no effect..2. Text Filter – Filter search results based on text inside various object fields.3. Keyword Filter – Filter search results based on keywords.
27Configure the Citation Database Search The Citation Database Search allows users to search citations by Citation, Text, and/or Keywords.
28Using the Vector NTI Citation Database Users can use the Citation Table to generate references list for a manuscript just like what they can do with reference software, such as Endnote.Formatting Bibliographic References embedded in a document1. Tag the reference – In Citation Table > Select desired reference > Menu|Citation|Copy Tag > Paste the tag in your document opened with a word processor. Save your document as a Rich Text Format file when done.2. Generate the references – In Citation Table > Menu|Citation|Format Manuscript > in the Format manuscript dialog box, click Browse button to select the RTF document > in the Format reference for drop down menu, select a desired journal name > Click Start button > the tagged the document will be formatted with proper citations and bibliographies.
29Using the Vector NTI Citation Database -- continued Users can also use the Citation Table to copy selected references in desired journal format for later use.Copy Selected References1. Select desired references – In Citation Table > Select desired references with Ctrl + left mouse click2. Generate the references – In Citation Table > Menu|Citation|Copy Bibliography > In the Copy Bibliography dialog box, select the journal whose formatting the bibliography should follow > click OK button.3. Save the references – Open your word processing document and paste the formatted citations.
30Vector NTI Database Maintenance 1. Database BackupBacking up your database is extremely important. It should be performed at least every month, but a weekly backup is recommended. Backups should be stored on both the hard drive and a removable media, such as a CD.– Menu|Database > Database Backup > in the Choose Backup Directory dialog box, select an existing folder OR create a new folder to store the backup.Note: Vector NTI Database Backups are NOT cross-platform compatible (i.e. the Vector NTI Database on a PC cannot be restored using a VNTI Database Backup made from Vector NTI on a Macintosh, and vice versa. For cross-platform backup/restore, users must use VNTI Archive command with each subset (Export and Import command).2. Database RestoreThis operation restores ALL database files from a specified backup.– Menu|Database > Database Restore > in the Choose Backup Directory dialog box, select the directory where the last backup was stored > the database will be restored.Warning: the restored database will overwrite all current database files.3. Database CleanupThis operation removes accumulated “junk” data. It may also be used to recover lost or damaged files or tables. The cleanup should be performed on monthly basis.– Menu|Database > Database Cleanup.
31Vector NTI Database Migration Strategies 1. Moving/copying entire database from PC to PCFrom Version 9.0 or later to Version 10> Copy the entire database folder in the source PC ( MB) > Move the source folder to the hard drive of the target PC > Open ONLY the Vector NTI Database Explorer on the target PC > set the local database by Menu|Database|Select local database > navigate to locate the database folder > the Vector NTI Database Explorer will restart to take the newly selected database as the default database.Note: this is the BEST strategy as it moves everything (data and program settings) without too much efforts.From Version older than 9.0 – same as the strategy used for moving between PC and MAC as described below.2. Moving/copying entire database between PC and MAC> Archive each subset of each database table from the source computer > Import each archived file using Vector NTI Database Explorer on the target computer.Note: This strategy requires some efforts, but it ensures that all subsets will be correctly moved between different platform or between old and new version of Vector NTI.3. Copy entire database using the database backup function or Vector NTI Database Migration UtilityThese two methods are NOT recommended as they only move data (molecules, enzymes etc) but not subset information.
32Vector NTI Database Management Workshop Exercise 1 – Edit A Molecule Object Modify a DNA molecule CREB1 by adding your last name to the keyword field, and generate a report about this molecule.The answer keySelect the DNA/RNA Molecule Table > Sort the name column to locate CREB1 molecule > click right mouse button > Edit.In the Edit CREB1 dialog box > browse all the tabs > under the keywords tab, enter your last name and click Add. Note the Author change for CREB1.Click right mouse button on CREB1 > Report|General > In Select Report Type dialog box, select New Report > In Molecular Display Setup dialog box, select Default > Browse the options in the Set General Report Options > Browse the feature options in the Set Feature Map Options dialog box > Click Finish button > Print the report, or Click Camera button > in the Camera dialog box, make desired selection and copy-to destination. The contents can be pasted to a word processor.
33Vector NTI Database Management Workshop Exercise 2 – Create a new subset Create a new subset (subset name: “your last name-kinase”) which consists protein kinase molecules with a size between amino acid residues.The answer keySelect the Protein Molecule Table > Click on the Search Database Button.Configure the search criteria: Check the Text Filter > Set up the search for “kinase” in the Description Field > Check the Attribute Filter > Check the size “500 aa aa” > Click Start Search ButtonSave the search results to a new subset: In the Insert search results into subset dialog box, enter the new subset name > Click Ok button.
34Vector NTI Database Management Workshop Exercise 3 – Search and Create a new subset Search the existing protein molecules database to find all Transmembrane proteins and store them in a new subsetThe answer keySelect the Protein Molecule Table > Click on the Search Database Button.Configure the search criteria: Check the Feature Filter > Click Add > In the Add Features dialog Window, locate the “Transmembrane” feature under the Protein Functional Features > Domains> Check Add all features of this type > Click OK > Click OK > Click Start SearchSave the search results to a new subset: In the Insert search results into subset dialog box, enter the new subset name > Click Ok.
35Vector NTI Database Management Workshop Exercise 4 – Archive/Export a subset Archive and export the newly created subset.The answer key1. Export a Subset:Select the newly created subset > Menu|Table|Export|Subset into a directory of text file > in Export Protein To dialog box, create a new folder on the desktop and enter the subset name as the file name, click OK > all molecules in the new subset are now exported into the directory.2. Archive a Subset:Select the newly created subset > Menu|Table|Export|Subset into Archive > in Write Proteins To dialog box, enter the subset name as the archive file name and select the desktop as the destination, click Save button > a single archive file is created.3. Delete the newly created subset: > Select the subset in the Subset Pane > Table|Delete Contents from Database > Table|Dismiss Subset
36Vector NTI Database Management Workshop Exercise 5 – Import Molecules Import protein molecules from the archive file and the directory of text files created from the previous exercise. The imported molecules are to be stored in new subsets (name your own).The answer key1. Import molecules from a directory of text files:Select the proper database table, Menu|Table|Protein Molecules > Menu|Table|Import|Molecules from Directory of Text Files > in the Import Proteins dialog box, locate the folder, and select the protein list file (.pls file) > click Open > in the Insert Protein Molecules into Subset dialog box, enter the name for the new subset > click OK, and all the molecules in the directory are now successfully imported into the new subset.2. Import molecules from an archive file:Select the proper database table, Menu|Table|Protein Molecules > Menu|Table|Import|Molecules from Archive > in the Import Protein Archive From dialog box, locate the protein archive file (.pa4 file), click Open > in the Insert Protein Molecules into Subset dialog box, enter the name for the new subset > click OK, and all the molecules in the archive file are successfully imported into the new subset.
37Vector NTI Database Management Workshop Exercise 6 – Database Backup and Restore Backup the entire local database and then restore it.The answer key1. Backup the database:Menu|Database|Database Backup > in the Choose a Backup Directory dialog box, select a desired drive (hard drive or a removable drive), and click New Dir button to create a new directory for the database backup > click OK > the entire database is now backed up.2. Restore the database from a backup:Menu|Database|Database Restore > in the Choose a Backup Directory dialog box, navigate to locate the backup directory, click OK > click Ok to overwrite all current database files > click Ok to the “Are You Sure?” question > click Yes to replace the existing address book > the entire database is now restored.