Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Submitter's Guide to GenBank®: Using BankIt for Small-Scale Nucleotide Sequence Submissions Part 1 December 17, 2014 12/17/14.

Similar presentations


Presentation on theme: "A Submitter's Guide to GenBank®: Using BankIt for Small-Scale Nucleotide Sequence Submissions Part 1 December 17, 2014 12/17/14."— Presentation transcript:

1 A Submitter's Guide to GenBank®: Using BankIt for Small-Scale Nucleotide Sequence Submissions
Part 1 December 17, 2014 12/17/14

2 Webinar Logistics - 1 Closed captioning: and enter All content, including a video recording, will be available at If you are having technical difficulties, please use the question pod to enter your question 12/17/14

3 Webinar Logistics - 2 Introductions
Bonnie Maidak, Presenter & Webinar Co-creator Majda Valjavec-Gratian, Webinar Co-creator Both are part of the NCBI general Help Desk team who are trained to assist data submitters Michael Fetchko, GenBank Staff Member All can be reached via 12/17/14

4 Webinar Logistics - 3 Slides are available now at: ftp://ftp.ncbi.nlm.nih.gov/pub/education/public_webinars/2014/Dec17_BankIt/ Some links on the slides are abbreviated: Example: <ncbi>/genbank/submit/ To get the full link, replace <ncbi>/ by: The illustrations in this webinar were generated by using the screen images of the NCBI site or by combining and modifying the images obtained from the Office.com Clip Art (Royalty-free photos and illustrations) 12/17/14

5 Webinar Logistics - 4 If you have questions during the webinar, use the question pod Responses to all questions will be posted online Today, ‘TEST’ Submissions in BankIt – Do not click ‘Finish Submission’ 12/17/14

6 Webinar Goals After this webinar, you will be able to:
Recognize if your sequence data are applicable for submission by using BankIt Understand what you have to know about your sequence before you start your submission process: Sequence source information Biological function of the sequence Become familiar with GenBank submission terminology Understand and use the FASTA sequence format Understand steps and input forms in BankIt that allow you to complete your sequence submission 12/17/14

7 Outline Introduction Small-scale Submissions to GenBank®
Kinds of sequence data Submission tools The BankIt submission process: Before you start: What do you need to know about your sequences? Example submissions of two data sets Other submission considerations 12/17/14

8 Introduction 12/17/14

9 Data Submissions to NCBI
Epigenomics Data Gene Expression Data Assembled Nucleotide Sequences from Large Projects/ Small-Scale studies Traces from Capillary (Sanger) Sequencing Large Projects Registrations/ Descriptions Genomic Variants Large Projects Sample Registrations/ Descriptions Short Sequence Reads from Next Generation Sequencing (NGS) Human Clinical Data Genetic Tests Biomolecular data Manuscripts from NIH-funded authors Chemicals At NCBI, we receive, process, archive, and curate various types of data. 12/17/2014

10 Scope of this Webinar Epigenomics Gene Expression Capillary Traces
Assembled Nucleotide Sequence from Small-Scale studies Project Description Variation: small scale/large scale variants Sample Description Short Reads Human Clinical Data Genetic Tests Manuscripts from NIH-funded authors Chemicals In this seminar we will focus on the assembled nucleotide sequences from small-scale studies that are submitted to GenBank with a web-based tool BankIt. 12/17/14

11 Sequence Data Submissions to GenBank via BankIt Can Include:
Single or multiple sequences of: Complete (or partial) sequences > 200 nt long Protein coding genes Ribosomal RNA genes (16S, 5S, …) Internal transcribed spacers (ITS) Microsatellite markers (but NOT sequence tagged sites, STS) Complete viral or phage genomes Complete mitochondrial genomes Complete chloroplast or other plastids genomes 12/17/14

12 General Submission Info
About GenBank submissions:    <ncbi>/genbank/submit/ The kinds of data that will not be accepted by GenBank: <ncbi>/books/NBK53707/#gbankquickstart.what_kind_of_data_will_2 Requirements for GenBank Submissions through BankIt (lists sequence types <200 nt length which are accepted): <ncbi>/WebSub/html/requirements.html 12/17/14

13 Acceptable Technology for Submissions to GenBank
Sequences obtained by traditional (Sanger) sequencing Next generation sequences (NGS): NGS sequences must be assembled to be submitted to GenBank (no raw reads!) The underlying raw reads do not have to be deposited to SRA (Sequence Read Archive db) 12/17/14

14 What is BankIt and What You Need to Use It
BankIt is a web-based submission tool where all submission steps are completed on the web You need a computer with an internet connection and a web browser: Internet Explorer 9 or above Current versions of Firefox, Chrome, and Safari See Browser advice at: <ncbi>/guide/browsers/ 12/17/14

15 What is BankIt and What You Need to Use It - 2
NCBI account is required to sign in to BankIt. Establish your (free) account at: Access BankIt and sign in at: 12/17/14

16 BankIt Tool Alternatives
Alternative submission tools for GenBank submissions are: Sequin: downloadable software, most often used alternative ( ) 16S rRNA Submission Tool: a specialized tool available in the Submission Portal, only for submission of uncultured Bacterial or Archaeal 16S ribosomal RNA sequences ( ) Barcode Submission Tool: a specialized tool for submission of a 5’ region of the mitochondrial COI genes from animals ( ) tbl2asn: a command line software seldom used as an alternative, designed for different data sets External submission tools often do not work 12/17/14

17 Outline Introduction Small-scale Submissions to GenBank®
Kinds of sequence data Submission tools The BankIt submission process: Before you start: What do you need to know about your sequences? Example submissions of two data sets Other submission considerations 12/17/14

18 “Dear Sir/Madam I want to submit one of my gene sequence in gene bank. So can you please help me how to submit my gene's sequence in gene bank. I looking forward to you kind response. With Best Regards,” 12/17/14

19 Example 1: Single Sequence Data Submission
GCATGAGATTGGAAGCAGCAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCCACTCAACAGAGTGTTAAAACTTTAAATAAAGCAGGCATTTTCCCCGATCTGATTATTGCTAGAAGCTCTCAAGTATTAACAGATCAAATCAGAAAAAAAGTGGCAATGTTTTGTAATGTTGAGAGCACTTCTATTATTGATAATATTGATGTTTCTACTATTTATGAAATTCCTATATCTTTTTACAAACAAGGCGTACATGAGATTTTAAGTTCTAAGTTGAATATCAAGGTTGATCCAAAAATAGAAGAGCTTTCAAAGCTTGCAGGAGTTATAAAATCTAATTTTTTTGCGCCTAAAAAAATTATTAATATTGCCATTTGTGGTAAATATGCTGAACTTGATGATTCTTATGCGTCAATTCGAGAGTCTTTGGTTCATGTTGGAGCTAATTTGGACTTACTTATTAAAAGCACTCTAATTGATTCTAATGATTTAAATGAGAGTTATTTAAAAGAGTTTGACGGCATTATTGTTCCCGGCGGCTTTGGGGGGAAAGGATATGAGGGTAAAATTATTGCTATTAAATATGCTCGTGAAAATAACATTCCCTTTCT 12/17/14

20 How Would You Describe Your Sequence?
What gene(s) does it represent? Did you sequence the whole gene or just a part? Where did you get your sequence from: what organism did you sequence? Where, when, and how did you obtain your organism? Is it a genomic sequence or obtained from mRNA? Is it from the genome or a plasmid, plastid, or mitochondrion? 12/17/14

21 Sequence Source Information
-Where did you get your sequence from: what organism did you sequence? -Where, when, and how did you obtain your organism? He wants to know about your sequence source information. Most importantly, what organism is it coming from? 12/17/14

22 Sequence Source information
-The sequence is from the Borrelia garinii bacterium which causes lyme disease. -This particular Borrelia garinii is strain Fuji that I bought from a commercial culture collection. -The Fuji strain was originally isolated from a tick. 12/17/14

23 Molecule Type and Location
Is it a genomic sequence or obtained from mRNA? Is it from the genome or a plasmid, plastid, or mitochondrion? She wants to know about your molecule type and the location of the sequence. 12/17/14

24 Sequence Source information
The sequence is genomic DNA, not from a plasmid or plastid. 12/17/14

25 Biological Meaning of the Sequence
-What gene(s) does it represent? -Did you sequence the whole gene or just a part? He wants to know about the biological function of the sequence. 12/17/14

26 Sequence Source Information
-I sequenced the CTP synthase (pyrG) gene. It is a protein coding gene. As is typical in bacteria, all of the gene is the coding region, so this is an intronless gene. -However, I did not sequence the entire gene. Just the middle part. 12/17/14

27 Single Sequence Data Submission
GCATGAGATTGGAAGCAGCAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCCACTCAACAGAGTGTTAAAACTTTAAATAAAGCAGGCATTTTCCCCGATCTGATTATTGCTAGAAGCTCTCAAGTATTAACAGATCAAATCAGAAAAAAAGTGGCAATGTTTTGTAATGTTGAGAGCACTTCTATTATTGATAATATTGATGTTTCTACTATTTATGAAATTCCTATATCTTTTTACAAACAAGGCGTACATGAGATTTTAAGTTCTAAGTTGAATATCAAGGTTGATCCAAAAATAGAAGAGCTTTCAAAGCTTGCAGGAGTTATAAAATCTAATTTTTTTG… Partial genomic sequence of the CTP synthase (pyrG) gene from the bacterium Borrelia garinii Fuji strain. The bacterium was isolated from a tick. We have one more step to do before we are ready to start our BankIt submission: 12/17/14

28 FASTA Format for Nucleotide Sequence Data
In its simplest form: Begins with the “greater than” sign: “>” “>” allows software to detect the beginning of a new sequence Followed by a sequence identifier (‘seqID’) Temporary, will be replaced when submission is completely processed If seqID has meaning, this should be repeated in a data field Keep seqIDs as simple as possible Both together form a FASTA Definition line: >seq1 The FASTA Definition line is followed by the raw sequence data 12/17/14

29 Additional SeqID Formatting
Create short yet unique Sequence IDs (SeqID or local_ID) composed of letters and/or digits >Seq1 >Camellia_sinensis_TrEC_Canada_134293_marsh_soil Spaces not allowed within the SeqID Use hyphens or underscores to connect sections of the SeqID Okay NOT okay (too long) 12/17/14

30 Single Sequence Data Submission
GCATGAGATTGGAAGCAGCAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCCACTCAACAGAGTGTTAAAACTTTAAATAAAGCAGGCATTTTCCCCGATCTGATTATTGCTAGAAGCTCTCAAGTATTAACAGATCAAATCAGAAAAAAAGTGGCAATGTTTTGTAATGTTGAGAGCACTTCTATTATTGATAATATTGATGTTTCTACTATTTATGAAATTCCTATATCTTTTTACAAACAAGGCGTACATGAGATTTTAAGTTCTAAGTTGAATATCAAGGTTGATCCAAAAATAGAAGAGCTTTCAAAGCTTGCAGGAGTTATAAAATCTAATTTTTTTG… FASTA-formatted sequence ready for our BankIt submission 12/17/14

31 BankIt Access from NCBI Homepage
12/17/14

32 BankIt Access from NCBI Homepage - 2
12/17/14

33 12/17/14

34 Login Page 12/17/14

35 Live Demonstration of BankIt Web Input Forms with a Single Sequence
12/17/14

36 BankIt Submission Confirmation Page
12/17/14

37 Example 1: Summary FASTA formatted nucleotide sequence data important to use A single sequence can be submitted and annotated using BankIt web input forms The Source Modifiers step allows for providing sequence source information Select those modifiers that are pertinent to your sequence The Features page allows for annotating gene, coding regions and/or other features on the sequence 12/17/14

38 Example 2: Multiple Sequence Submission
GCATGAGATTGGAAGCAGCAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCCACTCAACAGAGTGTTAAAACTTTAAATAAAGCAGGCATTTTCCCCGATCTGATTATTGCTAGAAGCTCTCAAGTATTAACAGATCAAATCAGAAAAAAAGTGGCAATGTTTTGTAATGTTGAGAGCACTTCTATTATTGATAATATTGATGTTTCTACTATTTATGAAATTCCTATATCTTTTTACAAACAAGGCGTACATGAGATTTTAAGTTCTAAGTTGAATATCAAGGTTGATCCAAAAATAGAAGAGCTTTCAAAGCTTGCAGGAGTTATAAAATCTAATTTTTTTGCGCCTAAAAAAATTATTAATATTGCCATTTGTGGTAAATATGCTGAACTTGATGATTCTTATGCGTCAATTCGAGAGTCTTTGGTTCATGTTGGAGCTAATTTGGACTTACTTATTAAAAGCACTCTAATTGATTCTAATGATTTAAATGAGAGTTATTTAAAAGAGTTTGACGGCATTATTGTTCCCGGCGGCTTTGGGGGGAAAGGATATGAGGGTAAAATTATTGCTATTAAATATGCTCGTGAAAATAACATTCCCTTTCT TGGTGGAACCGTAGGAGATATGGAAAATATTTTATTTATTGAGGCGGTAAGACAAATAAGACAGGAGATTGGGAGTGGTAATATTTCTTTTATTCATTTAACATATGTGCCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCTACTCAACAAAGTGTTAAAACCTTAAATAAAGCAGGTATTTTCCCCGATTTAATTATTGCTAGAAGTTCACAAGTATTGACAGACCAAATCAGAAAAAAAGTGGCAATGTTTTGCAATGTTGAGAGCACTTCTATTATTGACAATGTTGATGTTTCTACTATTTATGAAATTCCTATATCTTTTTATAAGCAGGGTGTACATGAGATTTTAAGCTCTAAGTTAAATATTAAGGTTGATCCAAAAATAGAAGAGCTTTCAAAGCTTGTAGGAGTTATAAAATCTAATTTTTTTGTGCCTAAAAAAATTATTAATATTGCTATTTGTGGTAAATATGCTGAACTTGATGATTCTTATGCATCAATTAGAGAGTCTTTGGTTCATGTTGCAGCCCATTTGGATTTGCTTATTAAAAGCACTTTAATTGATTCTAATGATTTAAATGAGAGCTGTTTAAAAGAGTTTGACGGYATTATTGTTCCTGGCGGCTTTGGAGGCAAAGGATATGAAGGTAAAATTATGGCTATTAAATATGCTCGTGAGAATAATATTCCCTTTCT TATGGAAAATATTTTATTTATTGAGACAGTAAGGCAAATAAGACATGAGGTTGGAAGTGGTAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGTCGGAATTAATGAGCAAAAATCTAAACCCACTCAACAAAGTGTTAAAACTTTAAATAAAGCAGGCATTTTCCCTGATTTGATTATTGCTAGAAGTTCTCAAGTATTGACAGATCAAATCAGAAAAAAAATAGCAATGTTTTGTAATGTTGAGAGTACTTCTATTATTGATAATATTGATGTTTCTACTATTTATGAAATTCCCATATCTTTTTACAAACAAGGTGTGCATGAGATTTTAAGTTCTAAGTTGAATATTAAGGTTGACCCAAAAATAGAAGGGCTTTCAAGGCTTGTAGGGATTATAAAATCTAATTTTTTTGTGCCTAAAAAAATTATTAATATTGCTATTTGTGGTAAGTATGCTGAACTTGATGATTCTTATGCGTCAATTAGAGAGTCTTTGGTTCATGTTGCAGCTAATTTGGATTTACTTATTAAAAGCACTCTCATTGATTCTAATGATTTAAATGAAAGCTGTTTGAAGGATTTTGACGGTATTATTGTTCCCGGCGGCTTTGGGGGTAAAGGATATGAGGGCAAGATTATTGCTATTAAATATGCTCGTGAGAATAATATTCCCTTTCT Sequence 1 Sequence 2 Sequence 3 You have three sequences to submit … 12/17/14

39 How Would You Describe Your Sequence?
What gene(s) does it represent? Did you sequence the whole gene or just a part? Where did you get your sequence from: what organism did you sequence? Where, when, and how did you obtain your organism? Is it a genomic sequence or obtained from mRNA? Is it from the genome or a plasmid, plastid, or mitochondrion? 12/17/14

40 Description of the Three Sequences
FEATURE ANNOTATION (gene, CDS) ORGANISM and SOURCE MODIFIERS (strain, isolation source) Molecule type? Location? These sequences are from three different species of Borellia: one obtained from a tick, two from human. They are genomic DNA sequences; all three represent a partial pyrG gene sequence. -Borrelia garinii -Strain=Fuji -isolation_source= tick. CTP synthase (pyrG) gene Partial CDS Genomic DNA -Borrelia burgdorferi -Strain=297 -Isolation source= human cerebrospinal fluid. CTP synthase (pyrG) gene Partial CDS Genomic DNA CTP synthase (pyrG) gene Partial CDS -Borrelia afzelii -Strain=BO23 -Isolation source= human skin Genomic DNA 12/17/14

41 Multiple Sequence Submission
GCATGAGATTGGAAGCAGCAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCCACTCAACAGAGTGTTAAAACTTTAAATAAAGC… >Seq2 TGGTGGAACCGTAGGAGATATGGAAAATATTTTATTTATTGAGGCGGTAAGACAAATAAGACAGGAGATTGGGAGTGGTAATATTTCTTTTATTCATTTAACATATGTGCCAAGT… >Seq3 TATGGAAAATATTTTATTTATTGAGACAGTAAGGCAAATAAGACATGAGGTTGGAAGTGGTAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGTCGGAATTAATGA… Sequence 1: Borrelia garinii Strain=Fuji Isolation source=tick Genomic DNA, partial pyrG Sequence 2: Borrelia burgdorferi Strain=297 Isolation source=human cerebrospinal fluid Genomic DNA, partial pyrG Sequence 3: Borrelia afzelii Strain=BO23 Isolation source=human skin Genomic DNA, partial pyrG 12/17/14

42 More FASTA Definition Line Formatting - 1
Enter a single space after each SeqID, and then enter the organism in square brackets, [organism=xxx] One or more Source Modifiers can be added within square brackets, for example [clone=6b] End the Definition line with free text of gene and other feature description 12/17/14

43 More FASTA Definition Line Formatting - 2
Information outside the field delimiting characters ‘[]’ (‘[organism=Carpodacus mexicanus]’ for instance), is not parsed by the submission tools into the applicable backend database fields. The final Definition line is created when GenBank Staff does the final review and processing of the record, just before adding it to the public sequence database. FASTA formatted files are always in plain txt (*.txt) format, not MS-Word or other format. 12/17/14

44 FASTA Format for Nucleotide Sequence Data
More information about the Definition line is here: <ncbi>/WebSub/html/help/fasta.html <ncbi>/Sequin/QuickGuide/sequin.htm#DefinitionLine 12/17/14

45 New Definition Lines Sequence 1: Borrelia garinii Strain=Fuji Isolation source=tick partial pyrG CDS >Seq1 [organism=Borrelia garinii] [strain=Fuji] [isolation_source=tick] CTP synthase (pyrG) gene, partial coding sequence >Seq2 [organism=Borrelia burgdorferi] [strain=297] [isolation_source=human cerebrospinal fluid] synthase (pyrG) gene, partial coding sequence >Seq3 [organism=Borrelia afzelii] [strain=BO23] [isolation_source=human skin] CTP synthase (pyrG) gene, partial coding sequence Sequence 2: Borrelia burgdorferi Strain=297 Isolation source=human cerebrospinal fluid partial pyrG CDS Sequence 3: Borrelia afzelii Strain=BO23 Isolation source=human skin partial pyrG CDS Sequence information is now formatted into FASTA definition lines. Let’s add them to the sequences… 12/17/14

46 FASTA Formatted Data >Seq1 [organism=Borrelia garinii] [strain=Fuji] [isolation_source=tick] CTP synthase (pyrG) gene, partial coding sequence GCATGAGATTGGAAGCAGCAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCCACTCAACAGAGTGTTAAAACTTTAAATAAAGCAGGCATTTTCCCCGATCTGATTATTGCTAGAAG… >Seq2 [organism=Borrelia burgdorferi] [strain=297] [isolation_source=human cerebrospinal fluid] synthase (pyrG) gene, partial coding sequence TGGTGGAACCGTAGGAGATATGGAAAATATTTTATTTATTGAGGCGGTAAGACAAATAAGACAGGAGATTGGGAGTGGTAATATTTCTTTTATTCATTTAACATATGTGCCAAGTCCAGCTGGAATTAATGAGCAAAAATCTAAACCTAC… >Seq3 [organism=Borrelia afzelii] [strain=BO23] [isolation_source=human skin] CTP synthase (pyrG) gene, partial coding sequence TATGGAAAATATTTTATTTATTGAGACAGTAAGGCAAATAAGACATGAGGTTGGAAGTGGTAATATTGCTTTTATTCATTTAACCTATGTACCAAGTCCAGTCGGAATTAATGAGCAAAAATCTAAACCCACTCAACAA… 12/17/14

47 Live Demonstration of BankIt Web Input Forms with Multiple FASTA-formatted Sequences
12/17/14

48 Example 2: Summary Several sequences can be submitted and annotated using BankIt web input forms when they require simple annotation. For example, a single (same) gene for all sequences. The Source Modifiers can be included in the FASTA Definition line. For multiple sequences, using the FASTA Definition line makes adding Source Modifier information easier. 5’ partial coding sequences (CDS) can be easily annotated with the web input forms if all translate in the same reading frame (have the same codon_start). 12/17/14

49 Preparing Input Files A plain text (.txt) file containing sequences in the FASTA format A Source Modifier file can also be prepared and imported into BankIt A Feature Table file containing features for each sequence Use the same SeqIDs in the FASTA file and in the other auxiliary data input files 12/17/14

50 More BankIt Submission Info – Unknown Bases
Use the letter ‘N‘ to indicate an unknown base or a series of Ns to indicate a few unknown bases: (…TGGTNGGAANNCCCATTGCCAAT…) Do not add Ns to extend your sequence at the ends: (…TGGTNGGAANNCCCATTGCCAATNNNNNN) BankIt does not accept '-' characters in the FASTA sequence 12/17/14

51 More BankIt Submission Info – Sequencing Gaps
Use a string of ‘N’s to indicate sequencing gaps (…TGGTGGAANNNNNNNNNNNNNNNNCCCA…) insert the exact number of Ns when gap length is known insert 100 Ns when the gap length is unknown BankIt will accept submissions where 'n' is used, as long as the number of 'n's in the entire sequence is less than 50% of the total Each gap needs to be identified with a miscellaneous feature (‘misc_feature’) 12/17/14

52 More BankIt Submission Info – Sequencing Gaps - 2
The Gapped Sequences Format allows you to define the gap in the FASTA file: >Seq1 TGGTGGAA >?unk12 CCCA >?unk100 CCTGTGTGT Examples in: <ncbi>/Sequin/QuickGuide/sequin.htm#GappedSequences <ncbi>/Sequin/sequin.hlp.html#FASTAFormatforGappedSe quence 12/17/14

53 More BankIt Submission Info – Alignments
BankIt does NOT accept alignment formats Use Sequin for alignment formats such as: FASTA+GAP PHYLIP NEXUS 12/17/14

54 Additional BankIt Submission Info
If you have questions while creating the BankIt submission, do not click on the ‘Finish Submission’ button on the last ‘Review and Correct’ page. Send a message to with your question and the BankIt submission number. Once the ‘Finish Submission’ button is clicked and the submission is in the 'Completed' state, it cannot be edited. A new BankIt submission must be done. 12/17/14

55 For More Information Webinars:
<ncbi>/education/webinars/ General NCBI updates (will include submission/tools updates): <ncbi>/news/announcements-list/ NCBI News: <ncbi>/news/ NCBI Insights Blog: 12/17/14

56 Outline Introduction Small-scale Submissions to GenBank®
Kinds of sequence data Submission tools The BankIt submission process: Before you start: What do you need to know about your sequences? Example submissions of two data sets Other submission considerations 12/17/14

57 Webinar Goals After this webinar, you will be able to:
Recognize if your sequence data are applicable for submission by using BankIt Understand what you have to know about your sequence before you start your submission process: Sequence source information Biological function of the sequence Become familiar with GenBank submission terminology Understand and use the FASTA sequence format Understand steps and input forms in BankIt that allow you to complete your sequence submission 12/17/14

58 Next BankIt Webinar Wednesday, January 7, 2015, at 1-2 PM EST
Multiple Features within a Sequence Feature Table file format BankIt submission for two data sets 12/17/14

59 Provide feedback to webinars@ncbi
12/17/14

60 Questions? 12/17/14

61 A Submitter's Guide to GenBank®: Using BankIt for Small-Scale Nucleotide Sequence Submissions
Part 1 December 17, 2014 12/17/14


Download ppt "A Submitter's Guide to GenBank®: Using BankIt for Small-Scale Nucleotide Sequence Submissions Part 1 December 17, 2014 12/17/14."

Similar presentations


Ads by Google